Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

output length of tandem repeats #14

Open
jorondo1 opened this issue Aug 2, 2022 · 4 comments
Open

output length of tandem repeats #14

jorondo1 opened this issue Aug 2, 2022 · 4 comments

Comments

@jorondo1
Copy link

jorondo1 commented Aug 2, 2022

Hi, I am trying to figure out how to estimate the total length of tandem repeats found in a sequence and I can't figure out which metric to use. I have several genomes I would like to analyse, I would be interested in how to generate this data from TRF output, do you have any recommendation ?

@bricoletc
Copy link

bricoletc commented Sep 20, 2022

(Not a maintainer of this project but) what exactly do you mean by 'total length'? Sum of Pattern size * copy-number for all repeats identified in a sequence? Or all locations in a sequence covered by at least one tandem repeat?

@jorondo1
Copy link
Author

I want to find out what % of the genome is tandem repeats. Would Pattern size * copy-number / genome length be the way to go?

@bricoletc
Copy link

Not quite- you'd need to account for positionally-overlapping repeats. I guess ideal would be a BED output describing locations of all the tandem repeats, followed by bedtools genomecov for example.

Bed output already requested in #12. I'd actually be happy to propose an implementation, but it seems like maintainers are not responsive overall - @yzhernand @GaryEBenson ?

@tomsauv
Copy link

tomsauv commented Nov 6, 2023

Hi,
This is couple years old now but In case this is still of any help, you may use the following script to get total length of detected tandems.

https://github.com/tomsauv/TROP-Tandem-Repeat-Overlap-Parser

The script merge the coordinates of overlapping tandem to get total length.

Hope this helps

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants