Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1088 |
Symbol | |
ID | 8415378 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 1314539 |
End bp | 1316179 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645024051 |
Product | transcription termination factor Rho |
Protein accession | YP_003181448 |
Protein GI | 257790842 |
COG category | [K] Transcription |
COG ID | [COG1158] Transcription termination factor |
TIGRFAM ID | [TIGR00767] transcription termination factor Rho |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00846053 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.0000000000000541512 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCGAAA CCGAAGCAAC CACGGAGGCT CCTGCGGCCG CTCCCGTCAA GCGACGTTCC ACCAGCGTGA AGGCGAAGAC GCAGGGCAAG GCCGAGGGCG CAGCCACTCC GGGCCGCAAC TCGTCGGCCA CGCGCACGCC GCGCAAATCG GTGAAGGCGG GCGGCGCGCA GCAGCAACAG CCGCAGGGCG GTTCGAACAA CCGCCGCCAG AACGCCTCGA ACGCCGGCAA GCAGAACAAC GGCGGCAGCA ATCGCCAGAA GCAGGGCGGC GGCCAGAACC GGCAGAACAA CAGCCAGAAC CAGCGTCGTC AGCGTCGCCA GCACGACAAC AACCGCGGCA ACCGCGAAGT GCAGCCCAGC GTGTCGCGCG AGGAGCTGGC GAAGCTCAAG GTGGCCGAGC TGCGCGAGAA GGCCGCCGAG CTCAACCTCG ACGTGACGGG CCTCAAGAAG GCCGAGCTGG TGGAGGCCGT GTTCGAAGCC AGTGTCAAGG CCGAGGGCTT CATCGAGGTG TCGGGCATCC TCGACATCCT GGCCGACGGC TACGGCTTCC TCCGCACGCA GGGCTACCTG CCCAGCGAGA CGGATTGCTA CGTGGGTCTT TCCACCATCC GCCGCAACGG CCTGCGCAAG GGCGATCTCG TGTCCGGTCA GACGCGCCCG GCGCGCGAGA ACGAGAAGTA CGCCGCCATC CAGAAGGTCA CGGCCGTCAA CGGCACGCCC GTCGAGGAGC TGGGCAGCCG CGTGCGCTTC GGCGACCTCA CGCCGGTCTA CCCGGACGAA TGCCTCGTCA TGGAGCACGG CAAGAACACC GTCACCGCCC GCGTCATCGA CCTCGTGTCG CCCATCGGCA AGGGCCAGCG CGGACTGATC GTCAGCCCCC CGAAGGCGGG CAAGACCACC ATCCTCAAGG ACATCGCCGC CGCCATCAGC GCGAACAACC CCGAAGTGCA CCTCATGTGC CTGCTCGTGG ACGAGCGTCC CGAGGAAGTC ACCGACATGG AGCGCTCCAT CAAGGGCGAG GTCATCTCCT CCACGTTCGA TATGCCCACC GAGAACCACA TCGCCGTGTC CGAGCTGGTC ATCGAGCGCG CGAAGCGCCT CGTGGAATGC GGCAAGGACG TTGTCGTGCT GCTGGACTCG CTCACGCGCC TCGCGCGCGC CTACAACCTG GCGCAGCCGG CATCGGGCCG CATCCTGTCG GGCGGCGTGG ACTCCACGGC GCTCTACCCG CCGAAGCGAT TCCTGGGCGC TGCGCGCAAC ATCGAGCACG GCGGTAGCCT GACCATCCTG GCCTCAGCCC TGGTGGACAC GGGCTCGAAG ATGGACGAGG TCATCTTCGA GGAGTTCAAG GGCACCGGCA ACATGGAGCT CAAGCTGGAT CGCAACCTGG CCGACCGCCG CATCTTCCCG GCCATCGACC CGGTGGCGTC GGGCACCCGC AAGGAGGATC TGCTGCTGGA GCCGCAGGAG GCGCCGCTCA TCTGGGCCGT GCGCCGCATC CTCGCGAACA CGAACAGCAC CGAACGCGCC ATGGACATGC TGATCAAGTC GCTCAAGCAG ACCGAGACGA ACCAGGAGTT CCTCGTGCGC ACGGCGAAGA AGGCCCAGCA CACAAAGGCC GACGGCTCGC TGGAGTTCTA G
|
Protein sequence | MSETEATTEA PAAAPVKRRS TSVKAKTQGK AEGAATPGRN SSATRTPRKS VKAGGAQQQQ PQGGSNNRRQ NASNAGKQNN GGSNRQKQGG GQNRQNNSQN QRRQRRQHDN NRGNREVQPS VSREELAKLK VAELREKAAE LNLDVTGLKK AELVEAVFEA SVKAEGFIEV SGILDILADG YGFLRTQGYL PSETDCYVGL STIRRNGLRK GDLVSGQTRP ARENEKYAAI QKVTAVNGTP VEELGSRVRF GDLTPVYPDE CLVMEHGKNT VTARVIDLVS PIGKGQRGLI VSPPKAGKTT ILKDIAAAIS ANNPEVHLMC LLVDERPEEV TDMERSIKGE VISSTFDMPT ENHIAVSELV IERAKRLVEC GKDVVVLLDS LTRLARAYNL AQPASGRILS GGVDSTALYP PKRFLGAARN IEHGGSLTIL ASALVDTGSK MDEVIFEEFK GTGNMELKLD RNLADRRIFP AIDPVASGTR KEDLLLEPQE APLIWAVRRI LANTNSTERA MDMLIKSLKQ TETNQEFLVR TAKKAQHTKA DGSLEF
|
| |