Gene Elen_1088 details


Gene Information

Locus tag: Elen_1088
Symbol:
ID: 8415378
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 1314539
End bp: 1316179
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 67%
IMG OID: 645024051
Product: transcription termination factor Rho
Protein accession: YP_003181448
Protein GI: 257790842
COG category: [K] Transcription
COG ID: [COG1158] Transcription termination factor
TIGRFAM ID: [TIGR00767] transcription termination factor Rho
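
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the database record.

```python
# Values copied from the record above.
start_bp = 1314539
end_bp = 1316179
gene_length_bp = 1641
protein_length_aa = 546

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons, remainder = divmod(span_bp, 3)
coding_codons = codons - 1

print(span_bp, codons, remainder, coding_codons)  # 1641 547 0 546
```

Under these assumptions the span equals the listed gene length, and the codon count minus the stop codon equals the listed protein length.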


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00846053
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000000000000541512
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCGAAA CCGAAGCAAC CACGGAGGCT CCTGCGGCCG CTCCCGTCAA GCGACGTTCC 
ACCAGCGTGA AGGCGAAGAC GCAGGGCAAG GCCGAGGGCG CAGCCACTCC GGGCCGCAAC
TCGTCGGCCA CGCGCACGCC GCGCAAATCG GTGAAGGCGG GCGGCGCGCA GCAGCAACAG
CCGCAGGGCG GTTCGAACAA CCGCCGCCAG AACGCCTCGA ACGCCGGCAA GCAGAACAAC
GGCGGCAGCA ATCGCCAGAA GCAGGGCGGC GGCCAGAACC GGCAGAACAA CAGCCAGAAC
CAGCGTCGTC AGCGTCGCCA GCACGACAAC AACCGCGGCA ACCGCGAAGT GCAGCCCAGC
GTGTCGCGCG AGGAGCTGGC GAAGCTCAAG GTGGCCGAGC TGCGCGAGAA GGCCGCCGAG
CTCAACCTCG ACGTGACGGG CCTCAAGAAG GCCGAGCTGG TGGAGGCCGT GTTCGAAGCC
AGTGTCAAGG CCGAGGGCTT CATCGAGGTG TCGGGCATCC TCGACATCCT GGCCGACGGC
TACGGCTTCC TCCGCACGCA GGGCTACCTG CCCAGCGAGA CGGATTGCTA CGTGGGTCTT
TCCACCATCC GCCGCAACGG CCTGCGCAAG GGCGATCTCG TGTCCGGTCA GACGCGCCCG
GCGCGCGAGA ACGAGAAGTA CGCCGCCATC CAGAAGGTCA CGGCCGTCAA CGGCACGCCC
GTCGAGGAGC TGGGCAGCCG CGTGCGCTTC GGCGACCTCA CGCCGGTCTA CCCGGACGAA
TGCCTCGTCA TGGAGCACGG CAAGAACACC GTCACCGCCC GCGTCATCGA CCTCGTGTCG
CCCATCGGCA AGGGCCAGCG CGGACTGATC GTCAGCCCCC CGAAGGCGGG CAAGACCACC
ATCCTCAAGG ACATCGCCGC CGCCATCAGC GCGAACAACC CCGAAGTGCA CCTCATGTGC
CTGCTCGTGG ACGAGCGTCC CGAGGAAGTC ACCGACATGG AGCGCTCCAT CAAGGGCGAG
GTCATCTCCT CCACGTTCGA TATGCCCACC GAGAACCACA TCGCCGTGTC CGAGCTGGTC
ATCGAGCGCG CGAAGCGCCT CGTGGAATGC GGCAAGGACG TTGTCGTGCT GCTGGACTCG
CTCACGCGCC TCGCGCGCGC CTACAACCTG GCGCAGCCGG CATCGGGCCG CATCCTGTCG
GGCGGCGTGG ACTCCACGGC GCTCTACCCG CCGAAGCGAT TCCTGGGCGC TGCGCGCAAC
ATCGAGCACG GCGGTAGCCT GACCATCCTG GCCTCAGCCC TGGTGGACAC GGGCTCGAAG
ATGGACGAGG TCATCTTCGA GGAGTTCAAG GGCACCGGCA ACATGGAGCT CAAGCTGGAT
CGCAACCTGG CCGACCGCCG CATCTTCCCG GCCATCGACC CGGTGGCGTC GGGCACCCGC
AAGGAGGATC TGCTGCTGGA GCCGCAGGAG GCGCCGCTCA TCTGGGCCGT GCGCCGCATC
CTCGCGAACA CGAACAGCAC CGAACGCGCC ATGGACATGC TGATCAAGTC GCTCAAGCAG
ACCGAGACGA ACCAGGAGTT CCTCGTGCGC ACGGCGAAGA AGGCCCAGCA CACAAAGGCC
GACGGCTCGC TGGAGTTCTA G
 
Protein sequence
MSETEATTEA PAAAPVKRRS TSVKAKTQGK AEGAATPGRN SSATRTPRKS VKAGGAQQQQ 
PQGGSNNRRQ NASNAGKQNN GGSNRQKQGG GQNRQNNSQN QRRQRRQHDN NRGNREVQPS
VSREELAKLK VAELREKAAE LNLDVTGLKK AELVEAVFEA SVKAEGFIEV SGILDILADG
YGFLRTQGYL PSETDCYVGL STIRRNGLRK GDLVSGQTRP ARENEKYAAI QKVTAVNGTP
VEELGSRVRF GDLTPVYPDE CLVMEHGKNT VTARVIDLVS PIGKGQRGLI VSPPKAGKTT
ILKDIAAAIS ANNPEVHLMC LLVDERPEEV TDMERSIKGE VISSTFDMPT ENHIAVSELV
IERAKRLVEC GKDVVVLLDS LTRLARAYNL AQPASGRILS GGVDSTALYP PKRFLGAARN
IEHGGSLTIL ASALVDTGSK MDEVIFEEFK GTGNMELKLD RNLADRRIFP AIDPVASGTR
KEDLLLEPQE APLIWAVRRI LANTNSTERA MDMLIKSLKQ TETNQEFLVR TAKKAQHTKA
DGSLEF
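
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the tables are understood to differ only in which start codons are permitted). The helpers `translate` and `gc_fraction` are hypothetical names written for this example, and only the first 60 bases of the gene are used.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Assumption: amino acid assignments for table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spacing used in the listing and upper-case the bases."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon in frame 1, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence listed above.
first_60 = "ATGAGCGAAA CCGAAGCAAC CACGGAGGCT CCTGCGGCCG CTCCCGTCAA GCGACGTTCC"

print(translate(first_60))  # MSETEATTEAPAAAPVKRRS
print(round(gc_fraction(first_60), 2))
```

The translated residues match the first 20 residues of the protein sequence above. Passing the full gene sequence to the same functions would be the natural way to check the complete protein and the listed GC content.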