Gene Cthe_2174 details

Gene Information

Locus tag: Cthe_2174
Symbol: rho
ID: 4810887
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2587649
End bp: 2589610
Gene Length: 1962 bp
Protein Length: 653 aa
Translation table: 11
GC content: 43%
IMG OID: 640107577
Product: transcription termination factor Rho
Protein accession: YP_001038569
Protein GI: 125974659
COG category: [K] Transcription
COG ID: [COG1158] Transcription termination factor
TIGRFAM ID: [TIGR00767] transcription termination factor Rho
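
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the reported gene length is consistent with that reading) and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2587649
end_bp = 2589610
gene_length_bp = 1962
protein_length_aa = 653

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the final codon is the stop,
# which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # 1962 0 653
```

Under these assumptions the three fields agree: the span equals the gene length, the length is a multiple of three, and 654 codons minus one stop codon gives 653 residues.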


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000672374
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACGAAA TAAAGCTGAG GGAAAAAACG CTTGAGGATT TAAGGTATAT TGCTAAAATG 
TTGGGGATAA AAAGAGTTAC CACATATAAA AAAAGTGAAC TTATCGAAAA AATTTGCGAA
GTGGGCAGAA ACAATGGTAT AGATGATGCT CAGCAACAAG TCGTTGGAGA AGATGAAAAA
GAAAAAACAG TTGTTGGAGA AGATGAAAAA GAAAAAACAG TTGATAATAA AGATGGGCAA
AAGGATGAGA AAAATGACGG CCAGGTTTCT CAAAACACAG AAGCTCCTGT AGCTGAGGAA
CAGCCGGTGG TGCTGAGAAA ATCCAAAAGG GGGAGACCGA AATCAGTCAA GGTTCAGCAA
CAGCAGGAAG AGGCAAATGT CGAGTCTGCT CCGGTAAAAG CTGAGGAAAA CAAATCCGAA
GCTGAATCCA AGATTGAGTC AAAATCCGAA TCTGAAAAAG CCGAATCAAA ATCCGAATCC
AAAGAACCTG AATCGAAATC CGAATCCAAA ACAAAGAGAG GACCAAAATC AAAAACTGAA
TCCAAAGAGG CTGAAGCTGC TCAAAACAAT CAGGATGCAG CTGAAAGTGC TGATGCTTCA
AAGGCAGATT CTGAGGAAGC TTTAGCGCAG CAAAAAGAGC AAAGCGATGA CAAAGCTTCG
GAACAGGATG CTGTAAAACA GGAACAAGCC GTAAGCACTG CAGAAGGTTC GATGGCTAAA
GCGGAAACTG AGACGGTGCC GGATGCCGAT GCAGAAAAGG CAAAGGCGGA GCGCAAACAG
CCCGAGCAAA AGAAAGAAGG CGACAAACTT CCAAGTGTAT TTGAAAAGAT TGAAAGTGAC
GACCCGGTGG AAGGAGTACT GGAAGTATTG CCTGACGGCT ATGGATTTTT AAGGAGCGAC
AATTATCTTT CCGGTCCTAA AGATGTGTAT GTATCACCGT CGCAAATCAG ACGTTTCAAC
TTGAAAACAG GAGATAAAAT AAAAGGAAAA GGGCGTATTC CGAAAGAAGG AGAGAAATTC
CAGGCTCTGC TCTATGTCCA ATCGGTTAAT GGAGATCCTC CGGAAGTTGC GGCCAAGAGA
ATACCTTTTG ACCAGTTAAC GCCGATTTAT CCTGACGAAA GGATTACTCT TGAAACCACT
CCGAGAGAAT TGTCAACGAG GATGATTGAT TTAATAGCTC CCATTGGAAA AGGACAGCGC
GGTATGATTG TTTCACCTCC CAAAGCGGGT AAGACCGTAC TGTTAAAGAA AATTGCAAAC
GCTATTAGTA CCAATTATCC TGAGATGGAG CTGATTGTAC TTCTTATAGA TGAAAGACCT
GAAGAGGTAA CAGACATGCA GCGCTCCATT AAGGGCGAGG TAATATATTC CACTTTTGAT
GAAGTTCCGG AGCATCATAT AAAGGTTGCC GAAATGGTGC TTGAAAGGGC TCAGAGACTT
GTTGAACAGA AAAAAGATGT TGTAATATTG CTTGACAGTA TCACAAGGCT TGCAAGGGCA
TACAATCTTA CAATTCCTCC TACAGGAAGA ACTCTTTCGG GTGGTCTTGA CCCGGGGGCG
CTTCACAAGC CGAAAAGATT CTTTGGTGCA GCAAGAAATA TTGAGAACGG CGGAAGCCTT
ACAATTATGG CAACGGCTTT GATTGAAACG GGAAGCAGAA TGGACGACGT TATATTTGAA
GAGTTCAAGG GAACCGGAAA CATGGAGATC CATCTTGACA GAAAGCTTTC CGAAAAGAGA
ATATTCCCTG CAATAGATAT AAACAAATCC GGAACCAGAA GAGAGGAATT GCTCCTTGAC
CAGAAGGAGC TTGAAGGAAT TTGGGCTATC AGGAAAGCAA TGAGCAATCT GGGAACGGCT
GAAGTTACTG AAATAATTAT AAACCGTTTG ATGCAGACCA AAAGCAATGC TGAATTTGTA
AACAGTATAA ACGTTGCATT TCTTGGGGAA GTTGTAAAAT AA
 
Protein sequence
MDEIKLREKT LEDLRYIAKM LGIKRVTTYK KSELIEKICE VGRNNGIDDA QQQVVGEDEK 
EKTVVGEDEK EKTVDNKDGQ KDEKNDGQVS QNTEAPVAEE QPVVLRKSKR GRPKSVKVQQ
QQEEANVESA PVKAEENKSE AESKIESKSE SEKAESKSES KEPESKSESK TKRGPKSKTE
SKEAEAAQNN QDAAESADAS KADSEEALAQ QKEQSDDKAS EQDAVKQEQA VSTAEGSMAK
AETETVPDAD AEKAKAERKQ PEQKKEGDKL PSVFEKIESD DPVEGVLEVL PDGYGFLRSD
NYLSGPKDVY VSPSQIRRFN LKTGDKIKGK GRIPKEGEKF QALLYVQSVN GDPPEVAAKR
IPFDQLTPIY PDERITLETT PRELSTRMID LIAPIGKGQR GMIVSPPKAG KTVLLKKIAN
AISTNYPEME LIVLLIDERP EEVTDMQRSI KGEVIYSTFD EVPEHHIKVA EMVLERAQRL
VEQKKDVVIL LDSITRLARA YNLTIPPTGR TLSGGLDPGA LHKPKRFFGA ARNIENGGSL
TIMATALIET GSRMDDVIFE EFKGTGNMEI HLDRKLSEKR IFPAIDINKS GTRREELLLD
QKELEGIWAI RKAMSNLGTA EVTEIIINRL MQTKSNAEFV NSINVAFLGE VVK
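
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch, not the tool that generated this record. It relies on the fact that translation table 11 (bacterial, archaeal and plant plastid) uses the same codon-to-amino-acid assignments as the standard code and differs only in which start codons are permitted. Alternative start codons are not modelled here, which does not matter for this gene because it begins with ATG. The names `translate`, `first_line` and `last_line` are illustrative.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G at each position).
# Translation table 11 shares these assignments; only start-codon rules differ.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-mers
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence above. The 32 preceding lines hold
# 60 bases each (1920, a multiple of 3), so the last line is also in frame.
first_line = "ATGGACGAAA TAAAGCTGAG GGAAAAAACG CTTGAGGATT TAAGGTATAT TGCTAAAATG"
last_line = "AACAGTATAA ACGTTGCATT TCTTGGGGAA GTTGTAAAAT AA"

print(translate(first_line))  # MDEIKLREKTLEDLRYIAKM
print(translate(last_line))   # NSINVAFLGEVVK
```

Both results match the corresponding ends of the protein sequence listed above: the first 20 residues, and the final 13 residues followed by the TAA stop codon.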