Gene Information |
Locus tag | Cthe_2174 |
Symbol | rho |
ID | 4810887 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2587649 |
End bp | 2589610 |
Gene Length | 1962 bp |
Protein Length | 653 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640107577 |
Product | transcription termination factor Rho |
Protein accession | YP_001038569 |
Protein GI | 125974659 |
COG category | [K] Transcription |
COG ID | [COG1158] Transcription termination factor |
TIGRFAM ID | [TIGR00767] transcription termination factor Rho |
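
The coordinate and length fields above agree with each other, which can be checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2587649, 2589610)
print(length)                  # 1962
print(protein_length(length))  # 653
```

Both values match the Gene Length and Protein Length fields of this record.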
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000672374 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGAAA TAAAGCTGAG GGAAAAAACG CTTGAGGATT TAAGGTATAT TGCTAAAATG TTGGGGATAA AAAGAGTTAC CACATATAAA AAAAGTGAAC TTATCGAAAA AATTTGCGAA GTGGGCAGAA ACAATGGTAT AGATGATGCT CAGCAACAAG TCGTTGGAGA AGATGAAAAA GAAAAAACAG TTGTTGGAGA AGATGAAAAA GAAAAAACAG TTGATAATAA AGATGGGCAA AAGGATGAGA AAAATGACGG CCAGGTTTCT CAAAACACAG AAGCTCCTGT AGCTGAGGAA CAGCCGGTGG TGCTGAGAAA ATCCAAAAGG GGGAGACCGA AATCAGTCAA GGTTCAGCAA CAGCAGGAAG AGGCAAATGT CGAGTCTGCT CCGGTAAAAG CTGAGGAAAA CAAATCCGAA GCTGAATCCA AGATTGAGTC AAAATCCGAA TCTGAAAAAG CCGAATCAAA ATCCGAATCC AAAGAACCTG AATCGAAATC CGAATCCAAA ACAAAGAGAG GACCAAAATC AAAAACTGAA TCCAAAGAGG CTGAAGCTGC TCAAAACAAT CAGGATGCAG CTGAAAGTGC TGATGCTTCA AAGGCAGATT CTGAGGAAGC TTTAGCGCAG CAAAAAGAGC AAAGCGATGA CAAAGCTTCG GAACAGGATG CTGTAAAACA GGAACAAGCC GTAAGCACTG CAGAAGGTTC GATGGCTAAA GCGGAAACTG AGACGGTGCC GGATGCCGAT GCAGAAAAGG CAAAGGCGGA GCGCAAACAG CCCGAGCAAA AGAAAGAAGG CGACAAACTT CCAAGTGTAT TTGAAAAGAT TGAAAGTGAC GACCCGGTGG AAGGAGTACT GGAAGTATTG CCTGACGGCT ATGGATTTTT AAGGAGCGAC AATTATCTTT CCGGTCCTAA AGATGTGTAT GTATCACCGT CGCAAATCAG ACGTTTCAAC TTGAAAACAG GAGATAAAAT AAAAGGAAAA GGGCGTATTC CGAAAGAAGG AGAGAAATTC CAGGCTCTGC TCTATGTCCA ATCGGTTAAT GGAGATCCTC CGGAAGTTGC GGCCAAGAGA ATACCTTTTG ACCAGTTAAC GCCGATTTAT CCTGACGAAA GGATTACTCT TGAAACCACT CCGAGAGAAT TGTCAACGAG GATGATTGAT TTAATAGCTC CCATTGGAAA AGGACAGCGC GGTATGATTG TTTCACCTCC CAAAGCGGGT AAGACCGTAC TGTTAAAGAA AATTGCAAAC GCTATTAGTA CCAATTATCC TGAGATGGAG CTGATTGTAC TTCTTATAGA TGAAAGACCT GAAGAGGTAA CAGACATGCA GCGCTCCATT AAGGGCGAGG TAATATATTC CACTTTTGAT GAAGTTCCGG AGCATCATAT AAAGGTTGCC GAAATGGTGC TTGAAAGGGC TCAGAGACTT GTTGAACAGA AAAAAGATGT TGTAATATTG CTTGACAGTA TCACAAGGCT TGCAAGGGCA TACAATCTTA CAATTCCTCC TACAGGAAGA ACTCTTTCGG GTGGTCTTGA CCCGGGGGCG CTTCACAAGC CGAAAAGATT CTTTGGTGCA GCAAGAAATA TTGAGAACGG CGGAAGCCTT ACAATTATGG CAACGGCTTT GATTGAAACG GGAAGCAGAA TGGACGACGT TATATTTGAA GAGTTCAAGG GAACCGGAAA CATGGAGATC CATCTTGACA GAAAGCTTTC CGAAAAGAGA ATATTCCCTG CAATAGATAT AAACAAATCC GGAACCAGAA GAGAGGAATT GCTCCTTGAC 
CAGAAGGAGC TTGAAGGAAT TTGGGCTATC AGGAAAGCAA TGAGCAATCT GGGAACGGCT GAAGTTACTG AAATAATTAT AAACCGTTTG ATGCAGACCA AAAGCAATGC TGAATTTGTA AACAGTATAA ACGTTGCATT TCTTGGGGAA GTTGTAAAAT AA
|
Protein sequence | MDEIKLREKT LEDLRYIAKM LGIKRVTTYK KSELIEKICE VGRNNGIDDA QQQVVGEDEK EKTVVGEDEK EKTVDNKDGQ KDEKNDGQVS QNTEAPVAEE QPVVLRKSKR GRPKSVKVQQ QQEEANVESA PVKAEENKSE AESKIESKSE SEKAESKSES KEPESKSESK TKRGPKSKTE SKEAEAAQNN QDAAESADAS KADSEEALAQ QKEQSDDKAS EQDAVKQEQA VSTAEGSMAK AETETVPDAD AEKAKAERKQ PEQKKEGDKL PSVFEKIESD DPVEGVLEVL PDGYGFLRSD NYLSGPKDVY VSPSQIRRFN LKTGDKIKGK GRIPKEGEKF QALLYVQSVN GDPPEVAAKR IPFDQLTPIY PDERITLETT PRELSTRMID LIAPIGKGQR GMIVSPPKAG KTVLLKKIAN AISTNYPEME LIVLLIDERP EEVTDMQRSI KGEVIYSTFD EVPEHHIKVA EMVLERAQRL VEQKKDVVIL LDSITRLARA YNLTIPPTGR TLSGGLDPGA LHKPKRFFGA ARNIENGGSL TIMATALIET GSRMDDVIFE EFKGTGNMEI HLDRKLSEKR IFPAIDINKS GTRREELLLD QKELEGIWAI RKAMSNLGTA EVTEIIINRL MQTKSNAEFV NSINVAFLGE VVK
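
The gene sequence is shown in coding orientation (it starts with ATG and ends with TAA), so it can be translated directly and compared with the protein sequence. The following is a minimal sketch, assuming the amino acid assignments of NCBI translation table 11, which are the same as the standard code for all 64 codons. The `translate` helper is hypothetical, written only for this illustration; it strips the spaces between the 10-base blocks and stops at the first stop codon.

```python
from itertools import product

# Amino acid assignments for translation table 11, codons ordered
# TTT, TTC, TTA, TTG, TCT, ... (bases in the order T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove the block-of-10 spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGGACGAAA TAAAGCTGAG GGAAAAAACG"))  # MDEIKLREKT
print(translate("GAAGTTGTAAAATAA"))                   # EVVK
```

The two fragments reproduce the first ten residues and the last four residues of the protein sequence listed above.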