Gene Information |
Locus tag | Cthe_0054 |
Symbol | |
ID | 4808749 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 71948 |
End bp | 74137 |
Gene Length | 2190 bp |
Protein Length | 729 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640105463 |
Product | hypothetical protein |
Protein accession | YP_001036488 |
Protein GI | 125972578 |
COG category | [S] Function unknown |
COG ID | [COG1649] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
| |
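The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive and that the protein length excludes the terminal stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon encodes one residue.
    return gene_len_bp // 3 - 1


length = gene_length(71948, 74137)
print(length)                  # 2190
print(protein_length(length))  # 729
```

Both values agree with the Gene Length (2190 bp) and Protein Length (729 aa) fields of this record.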
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000080624 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGAGAC GTTGCAAAAG ACTTGTATGT ATTTTGTTGT CATTTCTTGT GATTGCCGGT GTTTTTACGT TCAGTGGTGC AAAAACCGAG CCCTGGAGTG CCTATCAAAA GTTTATACCC AATGAGACAC CGGTTGTAAA AAGGCATCTT AGGGGAGTGT GGATTAGTAC TGTTGCAAAC CTTGACTGGC CGTCCGTAGA GACACGAAAA ATAGAAAATC CTTCGGAACG GATAAGAAAA ACTAAAGAAG AGCTTGTGGA GATTTTCGAC AAGGCTGTGG AGATGAATTT AAATGCCGTT TTCCTCCAGG TCAGTCCGGA GGGGGATGCA TTCTACAAAT CAGATATAGT GCCCTGGTCA CGTTATCTTA CAGGAACCTT TGGAGAGGAC CCCGGCTTTG ATCCCTTGGA GTTTGCAATT GAGGAAGCTC ACAAGCGAAA TCTGGAGCTC CATGCATGGT TTAATCCTTA CAGGGTGTCG ACAAACACGT CGGCTGCAAC CATTTCATCT TTAAAAGTCG AAAAAAGTGT GTACAAGGAA CATCCTGACT GGATTAGGAC AGCCATGAAC AGGTTTGTTG TCGATCCTGG AATACCTGAA GCGAGGCAAT GGGTGATTGA CCGTGTTATG GAGGTGGTGA AAAAATATGA TGTGGACGGA GTGCATTTTG ACGACTATTT TTATTATGAG CAATATGTGG GAGAGCTGAA GGATCAGGAT ACTTACAACA AGTACAATAA GGGACAGTTT TCCAATATAG GCGATTTCAG AAGAAACAAC ACGTATTTGC TGGTAAAGGA GCTTTCGCAG AAGATAAGGG CAACCAAGCC CTGGGTTAAA TTTGGCATTA GTCCTTCCGG CGTATGGGGG AACAAAAGCG ACGGCCACAG CTACGGTTCC AATACGAGTG CAAGTCTTAC AAATTACGAT AAAAGTTTTG CGGATACAAA GAAATGGGTT CAGGAGGAGC TTATCGATTA CATTGCTCCC CAGGTTTATT TCACTTTTGC AAATTCCAGA GCACCTTACG GTGAGATTGC TTTGTGGTGG TCGGATGTTT GCAGGGGGAA AAATGTGCAT CTTTATATAG GTCAGGCGTT TTATAAGATA AATGATGACA GCGATCAATA TTTTAAAGGT GAGAATGCTG TGCCGGAGCT GACAAGGCAA TTGAAATTCA ATGCGGTAAA ACCTGAGATA ATGGGAACTG TTTTGTTCCG TTTTGCAAAT TTTAAAGATT CCGGTAAACA GCAGGCGGTA AATGCCGTAA AGAATGACTT GTGGTCACAA AAAGCCTTGA TTCCACCAAT GCCGTGGAAG GGCGGCAATG CTCCTGATGC GCCTATACTG GGAAGATTGG AATCCCTGCC CGACGGAGTG GAAATATCGT GGATGGATAA TGACCCGGAC ACCGCGTATT TTGCAATTTA CCGCTTTAAT GCCGGAGAAA AAATGGACAT TACCTCTGAC AGCAGTGCAT ACAAACTTAT TGCCACTGTC AGAAAAAACA GTAACGGTGT GCAGAAATTT GTGGATTATG GGGTTTTGGA TGCTGACAGC GTATATTATG TTGTTACTGC TTTGGACCGG CTGCACAATG AAAGTGAAGG ACTTGCAATA AGCACCAATC AGTCCGAATA TTTTCCGGAT GTCGGGATGA AATATTCCTG GGCCGTTGAT GCAATTGACA TGCTTTATGA AAAAGGAGTT GTCAAGGGTG ATGAAAGCGG GATGTTCAAC CCGGGGGTGA ACACGAAAAG AGCTGATTTT ACTATTATGA TTGTAAAGGC GCTGGCCCTG 
AAAGCTGATT TTGAAGACAA TTTTGCCGAT GTCAGGAAAG ATGCATATTA CTATGAGGCG GTAGGCGTTG CCAGGGCTCT GGGAATTGTA AAAGGAGACG GAAAGAATTT TAATCCCGAT GCCAATATAA CCCGGGAGGA TATGATGGTT ATCGTGGTCA ATGCCCTTAA AGCGGCCGGG GCAAAGATTG ACGAAGCCGA TGAGCAATTC CTTGAAAATT ACGGTGATGC GAACAGTATA AGCGGCTATG CAAGGAAATC GGTGGCTGTT CTTACGAAAG CGGGAGTTGT AAACGGCTAC GACGGGAAAA TACATCCTAA AAGTCTGGCC ACAAGGGCTG AGATTGCAGT GGTAGTATCA AAGCTGTTAA CCAATATTGA GTATTTATAA
|
Protein sequence | MERRCKRLVC ILLSFLVIAG VFTFSGAKTE PWSAYQKFIP NETPVVKRHL RGVWISTVAN LDWPSVETRK IENPSERIRK TKEELVEIFD KAVEMNLNAV FLQVSPEGDA FYKSDIVPWS RYLTGTFGED PGFDPLEFAI EEAHKRNLEL HAWFNPYRVS TNTSAATISS LKVEKSVYKE HPDWIRTAMN RFVVDPGIPE ARQWVIDRVM EVVKKYDVDG VHFDDYFYYE QYVGELKDQD TYNKYNKGQF SNIGDFRRNN TYLLVKELSQ KIRATKPWVK FGISPSGVWG NKSDGHSYGS NTSASLTNYD KSFADTKKWV QEELIDYIAP QVYFTFANSR APYGEIALWW SDVCRGKNVH LYIGQAFYKI NDDSDQYFKG ENAVPELTRQ LKFNAVKPEI MGTVLFRFAN FKDSGKQQAV NAVKNDLWSQ KALIPPMPWK GGNAPDAPIL GRLESLPDGV EISWMDNDPD TAYFAIYRFN AGEKMDITSD SSAYKLIATV RKNSNGVQKF VDYGVLDADS VYYVVTALDR LHNESEGLAI STNQSEYFPD VGMKYSWAVD AIDMLYEKGV VKGDESGMFN PGVNTKRADF TIMIVKALAL KADFEDNFAD VRKDAYYYEA VGVARALGIV KGDGKNFNPD ANITREDMMV IVVNALKAAG AKIDEADEQF LENYGDANSI SGYARKSVAV LTKAGVVNGY DGKIHPKSLA TRAEIAVVVS KLLTNIEYL
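The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short script. This is a sketch under stated assumptions: the codon assignments are those of NCBI translation table 11 (which uses the same codon-to-residue mapping as the standard code and differs only in permitted start codons), whitespace in the sequence is only the 10-base grouping used on this page, and the helper names `clean`, `translate`, and `gc_content` are hypothetical. Only the first 30 bases of the gene are used to keep the example short.

```python
BASES = "TCAG"
# Residues for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons from the first base, stopping at a stop codon."""
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


prefix = "ATGGAGAGAC GTTGCAAAAG ACTTGTATGT"  # first 30 bases of the gene sequence
print(translate(prefix))   # MERRCKRLVC
print(gc_content(prefix))  # 0.4
```

The translated prefix matches the first ten residues of the protein sequence above. Passing the full gene sequence to `gc_content` should give a value close to the 44% reported in the GC content field, although the rounding used by the database is not stated here.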