Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_3073 |
Symbol | |
ID | 4809947 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3613910 |
End bp | 3614581 |
Gene Length | 672 bp |
Protein Length | 223 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640108497 |
Product | HAD family hydrolase |
Protein accession | YP_001039462 |
Protein GI | 125975552 |
COG category | [R] General function prediction only |
COG ID | [COG0637] Predicted phosphatase/phosphohexomutase |
TIGRFAM ID | [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00582956 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAAAAAG TTAAAGCGGT TATTTTTGAT ATGGATGGAT TGATGATTGA TACGGAGCGA CTGTATTTTG AAGTGGAAAG AATAATGGCT CGAAAATTTG GTAAAGAGGT AAAGGATGAA ACCCTGTGGA AGATGATGGG AAGAAAGCCA TTGGAAGCCA TCACTGTTTT TGCAGAGGAT CTGGAACTTG ATATTTCACC AAAGAAGCTT TTGGAGATTA GGGATGAGCT GTTTGTCAAA AAGCTTGTAA ATGAAGTTGA ACCGATGCCG GGGCTTTTTG ATATTCTGAA CATCCTTAAA GGAAAGGTGA AGATGGCTAT AGCCACAGGT TCTCCTCAAA AATTTCTGAA AATTGTGCTG GATAAGCTTA AGATAGAAAG CTATTTTGAT GTTTTTGTGA CATCGGATGA GGTTGAAAAG GGAAAGCCCG ATCCCGAAGT TTACAATACT GCGGTAAAGA GGCTTAAAGT GGCACCTTTT GAGTGTGTTG TTCTTGAGGA TTCGAGCAAC GGTGCTCTTG CGGCGGTTAG GGCCGGATGT TATACCATTG CCGTACCTAC GGTGTATACA AACAAACAGG ATTTTAGTTT TGTAAATTAT GTTGCCAAAG ATTTAAAGGA TGCTGCGGAA AAAATCAATG AATTTCTTTG CAGTCAAGAA ATTGAGTTTT GA
|
Protein sequence | MKKVKAVIFD MDGLMIDTER LYFEVERIMA RKFGKEVKDE TLWKMMGRKP LEAITVFAED LELDISPKKL LEIRDELFVK KLVNEVEPMP GLFDILNILK GKVKMAIATG SPQKFLKIVL DKLKIESYFD VFVTSDEVEK GKPDPEVYNT AVKRLKVAPF ECVVLEDSSN GALAAVRAGC YTIAVPTVYT NKQDFSFVNY VAKDLKDAAE KINEFLCSQE IEF
|
| |