Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2713 |
Symbol | |
ID | 4810707 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3200988 |
End bp | 3202652 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640108132 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_001039105 |
Protein GI | 125975195 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.387248 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAGCG ATGCGGTAAA AAAAGGCATA GAAAGAGCCC CTCACAGGGC TTTGTTTAAA GCAATGGGCT ATACAGATGA AGAATTGGAA AGACCGCTTA TAGGAGTTGT TAATTCCAGA AACGAAATTG TTCCGGGACA TATACATCTG GACAAGATTG CCGAAGCTGT AAAGGCAGGT ATCAGAATGG CAGGAGGTAC TCCTGTTGAG TTCGGTGCAA TCGGTGTGTG TGACGGGATA GCGATGGGTC ATACGGGAAT GAAATATTCC CTGGCCACAA GGGAGCTTAT AGCCGACTCC TGCGAGGCAA TGGCGTTGGC CCACAGCTTT GACGGAATGG TTTTCATACC CAATTGTGAC AAGATAGTGC CGGGAATGCT GATGGCAGCT GCAAGAATAA ATGTTCCCGC CATTGTGGTA AGCGGAGGTC CCATGCTGTC TTTAAGGCAT AATGACAAAA ACCTGGATTT AAACAGCGTG TTTGAAGCTG TAGGGGCATA CAAGGCGGGA AAGATGACGG AGAAAGAAGT TTGGGAGTAT GAGGAAAAAG CTTGTCCCGG CTGCGGTTCC TGTTCCGGTA TGTTTACCGC CAACTCCATG AACTGCCTCA CTGAGGTTTT GGGAATGGGT CTTCCGGGCA ACGGAACGGT CCCTGCGGTT TATGCGGAAA GAATACGCCT TGCAAAGAAA GCCGGAATGA AGATAGTGGA ATTGGTTGAA AAAGATATAA AACCTTCGGA TATTCTCACT CCAAAGGCTT TCGAGAATGC TCTGGCCGTG GACATGGCTT TGGGCTGCTC GACAAACTCT GTGCTTCATC TTCCTGCTAT TGCCAATGAA GTGGGAATGG AGATAAACCT TGACATAATA AACGAAATAA GCAGCAAGGT ACCGAACCTT TGCAAGCTGG CTCCGGCGGG CCACCATCAT GTTCAGGACC TCTATGCGGC GGGAGGAATA CCTGCTGTGA TGAAGGAACT TTCAAAGAAG AATTTGCTGC ATCTGGATTT GATAACCGTT ACCGGCAAAA CTGTAAGGGA AAACATTGAA AACGCAAAAG TCAGGGACTA TGAGGTTATA AGAAGCATTG ACAATCCTTA CAGTCCGACG GGCGGTATAG CGGTGCTGAG GGGTAATCTT GCTCCGGACG GTGCGGTTGT AAAGCGCTCG GCTGTTGCCC CTGAAATGTT GGTTCACAAG GGACCGGCAA GGGTGTTTGA CTCGGAGGAT GCTGCCATAG AAGCAATTTA CAACGGTAAA ATAAACAAAG GTGACGTGGT CATAATACGC TATGAAGGTC CCAAAGGAGG TCCCGGCATG AGGGAAATGC TGTCCCCGAC TTCCGCAATT GCAGGTATGG GACTTGACAA GGACGTTGCC TTGATTACTG ACGGACGTTT TTCCGGTGCT ACGAGAGGAG CTTCAATAGG TCATGTGTCT CCGGAGGCTA TGGCGGGCGG ACCTATAGCA ATTGTCAGAG ACGGGGATAT TATCAGCATA GACATACCTA ACGGAAAGCT TGATGTAGAA ATCCCCGACA GCGAAATTCA GAAGAGACTT AAAGAGTGGA AGGCACCGGC GCCGAAAATA ACAAAGGGTT ACCTTGGAAG ATATGCAAAA CTTGTTTCTT CTGCAAACAA AGGCGCCATC CTGGAAAACA AATAA
|
Protein sequence | MRSDAVKKGI ERAPHRALFK AMGYTDEELE RPLIGVVNSR NEIVPGHIHL DKIAEAVKAG IRMAGGTPVE FGAIGVCDGI AMGHTGMKYS LATRELIADS CEAMALAHSF DGMVFIPNCD KIVPGMLMAA ARINVPAIVV SGGPMLSLRH NDKNLDLNSV FEAVGAYKAG KMTEKEVWEY EEKACPGCGS CSGMFTANSM NCLTEVLGMG LPGNGTVPAV YAERIRLAKK AGMKIVELVE KDIKPSDILT PKAFENALAV DMALGCSTNS VLHLPAIANE VGMEINLDII NEISSKVPNL CKLAPAGHHH VQDLYAAGGI PAVMKELSKK NLLHLDLITV TGKTVRENIE NAKVRDYEVI RSIDNPYSPT GGIAVLRGNL APDGAVVKRS AVAPEMLVHK GPARVFDSED AAIEAIYNGK INKGDVVIIR YEGPKGGPGM REMLSPTSAI AGMGLDKDVA LITDGRFSGA TRGASIGHVS PEAMAGGPIA IVRDGDIISI DIPNGKLDVE IPDSEIQKRL KEWKAPAPKI TKGYLGRYAK LVSSANKGAI LENK
|
| |