Gene Cthe_2713 details


Gene Information

Locus tag: Cthe_2713
Symbol:
ID: 4810707
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3200988
End bp: 3202652
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 48%
IMG OID: 640108132
Product: dihydroxy-acid dehydratase
Protein accession: YP_001039105
Protein GI: 125975195
COG category: [E] Amino acid transport and metabolism
              [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
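
The coordinates and lengths in this table can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon. The record itself does not state either convention, and the function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 3200988
END_BP = 3202652
GENE_LENGTH_BP = 1665
PROTEIN_LENGTH_AA = 554


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp):
    """Amino-acid codons, assuming exactly one terminal stop codon."""
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))      # compare with Gene Length
print(coding_codons(GENE_LENGTH_BP))      # compare with Protein Length
```

Under these assumptions both computed values agree with the table: 1665 bp and 554 aa.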


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.387248
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAAGCG ATGCGGTAAA AAAAGGCATA GAAAGAGCCC CTCACAGGGC TTTGTTTAAA 
GCAATGGGCT ATACAGATGA AGAATTGGAA AGACCGCTTA TAGGAGTTGT TAATTCCAGA
AACGAAATTG TTCCGGGACA TATACATCTG GACAAGATTG CCGAAGCTGT AAAGGCAGGT
ATCAGAATGG CAGGAGGTAC TCCTGTTGAG TTCGGTGCAA TCGGTGTGTG TGACGGGATA
GCGATGGGTC ATACGGGAAT GAAATATTCC CTGGCCACAA GGGAGCTTAT AGCCGACTCC
TGCGAGGCAA TGGCGTTGGC CCACAGCTTT GACGGAATGG TTTTCATACC CAATTGTGAC
AAGATAGTGC CGGGAATGCT GATGGCAGCT GCAAGAATAA ATGTTCCCGC CATTGTGGTA
AGCGGAGGTC CCATGCTGTC TTTAAGGCAT AATGACAAAA ACCTGGATTT AAACAGCGTG
TTTGAAGCTG TAGGGGCATA CAAGGCGGGA AAGATGACGG AGAAAGAAGT TTGGGAGTAT
GAGGAAAAAG CTTGTCCCGG CTGCGGTTCC TGTTCCGGTA TGTTTACCGC CAACTCCATG
AACTGCCTCA CTGAGGTTTT GGGAATGGGT CTTCCGGGCA ACGGAACGGT CCCTGCGGTT
TATGCGGAAA GAATACGCCT TGCAAAGAAA GCCGGAATGA AGATAGTGGA ATTGGTTGAA
AAAGATATAA AACCTTCGGA TATTCTCACT CCAAAGGCTT TCGAGAATGC TCTGGCCGTG
GACATGGCTT TGGGCTGCTC GACAAACTCT GTGCTTCATC TTCCTGCTAT TGCCAATGAA
GTGGGAATGG AGATAAACCT TGACATAATA AACGAAATAA GCAGCAAGGT ACCGAACCTT
TGCAAGCTGG CTCCGGCGGG CCACCATCAT GTTCAGGACC TCTATGCGGC GGGAGGAATA
CCTGCTGTGA TGAAGGAACT TTCAAAGAAG AATTTGCTGC ATCTGGATTT GATAACCGTT
ACCGGCAAAA CTGTAAGGGA AAACATTGAA AACGCAAAAG TCAGGGACTA TGAGGTTATA
AGAAGCATTG ACAATCCTTA CAGTCCGACG GGCGGTATAG CGGTGCTGAG GGGTAATCTT
GCTCCGGACG GTGCGGTTGT AAAGCGCTCG GCTGTTGCCC CTGAAATGTT GGTTCACAAG
GGACCGGCAA GGGTGTTTGA CTCGGAGGAT GCTGCCATAG AAGCAATTTA CAACGGTAAA
ATAAACAAAG GTGACGTGGT CATAATACGC TATGAAGGTC CCAAAGGAGG TCCCGGCATG
AGGGAAATGC TGTCCCCGAC TTCCGCAATT GCAGGTATGG GACTTGACAA GGACGTTGCC
TTGATTACTG ACGGACGTTT TTCCGGTGCT ACGAGAGGAG CTTCAATAGG TCATGTGTCT
CCGGAGGCTA TGGCGGGCGG ACCTATAGCA ATTGTCAGAG ACGGGGATAT TATCAGCATA
GACATACCTA ACGGAAAGCT TGATGTAGAA ATCCCCGACA GCGAAATTCA GAAGAGACTT
AAAGAGTGGA AGGCACCGGC GCCGAAAATA ACAAAGGGTT ACCTTGGAAG ATATGCAAAA
CTTGTTTCTT CTGCAAACAA AGGCGCCATC CTGGAAAACA AATAA
 
Protein sequence
MRSDAVKKGI ERAPHRALFK AMGYTDEELE RPLIGVVNSR NEIVPGHIHL DKIAEAVKAG 
IRMAGGTPVE FGAIGVCDGI AMGHTGMKYS LATRELIADS CEAMALAHSF DGMVFIPNCD
KIVPGMLMAA ARINVPAIVV SGGPMLSLRH NDKNLDLNSV FEAVGAYKAG KMTEKEVWEY
EEKACPGCGS CSGMFTANSM NCLTEVLGMG LPGNGTVPAV YAERIRLAKK AGMKIVELVE
KDIKPSDILT PKAFENALAV DMALGCSTNS VLHLPAIANE VGMEINLDII NEISSKVPNL
CKLAPAGHHH VQDLYAAGGI PAVMKELSKK NLLHLDLITV TGKTVRENIE NAKVRDYEVI
RSIDNPYSPT GGIAVLRGNL APDGAVVKRS AVAPEMLVHK GPARVFDSED AAIEAIYNGK
INKGDVVIIR YEGPKGGPGM REMLSPTSAI AGMGLDKDVA LITDGRFSGA TRGASIGHVS
PEAMAGGPIA IVRDGDIISI DIPNGKLDVE IPDSEIQKRL KEWKAPAPKI TKGYLGRYAK
LVSSANKGAI LENK
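
The relationship between the gene sequence and the protein sequence can be reproduced with a short translation routine. This is a minimal sketch, not the method used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three ten-base blocks of the gene sequence in this record.
first_blocks = "ATGAGAAGCG ATGCGGTAAA AAAAGGCATA"
print(translate(first_blocks))  # MRSDAVKKGI
```

Passing the full gene sequence to `translate` and `gc_fraction` allows a comparison with the protein sequence and the GC content listed in this record.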