Gene Athe_0665 details


Gene Information

Locus tag: Athe_0665
Symbol:
ID: 7407089
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 749942
End bp: 751600
Gene Length: 1659 bp
Protein Length: 552 aa
Translation table: 11
GC content: 42%
IMG OID: 643715046
Product: dihydroxy-acid dehydratase
Protein accession: YP_002572562
Protein GI: 222528680
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
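
The start, end, gene length, and protein length fields in this record agree with each other, which a little arithmetic can confirm. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count of a CDS, assuming the last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(749942, 751600)
print(length)                  # 1659
print(protein_length(length))  # 552
```

Both results match the Gene Length and Protein Length fields in this record.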


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000174611
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAAGTG ATACTGTCAA GAAAGGGTTT GAAAAGGCTC CTCAGCGTTC GCTTTTCAAA 
GCAATGGGAT ACACTGATGA AGAGATAAGA AGACCACTTA TTGCAGTTGT GAATTCATGG
AATGAAGTTG TACCTGGACA CATTCATCTT GACAGAATTG CAGAGGCAGT GAAAGCTGGT
ATCAGGCTTG CTGGTGCAAC TCCAATGGAG TTTAATGTCA TAGGTGTATG TGATGGTATC
GCTATGGGTC ACATTGGCAT GAAGTATTCG CTCATCACAA GAGAGCTCAT TGCAGATTCA
ATCGAGGCAA TGGTAATGGC ACACCAGTTT GATGGCATGG TCTTGATTCC AAACTGTGAC
AAAATAGTCC CTGGAATGCT AATAGCAGCA GCAAGAGTAA ACATCCCTGC CATTTTAATA
AGTGGTGGAC CTATGCTTGC GGGTAAAATT GGTGATAAGG TATGTGACCT TAACTCTGTA
TTTGAAGGTG TAGGTGCATA CTCTGCAGGC AAGATTTCTG AAGAAGATTT ATATGCCTTA
GAAGAAAATG CATGTCCTGG ATGTGGTTCA TGTTCTGGAA TGTTTACAGC AAACACCATG
AACTGTTTGA GCGAGGTTTT GGGGCTTGCT CTTCCTGGAA ATGGAACAAT TCCGGCTGTA
ATGGCAGCAC GCATCCGTCT TGCTAAAATG GCAGGTATGA AGATTGTTGA GCTTGTTGAA
AAGGACATAA AACCGTCTGA TATTTTGACA GTTGAAGCAT TTGAAAATGC CTTAGCAGTT
GACATGGCGC TTGGTGGGTC AACAAACACT ATCTTGCATC TTCCTGCTAT TGCAAATGAA
GTTGGAATAA AGTTAAATCT TGATATAATA AACGCTATAA GTGATAGAAC ACCAAATCTT
TGTAAGCTCT CACCGGCAGG ACAACATCAT ATTGAGGACC TTTACTTTGC AGGCGGCGTT
CAGGCTGTTA TGAATGAGCT TTCTAAAAAA GGTTTGCTTC ATTTAAATCT TATGACAGTT
ACAGGTAAAA CAGTTGGTGA GAATATTAAA GATGCAAATG TTAAGAATTA CAATGTCATA
AGACCAATTG ACAATCCATA TTCTGAAACA GGCGGGCTTG TAATTGTGAG GGGTAACCTT
GCACCAGATG GTGCTGTTGT CAAAAAAAGT GCTGTGCCAC CAAAGCTAAT GAAGCACAGA
GGACCTGCGC GTGTGTTTGA AAGCGGTGAA GAGGTGTTTG AGGCAATCTT GAAAGGGAAA
ATCCAAAAAG GAGATGTTAT TGTCATAAGA TATGAAGGGC CAAAAGGCGG ACCTGGTATG
AGAGAGATGC TCTCTCCTAC ATCAGCACTG GCAGGAGTTG GGCTAATTGA AGATGTTGCG
CTGATAACTG ATGGAAGGTT TTCAGGTGCA ACAAGAGGTG CATGTTTTGG TCATGTATCG
CCGGAGGCAG CAGAAAGAGG ACCAATTGCA GCAGTTCAGG ATGGAGATAT GATTTCAATT
GACATAGAAA ACAAGACTCT TACGTTAGAA GTACCAGAAG AAGAAATCAA AAGAAGACTT
GAAATCTTAC CACCGTTTGA GCCAAAGGTG AAAAAAGGGT ATCTTTACAG ATACTCAAAA
CTTGTCAGGT CTGCGTCAAC TGGTGCTATA CTTGAGTAA
 
Protein sequence
MRSDTVKKGF EKAPQRSLFK AMGYTDEEIR RPLIAVVNSW NEVVPGHIHL DRIAEAVKAG 
IRLAGATPME FNVIGVCDGI AMGHIGMKYS LITRELIADS IEAMVMAHQF DGMVLIPNCD
KIVPGMLIAA ARVNIPAILI SGGPMLAGKI GDKVCDLNSV FEGVGAYSAG KISEEDLYAL
EENACPGCGS CSGMFTANTM NCLSEVLGLA LPGNGTIPAV MAARIRLAKM AGMKIVELVE
KDIKPSDILT VEAFENALAV DMALGGSTNT ILHLPAIANE VGIKLNLDII NAISDRTPNL
CKLSPAGQHH IEDLYFAGGV QAVMNELSKK GLLHLNLMTV TGKTVGENIK DANVKNYNVI
RPIDNPYSET GGLVIVRGNL APDGAVVKKS AVPPKLMKHR GPARVFESGE EVFEAILKGK
IQKGDVIVIR YEGPKGGPGM REMLSPTSAL AGVGLIEDVA LITDGRFSGA TRGACFGHVS
PEAAERGPIA AVQDGDMISI DIENKTLTLE VPEEEIKRRL EILPPFEPKV KKGYLYRYSK
LVRSASTGAI LE
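
Both sequences in this record are displayed in space-separated blocks of 10 characters, so the spacing has to be removed before the sequence can be used elsewhere. The following is a minimal sketch of that cleanup, together with a simple GC fraction calculation. The helper names are hypothetical, and the example input is only the first line of the gene sequence, so its GC fraction will not necessarily equal the 42% reported for the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from this record.
first_line = "ATGAGAAGTG ATACTGTCAA GAAAGGGTTT GAAAAGGCTC CTCAGCGTTC GCTTTTCAAA"

seq = clean_sequence(first_line)
print(len(seq))   # 60
print(seq[:3])    # ATG
print(f"{gc_fraction(seq):.2f}")
```

Passing the complete gene sequence text to `clean_sequence` in the same way should yield a string whose length can be compared against the Gene Length field.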