Gene Athe_0562 details


Gene Information

Locus tag: Athe_0562
Symbol:
ID: 7408688
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 634842
End bp: 636050
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 35%
IMG OID: 643714945
Product: 3,4-dihydroxy-2-butanone 4-phosphate synthase
Protein accession: YP_002572461
Protein GI: 222528579
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase
TIGRFAM ID: [TIGR00505] GTP cyclohydrolase II
            [TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase
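
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch illustrates. It assumes the start and end positions are 1-based and inclusive (an assumption that is consistent with the listed gene length) and that the CDS ends in a single stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
START_BP = 634842
END_BP = 636050


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1209 402
```

Both results agree with the Gene Length and Protein Length fields of the record.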


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTATGAGT TAAAAGATGT AATAGAAAAT CTTAAAAATG GAGAATTTGT CATAGTATTT 
GACAGCCAAA ACCGTGAAGA TGAGGCAGAT TTAATTCTGC CCGCTCAGTT TTCAACACCT
GAAAAGATAA GTTTTGTTTT GAATCACGCA AAAGGAATGT TCTGTGTTGC GATAGATAAG
GAAATTCAAA GGCGCTTGAA TTTGTATGTT CCTTACAATA CTCAAAACAC CTGTACCTTT
ACAGTTACTG TTGACCACAA AGATACAAAA ACAGGTATAA CTGCTAAAGA GAGAAGCAAA
ACATGCAAAG AACTTGCAAA CCCAAAAGCA GTGCCATCTG ATTTTAAAAT CCCTGGTCAT
GTAAATCCTG TTGTTGCGCA TGAAGGGGGG GTTTTGATAA GAAAAGGGCA CACAGAGGCA
GCTGTAGAAC TTTGTAGAAT CTCAAAGCTT TTTCCTGCTG CGGTGATGAT TGAAATCTTA
GACGAAAAAG GAGATAGTCA CAACAAAGAG TATGTAAAAA GCTTAGCAAA GAAATTTTCA
ATACCCATTA CTACAATTGA TGAGATTGAA AAATATATTC TACTTAGCCA GCCTACAGTG
CAAAAAGAAG CTGAAGCAGA GCTTCCAACA CAGTATGGAA AGTTTAAGAT TTTTGCTTTT
AAAAATCATT TTTCTCAAAA AGAACATGCT GTGCTGATAA ACCAAACTTT TAACCCAAGC
CAGCCTGTAA ATGTCCGAAT ACACTCATCG TGTAAAACAG GAGATATTTT TCACAGTTTA
AGGTGTGACT GCCATCAGCA ACTTGAATTT TTCTTGCAGT TTATGGCAGA GAACAAAAAC
TGCATGCTCA TTTACCTTGA CCAGGAAGGA AGAGGAATTG GATTTGCCAA TAAGATTAAA
GCATATGCAT ATCAAGAGCA AGGCTTTGAT ACCTATGAGG CAAACAATTT ACTTGGCTTT
GACGATGATT TGAGAGATTA CTTCGATGCT CTGCATATTT TAAGGTTCTT TGGTATAAGC
AAAATTAATC TTGCAACATC CAATCCTGAA AAAATTTCTT TTTTGCGGGA TTGCGGAATT
GAAATTCTGA AGAGGATAGC TATTCCGGTT TCTGTAAATC CATATAACAG GAAATATATA
CATTCTAAAA TGTTAAAGAA AAAGCATGAA ATCTTAATAA AAGGAGAGGA TAGCTTTGAG
AACTTTTGA
 
Protein sequence
MYELKDVIEN LKNGEFVIVF DSQNREDEAD LILPAQFSTP EKISFVLNHA KGMFCVAIDK 
EIQRRLNLYV PYNTQNTCTF TVTVDHKDTK TGITAKERSK TCKELANPKA VPSDFKIPGH
VNPVVAHEGG VLIRKGHTEA AVELCRISKL FPAAVMIEIL DEKGDSHNKE YVKSLAKKFS
IPITTIDEIE KYILLSQPTV QKEAEAELPT QYGKFKIFAF KNHFSQKEHA VLINQTFNPS
QPVNVRIHSS CKTGDIFHSL RCDCHQQLEF FLQFMAENKN CMLIYLDQEG RGIGFANKIK
AYAYQEQGFD TYEANNLLGF DDDLRDYFDA LHILRFFGIS KINLATSNPE KISFLRDCGI
EILKRIAIPV SVNPYNRKYI HSKMLKKKHE ILIKGEDSFE NF
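
The gene and protein sequences above can be cross-checked by translating the DNA. The following is a minimal Python sketch, not the database's own method. It builds the codon table from the compact NCBI-style encoding; the amino-acid assignments of translation table 11 are the same as those of the standard code (the tables differ in the start codons they permit, which this sketch does not model). The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG ('*' = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 9 bases of the gene sequence listed above.
print(translate("ATGTATGAGT TAAAAGATGT AATAGAAAAT"))  # MYELKDVIEN
print(translate("AACTTTTGA"))                         # NF
```

The two fragments translate to the first ten and last two residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_content` allows the remaining residues and the GC content field to be checked in the same way.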