Gene GWCH70_2237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_2237 
Symbol 
ID7978404 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp2290355 
End bp2291548 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content44% 
IMG OID644799051 
Productbifunctional 3,4-dihydroxy-2-butanone 4-phosphate synthase/GTP cyclohydrolase II protein 
Protein accessionYP_002950211 
Protein GI239827587 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase 
TIGRFAM ID[TIGR00505] GTP cyclohydrolase II
[TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTGACA TGATCGAAGA AGCGATTTAT GAATTGATGC AAGGGAAAGT CATTATTGTA 
TGTGATGACG AGGATCGTGA AAACGAAGGA GATTTTGTCG CATTGGCGGA AAAAGCGACC
CCAGAAGTGA TTAATTTTAT GATTAAATAC GGACGCGGCC TTGTTTGCGT TCCGATTACG
GAAGAATTAG CCGATAAGCT TGATTTAGCC CCAATGGTCA ATCATAATAC GGATTCTCAT
GGCACTGCTT TTACGGTTAG CATTGATTAT AAATCAACGA CAACAGGCAT TAGCGCTTAT
GAACGTTCGA TGACGATTCA AGCGCTGTTA GATCCGAATG CGAAAGCGAG CGATTTTAAA
CGTCCTGGGC ACGTTTTTCC ACTTGTGGCG AAAAAAGGAG GCGTATTGCG GCGCGCCGGC
CATACGGAAG CGGCGGTTGA TTTAGCGCGA TTATGTGGTG CAAAGCCGGC CGGTGTGATT
TGCGAAATCA TTAAAGAGGA TGGCACGATG GCGCGTGTTT CGGATTTAAG AAAAATCGCT
GATGAATTTG ATTTGAAAAT GATCACGATT AAAGATTTAA TCGAGTATCG GAGACGAAAA
GAAAAATTAG TGAAACGCGA AGTAGAAGTG ATGCTTCCAA CAGAGTTTGG CAAGTTTAAA
GCAATTGGCT ATACAAATAT TGTTGATGGA AAAGAGCATG TTGCTTTAGT CAAAGGCGAA
ATCATTCCAG ATGAACCGAC GCTTGTTCGG GTTCATTCCG AATGCTTAAC AGGCGATGTG
TTTGGCTCCT GCCGTTGTGA TTGCGGACCG CAGCTTCATG CGGCGCTCCG CCAAATTGAA
GAAGAAGGCC GCGGCGTTTT ATTATATATG CGTCAAGAAG GCCGCGGCAT CGGGTTAATC
AACAAATTGC GCGCGTATAA GCTGCAAGAG CAAGGCTATG ATACGGTGGA AGCAAATGAA
AGGCTTGGAT TCCCTGCCGA TTTGCGTGAC TATGGGATTG GCGCGCAAAT TTTAAAAGAT
CTCGGCGTGA CGAAAATGCG ACTATTGACA AATAATCCGC GGAAAATCAC TGGATTAAAA
GGACACGGCC TTGAAGTCGT CGAGCGTGTT CCACTGCAAA TGCCGGCGAA CAAGGAAAAT
GAAAAATACT TGCGGACGAA GTATGAAAAA TTAGGACATA TGTTGCATTT TTAA
 
Protein sequence
MFDMIEEAIY ELMQGKVIIV CDDEDRENEG DFVALAEKAT PEVINFMIKY GRGLVCVPIT 
EELADKLDLA PMVNHNTDSH GTAFTVSIDY KSTTTGISAY ERSMTIQALL DPNAKASDFK
RPGHVFPLVA KKGGVLRRAG HTEAAVDLAR LCGAKPAGVI CEIIKEDGTM ARVSDLRKIA
DEFDLKMITI KDLIEYRRRK EKLVKREVEV MLPTEFGKFK AIGYTNIVDG KEHVALVKGE
IIPDEPTLVR VHSECLTGDV FGSCRCDCGP QLHAALRQIE EEGRGVLLYM RQEGRGIGLI
NKLRAYKLQE QGYDTVEANE RLGFPADLRD YGIGAQILKD LGVTKMRLLT NNPRKITGLK
GHGLEVVERV PLQMPANKEN EKYLRTKYEK LGHMLHF