Gene GWCH70_1584 details

Gene Information

Locus tag: GWCH70_1584
Symbol:
ID: 7976235
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 1655706
End bp: 1656977
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 43%
IMG OID: 644798473
Product: threonine dehydratase
Protein accession: YP_002949645
Protein GI: 239827021
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1171] Threonine dehydratase
TIGRFAM ID: [TIGR01124] threonine ammonia-lyase, biosynthetic, long form
            [TIGR01127] threonine dehydratase, medium form
            [TIGR02079] threonine dehydratase
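
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1655706
END_BP = 1656977
GENE_LENGTH_BP = 1272
PROTEIN_LENGTH_AA = 423


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Both checks pass for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions the reported gene length (1272 bp) and protein length (423 aa) agree with the listed coordinates.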


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000000192202
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAACAAC AACTTAAACG AAAACAAGGG ACCGTTTATG TTGAAGATAT TTTAATCGCT 
TATCATACGT TAAAAGATGT TGTACACCAT ACTCCGCTGC AAAAAAATCC GTTATTATCG
GAACGTTACG AATGTAACGT GTATTTGAAG CGCGAAGATT TACAAGTCGT ACGCTCATTC
AAAATTCGTG GCGCATACAA CCGGATGAAA CATTTAAGCG AAGAAGAACG GAAAAACGGC
ATTGTCTGCG CTAGTGCAGG CAACCATGCG CAAGGAGTAG CGTATTCATG TCGGGCGCTA
GGCGTGCATG GAAAGGTGTA CATGCCGGCG ACAACGCCGA GACAAAAAGT ATCGCAAGTA
CAGCTGTTCG GAAAAGATAT GGTCGACATT GTTTTAGTAG GCGATACGTT TGATGATTCA
TTTAACGAAG CAATAGAGTG TGCCAAAAAA GAAGGACGCA CATTTATTCA TCCGTTTGAT
GATGAATATG TCATTGCCGG ACAAGGAACG ATCGGTGTAG AAGTGTTGAA CGACTGTGAA
GAGCCGATTG ATTTTGTGTT TGCAAGCATC GGCGGTGGCG GATTGATGTC GGGGATTGGT
ACATATGTGA AAAGCATTTC TCCAGCTACA AAAATTATTG GCGTGGAACC AGAGGGTGCA
CCATCGATGA AAGCTGCCCT CGAGCAAGGG CATGTTGTGA CATTAGAAGA GATTGATAAA
TTTGTGGATG GAGCGGCAGT CAAAACGGTT GGCGAAAAAA CGTATGCGCT TTGTAAAGAA
ATCATTGACG ATATTGTTGT CGTACCGGAA GGAAAAGTAT GCACAACGAT CTTAGAGCTA
TATAACGAAA ATGCAATTGT CGTTGAACCG GCGGGGGCGC TTCCGATTGC AGCACTTGAT
TTTTACAAAG ACAAAATCCG TGGGAAAACG GTTGTCTGTA TTGTCAGCGG AGGAAACAAT
GACATTGATC GAATGCAGGA AATTAAAGAG CGTTCGATGA TTTACGAAGG ACTGCAGCAT
TATTTTATCG TCAATTTTCC ACAGCGTGCC GGCGCGCTTC GCGAATTTTT AGATGAAGTA
TTAGGGCCTA CCGATGATAT TACTCGTTTT GAATACACAA AGAAAAATAA CAAAGAAAAC
GGTCCGGCGT TGGTCGGCAT TGAATTAAAA CGCCGCGAAG ATTACGAGCC GCTCATTGAG
CGCATGAAGA AAAAAGGGTT TCCGTTTCAA GAAGTGAATA AAAATCCAAA CTTATTCCAT
TTACTCATTT AG
 
Protein sequence
MEQQLKRKQG TVYVEDILIA YHTLKDVVHH TPLQKNPLLS ERYECNVYLK REDLQVVRSF 
KIRGAYNRMK HLSEEERKNG IVCASAGNHA QGVAYSCRAL GVHGKVYMPA TTPRQKVSQV
QLFGKDMVDI VLVGDTFDDS FNEAIECAKK EGRTFIHPFD DEYVIAGQGT IGVEVLNDCE
EPIDFVFASI GGGGLMSGIG TYVKSISPAT KIIGVEPEGA PSMKAALEQG HVVTLEEIDK
FVDGAAVKTV GEKTYALCKE IIDDIVVVPE GKVCTTILEL YNENAIVVEP AGALPIAALD
FYKDKIRGKT VVCIVSGGNN DIDRMQEIKE RSMIYEGLQH YFIVNFPQRA GALREFLDEV
LGPTDDITRF EYTKKNNKEN GPALVGIELK RREDYEPLIE RMKKKGFPFQ EVNKNPNLFH
LLI
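
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is not the database's own pipeline. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical. Only the first and last lines of the gene sequence are used here, and the last line is in frame because the 1260 bases before it are a multiple of three.

```python
from itertools import product

# Standard codon assignments, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence, copied from this record.
first_line = "ATGGAACAAC AACTTAAACG AAAACAAGGG ACCGTTTATG TTGAAGATAT TTTAATCGCT"
last_line = "TTACTCATTT AG"

print(translate(first_line))  # MEQQLKRKQGTVYVEDILIA
print(translate(last_line))   # LLI
```

The two outputs match the first twenty and last three residues of the protein sequence listed above.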