Gene GWCH70_1852 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_1852 
Symbol 
ID7976473 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp1916778 
End bp1917986 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content45% 
IMG OID644798686 
Productthreonine dehydratase 
Protein accessionYP_002949856 
Protein GI239827232 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1171] Threonine dehydratase 
TIGRFAM ID[TIGR01127] threonine dehydratase, medium form 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000042395 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAACAT TGAACGATGT TTTAGAGGCA AGGGAAAAAA TGAAAGGGAT TGTTCATCAA 
ACTCCGCTTG AGCATTCCCA AACATTTACC AATCTTTCTG GCAATGAAGT ATACATGAAG
CTCGAAAATT TGCAAAAGAC GGGATCGTTT AAAGTGCGGG GATCATTTAA TAAAATTATG
TCGTTAAGCG AAGAGGAACG AAAGCGCGGG GTGATTGCCG CTTCGGCTGG AAACCATGCT
CAAGGGGTCG CCTACTCCAG CGGCATGATC GGTATTCCAT GTACGATCGT CATGCCCAAA
GGAGCGCCGT TAAGCAAAGT GGAAGCAACG AAAGGTTATG GAGCAGAAGT TATTTTACAT
GGCGACGTAT TTGATGAATC GCTTGAGTAT GCGTTCGAAT TGCAGCGGCA GCGAGGAGCG
ACGTTTGTTC ATCCGTTTGA TGATTTAGCG GTCATGGCAG GCCAGGGAAC GATTGGCTTA
GAAATTATCG AACAGCTCCC GGATGTCGAT GTTGTCCTAT GTCCGGTTGG GGGCGGCGGA
CTTCTTGCCG GCTTGGCATT TACATTAAAA CAGTTGAAAC CATCGATTCA AGTATATGGA
GTCGAATCGT CTGCTTGTCC TGGAATGACG GCGGCGCTCC GTCATAAAAA GCCAATAACC
ATTACTTCTT CTGATACGAT TGCCGATGGC ATCGCGGTGA AAAAACCAGG AAATATTACG
TATCAATATA TTGAAAAATA TGTCGACGGC GTTGTCTGTG TCGAGGAAGC GGAAATTTCT
CGAACGATGC TATATTTATT GGAACGTAAC AAGCTGCTGG TCGAAGGTTC TGCGGCGTGT
CCGCTTGCGG CGCTATTGTA TCAAAAATTT CCATTTACTA GAAAAAAAGT TGTCACCATA
TTAAGTGGGG GCAATGTCGA TGTCACCCTC ATTTCACGAA TCATTGAACG CGGGCTCGTC
GAATCTGGAC GGTTTGTGAC ATTTACGACT GTTATCTCCG ATAAGCCTGG CCAGCTCAAT
AAACTGTTGC GAATTATCGC CGAATTGGAA GCAAACGTTA TGTCGATTCA CCATCAGCGC
ATCGGAGCAA AAGTGTTGCC GGGGCAGGCA GAAATTCATT TCTCTCTTGA AACAAAAAAT
CAAGATCATA TCCAGCAAAT TCATCAAGTA TTATTGAAAG AAGGATATGA CGTCGAGTTT
CTCCAATAA
 
Protein sequence
MLTLNDVLEA REKMKGIVHQ TPLEHSQTFT NLSGNEVYMK LENLQKTGSF KVRGSFNKIM 
SLSEEERKRG VIAASAGNHA QGVAYSSGMI GIPCTIVMPK GAPLSKVEAT KGYGAEVILH
GDVFDESLEY AFELQRQRGA TFVHPFDDLA VMAGQGTIGL EIIEQLPDVD VVLCPVGGGG
LLAGLAFTLK QLKPSIQVYG VESSACPGMT AALRHKKPIT ITSSDTIADG IAVKKPGNIT
YQYIEKYVDG VVCVEEAEIS RTMLYLLERN KLLVEGSAAC PLAALLYQKF PFTRKKVVTI
LSGGNVDVTL ISRIIERGLV ESGRFVTFTT VISDKPGQLN KLLRIIAELE ANVMSIHHQR
IGAKVLPGQA EIHFSLETKN QDHIQQIHQV LLKEGYDVEF LQ