Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1852 |
Symbol | |
ID | 7976473 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 1916778 |
End bp | 1917986 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644798686 |
Product | threonine dehydratase |
Protein accession | YP_002949856 |
Protein GI | 239827232 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1171] Threonine dehydratase |
TIGRFAM ID | [TIGR01127] threonine dehydratase, medium form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000042395 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTAACAT TGAACGATGT TTTAGAGGCA AGGGAAAAAA TGAAAGGGAT TGTTCATCAA ACTCCGCTTG AGCATTCCCA AACATTTACC AATCTTTCTG GCAATGAAGT ATACATGAAG CTCGAAAATT TGCAAAAGAC GGGATCGTTT AAAGTGCGGG GATCATTTAA TAAAATTATG TCGTTAAGCG AAGAGGAACG AAAGCGCGGG GTGATTGCCG CTTCGGCTGG AAACCATGCT CAAGGGGTCG CCTACTCCAG CGGCATGATC GGTATTCCAT GTACGATCGT CATGCCCAAA GGAGCGCCGT TAAGCAAAGT GGAAGCAACG AAAGGTTATG GAGCAGAAGT TATTTTACAT GGCGACGTAT TTGATGAATC GCTTGAGTAT GCGTTCGAAT TGCAGCGGCA GCGAGGAGCG ACGTTTGTTC ATCCGTTTGA TGATTTAGCG GTCATGGCAG GCCAGGGAAC GATTGGCTTA GAAATTATCG AACAGCTCCC GGATGTCGAT GTTGTCCTAT GTCCGGTTGG GGGCGGCGGA CTTCTTGCCG GCTTGGCATT TACATTAAAA CAGTTGAAAC CATCGATTCA AGTATATGGA GTCGAATCGT CTGCTTGTCC TGGAATGACG GCGGCGCTCC GTCATAAAAA GCCAATAACC ATTACTTCTT CTGATACGAT TGCCGATGGC ATCGCGGTGA AAAAACCAGG AAATATTACG TATCAATATA TTGAAAAATA TGTCGACGGC GTTGTCTGTG TCGAGGAAGC GGAAATTTCT CGAACGATGC TATATTTATT GGAACGTAAC AAGCTGCTGG TCGAAGGTTC TGCGGCGTGT CCGCTTGCGG CGCTATTGTA TCAAAAATTT CCATTTACTA GAAAAAAAGT TGTCACCATA TTAAGTGGGG GCAATGTCGA TGTCACCCTC ATTTCACGAA TCATTGAACG CGGGCTCGTC GAATCTGGAC GGTTTGTGAC ATTTACGACT GTTATCTCCG ATAAGCCTGG CCAGCTCAAT AAACTGTTGC GAATTATCGC CGAATTGGAA GCAAACGTTA TGTCGATTCA CCATCAGCGC ATCGGAGCAA AAGTGTTGCC GGGGCAGGCA GAAATTCATT TCTCTCTTGA AACAAAAAAT CAAGATCATA TCCAGCAAAT TCATCAAGTA TTATTGAAAG AAGGATATGA CGTCGAGTTT CTCCAATAA
|
Protein sequence | MLTLNDVLEA REKMKGIVHQ TPLEHSQTFT NLSGNEVYMK LENLQKTGSF KVRGSFNKIM SLSEEERKRG VIAASAGNHA QGVAYSSGMI GIPCTIVMPK GAPLSKVEAT KGYGAEVILH GDVFDESLEY AFELQRQRGA TFVHPFDDLA VMAGQGTIGL EIIEQLPDVD VVLCPVGGGG LLAGLAFTLK QLKPSIQVYG VESSACPGMT AALRHKKPIT ITSSDTIADG IAVKKPGNIT YQYIEKYVDG VVCVEEAEIS RTMLYLLERN KLLVEGSAAC PLAALLYQKF PFTRKKVVTI LSGGNVDVTL ISRIIERGLV ESGRFVTFTT VISDKPGQLN KLLRIIAELE ANVMSIHHQR IGAKVLPGQA EIHFSLETKN QDHIQQIHQV LLKEGYDVEF LQ
|
| |