Gene GWCH70_2135 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_2135 
Symbol 
ID7976946 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp2201338 
End bp2202624 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content47% 
IMG OID644798951 
Product3-phosphoshikimate 1-carboxyvinyltransferase 
Protein accessionYP_002950111 
Protein GI239827487 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0128] 5-enolpyruvylshikimate-3-phosphate synthase 
TIGRFAM ID[TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.014689 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCAACCGT TGCGAACCAA TATTTCATCA TTACGAGGAA CCATCAATGT TCCAGGGGAT 
AAATCGATTT CCCATCGCGC TGTGATGCTT GGAGCAATTG CTAATGGAAC AACGACCATC
GCGAATTTTT TACAGGGAGA AGATTGTTTA AGTACGATCG ATTGTTTTCG AAAATTGGGA
GTGTCGATTG AGCAAAACGG AAGCGATGTT GTTGTCGAAG GAAAGGGATT AAAAGGTCTT
AAGGAGCCAT CTGACATTTT AAATGTTGGC AATTCCGGGA CAACGGCAAG ATTATTGCTC
GGGATTCTAG CGGGATGTCC GTTCCATTCT TGCTTAATTG GCGATGAATC GATCGCCAAG
CGGCCGATGG GTAGAGTGAC AAAGCCGCTA AAAATGATGG GTGCGCACAT TGACGGCCGC
GAGCATGGGA ACTATACCCC GTTATCCATT CGCGGCGGCG AACTTCAGCC CATTCATTAC
GAGTCTTCTG TCGCGAGCGC ACAAGTGAAG TCGGCGATTT TATTGGCGGG ATTGACAACA
AATGGAACTA CGACAGTAAC GGAACCTCAT CGTTCTCGCG ATCATACCGA ACGAATGATT
CGGTTGTTCG GTGGAAGCGT AACAGTGGAC GACCTTACAG TTTCTATTAC CGGACCGCAG
CAGCTAATAG GCGCAAATAT ATACGTTCCG GGAGATATTT CGTCGGCAGC CTTTTTCTTA
GTAGCTGGCG CAATTGTACC AAACAGCGAA ATTACGTTAA AAAATGTCGG GCTCAATCCG
ACAAGAACGG GAATTATCGA TGTGCTGCAA AAAATGGGTG CGGAAATGAC GATCGAAAAC
ATTCGTAACG AGCAAACAGA ACCGCTTGGC GATATTACCA TTCGCACCTC CAATTTAACA
GCGACGGAAA TCAGCGGCGC TCTTATTCCG CGATTAATCG ACGAAATCCC GATCATTGCC
TTGCTTGCAA CGCAGGCGGA AGGTACGACC GTTATTAAAG ATGCGAGCGA ATTGAAAGTG
AAGGAAACGA ATCGAATTGA TACGGTTGTG ACAGAGCTGC GAAAACTTGG CGCGGATATT
AAAGCGACAG CTGATGGCAT GGTCATTCAT GGAAAATCAG CGTTAAAGGC AAAGGACGTT
GTCGTTGATA GCTACGGTGA TCACCGTATT GGCATGATGC TAGCGATTGC TGCCTGCATT
ACGCAAGGAA CTGTCTGTTT AAAACGTCCA GAAGCGGTGG CAGTCTCTTA TCCATCGTTT
TTTGATCATC TTCATTCCTT AATGTAG
 
Protein sequence
MQPLRTNISS LRGTINVPGD KSISHRAVML GAIANGTTTI ANFLQGEDCL STIDCFRKLG 
VSIEQNGSDV VVEGKGLKGL KEPSDILNVG NSGTTARLLL GILAGCPFHS CLIGDESIAK
RPMGRVTKPL KMMGAHIDGR EHGNYTPLSI RGGELQPIHY ESSVASAQVK SAILLAGLTT
NGTTTVTEPH RSRDHTERMI RLFGGSVTVD DLTVSITGPQ QLIGANIYVP GDISSAAFFL
VAGAIVPNSE ITLKNVGLNP TRTGIIDVLQ KMGAEMTIEN IRNEQTEPLG DITIRTSNLT
ATEISGALIP RLIDEIPIIA LLATQAEGTT VIKDASELKV KETNRIDTVV TELRKLGADI
KATADGMVIH GKSALKAKDV VVDSYGDHRI GMMLAIAACI TQGTVCLKRP EAVAVSYPSF
FDHLHSLM