Gene GWCH70_2145 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_2145 
SymbolaroB 
ID7976955 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp2211386 
End bp2212513 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content47% 
IMG OID644798961 
Product3-dehydroquinate synthase 
Protein accessionYP_002950121 
Protein GI239827497 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000281313 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACAAA TCGTTATTGA AACGAAAACA AAACAGTACC CGTTATTTCT CGGTGAAGGG 
ATCATTGAGT CTCTTCCGGA CATTCTCCGG CAATTGTCTT TTTCCAAAGG GACGAAATTG
CTTATTATCA CCGATAAAAC GTTGGAACAA TTGTATTTAT CGAGACTTTG CGCATTGCTT
GCCAATGATT ATGATGTGTA CACATATGTC ATACCGAGCG GGGAAGAGGC GAAATCATTT
GAGCAATACT ATGCATGTCA AACCGCTGCG CTTCAATACG GGCTTGACCG CAAATCGCTT
ATTCTTGCCT TTGGCGGCGG CGTTGTCGGC GATTTAGCTG GATTTGTCGC TGCCACTTAT
ATGCGCGGCA TCCCATACAT TCAAATCCCG ACGACGCTTC TTGCGCATGA CAGCGCCGTT
GGCGGCAAGG TGGCGATCAA TCATCCGCTT GGAAAAAACA TGATCGGAGC GTTTTACCAG
CCGGAAGCGG TCGTTTATGA TATTGCTTTT TTGCGTTCCT TGCCGGAAAA AGAATTGCGC
TCCGGTTTTG CCGAAGTAAT TAAGCACGCG CTTATTCGCG ACCGCGATTT TTATCAATGG
CTGCGGCAAG AAATCCGCGA GCTTGCGGAC TTAAAAGGGG AGCGATTGCA ATATTGCATT
AAAAAAGGAA TTGAAGTAAA GGCAAGCGTC GTGCGGGAAG ATGAAAAAGA AACTGGCGTT
CGCGCGCATT TAAATTTTGG GCATACGCTT GGGCATGCGC TTGAGAATGA ACTTGGCTAT
GGAGCGATGA CGCATGGCGA TGCGGTGGCG CTTGGGATGC TCTTTGCGAT TTTTGTAAGC
GAGCGGGTGT ATAACATATC GCTGGATTAC GATCGTTTTT CTTCTTGGTT TCGTACATAT
GGATTCCCTG TTTCCATTCC GAAACAACTA AATATAAACC GTCTCCTTGA AAAAATGAAA
GGGGATAAAA AAGCAAGAGC AGGAACGGTC CGCATGGTGC TTTTGAAAGA CATTGGCATG
GCAGAAATAA AACCGCTCGA TGATGAAACG CTGCTGGCAT TGCTTCGCAA ATTTCAGCGG
GAGGAGGGAG AGAATGATCC GCGGAATTCG AGGTGCCATT ACTGTTGA
 
Protein sequence
MEQIVIETKT KQYPLFLGEG IIESLPDILR QLSFSKGTKL LIITDKTLEQ LYLSRLCALL 
ANDYDVYTYV IPSGEEAKSF EQYYACQTAA LQYGLDRKSL ILAFGGGVVG DLAGFVAATY
MRGIPYIQIP TTLLAHDSAV GGKVAINHPL GKNMIGAFYQ PEAVVYDIAF LRSLPEKELR
SGFAEVIKHA LIRDRDFYQW LRQEIRELAD LKGERLQYCI KKGIEVKASV VREDEKETGV
RAHLNFGHTL GHALENELGY GAMTHGDAVA LGMLFAIFVS ERVYNISLDY DRFSSWFRTY
GFPVSIPKQL NINRLLEKMK GDKKARAGTV RMVLLKDIGM AEIKPLDDET LLALLRKFQR
EEGENDPRNS RCHYC