Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_2145 |
Symbol | aroB |
ID | 7976955 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2211386 |
End bp | 2212513 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644798961 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_002950121 |
Protein GI | 239827497 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0000281313 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACAAA TCGTTATTGA AACGAAAACA AAACAGTACC CGTTATTTCT CGGTGAAGGG ATCATTGAGT CTCTTCCGGA CATTCTCCGG CAATTGTCTT TTTCCAAAGG GACGAAATTG CTTATTATCA CCGATAAAAC GTTGGAACAA TTGTATTTAT CGAGACTTTG CGCATTGCTT GCCAATGATT ATGATGTGTA CACATATGTC ATACCGAGCG GGGAAGAGGC GAAATCATTT GAGCAATACT ATGCATGTCA AACCGCTGCG CTTCAATACG GGCTTGACCG CAAATCGCTT ATTCTTGCCT TTGGCGGCGG CGTTGTCGGC GATTTAGCTG GATTTGTCGC TGCCACTTAT ATGCGCGGCA TCCCATACAT TCAAATCCCG ACGACGCTTC TTGCGCATGA CAGCGCCGTT GGCGGCAAGG TGGCGATCAA TCATCCGCTT GGAAAAAACA TGATCGGAGC GTTTTACCAG CCGGAAGCGG TCGTTTATGA TATTGCTTTT TTGCGTTCCT TGCCGGAAAA AGAATTGCGC TCCGGTTTTG CCGAAGTAAT TAAGCACGCG CTTATTCGCG ACCGCGATTT TTATCAATGG CTGCGGCAAG AAATCCGCGA GCTTGCGGAC TTAAAAGGGG AGCGATTGCA ATATTGCATT AAAAAAGGAA TTGAAGTAAA GGCAAGCGTC GTGCGGGAAG ATGAAAAAGA AACTGGCGTT CGCGCGCATT TAAATTTTGG GCATACGCTT GGGCATGCGC TTGAGAATGA ACTTGGCTAT GGAGCGATGA CGCATGGCGA TGCGGTGGCG CTTGGGATGC TCTTTGCGAT TTTTGTAAGC GAGCGGGTGT ATAACATATC GCTGGATTAC GATCGTTTTT CTTCTTGGTT TCGTACATAT GGATTCCCTG TTTCCATTCC GAAACAACTA AATATAAACC GTCTCCTTGA AAAAATGAAA GGGGATAAAA AAGCAAGAGC AGGAACGGTC CGCATGGTGC TTTTGAAAGA CATTGGCATG GCAGAAATAA AACCGCTCGA TGATGAAACG CTGCTGGCAT TGCTTCGCAA ATTTCAGCGG GAGGAGGGAG AGAATGATCC GCGGAATTCG AGGTGCCATT ACTGTTGA
|
Protein sequence | MEQIVIETKT KQYPLFLGEG IIESLPDILR QLSFSKGTKL LIITDKTLEQ LYLSRLCALL ANDYDVYTYV IPSGEEAKSF EQYYACQTAA LQYGLDRKSL ILAFGGGVVG DLAGFVAATY MRGIPYIQIP TTLLAHDSAV GGKVAINHPL GKNMIGAFYQ PEAVVYDIAF LRSLPEKELR SGFAEVIKHA LIRDRDFYQW LRQEIRELAD LKGERLQYCI KKGIEVKASV VREDEKETGV RAHLNFGHTL GHALENELGY GAMTHGDAVA LGMLFAIFVS ERVYNISLDY DRFSSWFRTY GFPVSIPKQL NINRLLEKMK GDKKARAGTV RMVLLKDIGM AEIKPLDDET LLALLRKFQR EEGENDPRNS RCHYC
|
| |