Gene Information |
Locus tag | GM21_2660 |
Symbol | aroB |
ID | 8138002 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3095978 |
End bp | 3097066 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644870264 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_003022454 |
Protein GI | 253701265 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
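The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it agrees with the listed Gene Length) and that the final codon is a stop codon that is not counted in the Protein Length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3095978
end_bp = 3097066

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Split the gene into codons; a complete CDS should leave no remainder.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is the stop codon and encodes no residue.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1089 0 362
```

Both results agree with the Gene Length (1089 bp) and Protein Length (362 aa) fields.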
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 0.000170461 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | GTGATAGCTG AAAAGATCAG GGTCGCGCTC GACGAACGGA GCTACGACAT CGAAATGGGC GCCGGCAATC TCGACAGAAT CGGTTCCCTT TGCCGCGAAG TCGGTCTCTC CGGAACGGCG GCGGTGGTCA GCAACACCAC CGTGGCCCCT CTCTACTACG AAACGGTCCG CCTCTCCATG GAGCGTGCAG GTTATCGGGT GGTGCCGGTA ACTCTCCCGG ACGGCGAGGG GTACAAGAAC AGCGCCACGC TCAACCTGAT CTACGACGGC CTGGTCGACG CCTCGCTGGA CCGCGGCTCC TTCATCCTGG CCTTGGGCGG AGGGGTGATC GGCGACATGG CCGGGTTCGC CGCCGCTAGT TACCTGCGCG GCATTCCCTT CGTCCAGATC CCCACTACGC TCCTCTCCCA GGTCGACTCC AGTGTCGGCG GCAAGACCGG CATCAACCAT CCCCGCGGCA AGAACCTGAT CGGCTCCTTC TACCAGCCGA AAGCGGTACT CATCGACGTC GCCACACTCG ATACCCTCCC GGAAAGAGAG TTCCTGAGCG GCCTGGGAGA GATCGTCAAG TACGGTGCGG TGCTGGACGG CGGCTTTTTC GACTTCCTGG AACAAAACGC GAAACTGCTA TTGGCCCGCG ACAAGGAGGC CCTGATCCAG GCGGTCAGCC GCAGCTGCGC CATCAAGGCG AAGGTCGTGG CGGAGGACGA ACGGGAAGGG GGGGTGCGCG CTGTGCTGAA CTTCGGGCAC ACCCTGGGGC ACGCCGTGGA GACCCTTACC GGTTACACCC GCTACCTGCA CGGTGAGGCG GTAGCTATCG GCATGGTACA GGCGGCGCGG ATCTCCCAGC ACTACGGGTT CTGCTCACAG GCGGACCGGG AGCGCATCGA GGCTCTCATC GTGGCGCTTG GGCTGCCGAT AGAGCTTCCT ATCTTCCCCG CCCAGCAGTA CAGGGAGGCG CTCTCGCACG ACAAGAAGGT ACGCGACAAG GGGCTCTTGT TCATCTGCAA CCAGGGGATA GGCGCCTACC GCATGGAAAG GCTCACAGAC CTTGGGGCGC TTCTGGAGAT CTGTGGCATA GGAGAATGA
Protein sequence | MIAEKIRVAL DERSYDIEMG AGNLDRIGSL CREVGLSGTA AVVSNTTVAP LYYETVRLSM ERAGYRVVPV TLPDGEGYKN SATLNLIYDG LVDASLDRGS FILALGGGVI GDMAGFAAAS YLRGIPFVQI PTTLLSQVDS SVGGKTGINH PRGKNLIGSF YQPKAVLIDV ATLDTLPERE FLSGLGEIVK YGAVLDGGFF DFLEQNAKLL LARDKEALIQ AVSRSCAIKA KVVAEDEREG GVRAVLNFGH TLGHAVETLT GYTRYLHGEA VAIGMVQAAR ISQHYGFCSQ ADRERIEALI VALGLPIELP IFPAQQYREA LSHDKKVRDK GLLFICNQGI GAYRMERLTD LGALLEICGI GE
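The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It builds the codon table from the NCBI base ordering (T, C, A, G); translation table 11 uses the same amino acid assignments as the standard code but permits alternative start codons such as GTG, which is why this gene begins with GTG while the protein begins with M. The function name `translate_cds` and the `START_CODONS` set are hypothetical helpers written for this illustration, and the set lists only three common starts, not the full table 11 list.

```python
from itertools import product

# Codon table in NCBI order: first base varies slowest, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip(("".join(c) for c in product(BASES, repeat=3)), AMINO_ACIDS)
)

# Simplified assumption: a subset of the start codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        residue = CODON_TABLE[codon]
        # A start codon in the first position is read as methionine.
        if i == 0 and codon in START_CODONS:
            residue = "M"
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGATAGCTG AAAAGATCAG GGTCGCGCTC"))  # MIAEKIRVAL
# Last 9 bases of the gene sequence, ending in the TGA stop codon.
print(translate_cds("GGAGAATGA"))  # GE
```

The two outputs match the first ten residues (MIAEKIRVAL) and the last two residues (GE) of the protein sequence listed above.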