Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_07971 |
Symbol | aroB |
ID | 5730097 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 702009 |
End bp | 703121 |
Gene Length | 1113 bp |
Protein Length | 370 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641285161 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_001550682 |
Protein GI | 159903338 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.277449 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00421114 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAACCAAA ACTCAATCCG AATAAAAATA AAATTAGCTC ACAACCCATA TGAAGTTGTA ATTAAGAAAA ATGGTTTAGC GCGAATAGGC GAAGAGTTAA AAAAAATAGG CTTCAAGAAA GCAACAAAAG TTCTTGTAGT TACCAATAAA GATGTTTCAG TCCACTATGG AAAAGAGTTT ATTCACAACC TAAGTGACAA TGGCTTCAAC CCAACCTTAA TTGAGATAAA AGCAGGGGAA GAAAGAAAGA ATCTCGCAAC CATATCTGAT ATTCACAATG CTGCTTACAC ATCAAGACTT GAAAGGGGTT CATTAATGAT TGCCCTTGGG GGTGGAGTTA TTGGCGATAT GACAGGCTTT GCAGCTGCTA CCTGGCTAAG AGGAGTTTCT TTTGTACAAG TTCCTACAAC TTTACTAGCC ATGGTTGATG CCTCTGTTGG AGGCAAAACG GGAGTCAACC ATCCCAAAGG GAAGAACCTA ATAGGTGCGT TTCATCAGCC AAAACTAGTT CTGATTGATC CAATAACATT AAAAACCCTG CCCGAACGTG AATTCAAAGC TGGGATGGCA GAAGTCATCA AATATGGTGT TATTAGTGAC AAAAAGTTAT TCCGGAAACT GGAAGATGCA CCAAGACTTG ACAAGCTAGA AACACTTACT GACAGATTTT TATTGGAGAT AATCCAAAGG TCCGTTCAAA CTAAAGCACA TATCGTAGAA CTAGATGAGC GAGAGGGTGG CATACGAGCT GTACTTAATT ATGGTCATAC ATTTGGACAT GCGATTGAAG CTTTATGTGG CTATGGTACA TGGCTTCACG GTGAAGCTGT TTCTATGGGC ATGATCGCCA TAGGTCAACT AGCTTTAGAG CGAAACATAT GGAATATTAG CGACCTAGAA AGACAACGTA AGGTTCTGTG TCAAGCAGGG TTGCCTACAA TTTGGCCAAG GGTTTGTGCT GAAGATGTTA TAGAAATACT TAAAAGTGAT AAAAAAGTTA AAGATGGTGA GATCAACTTT ATCGTTCCAA CTGAAATTGG GAAAGTAGAA ATTATTAAAA ATTTTACCGT CAATGAAATC AAACAAGCAC TTCAGAAGTT AGCATCCAAA TAA
|
Protein sequence | MNQNSIRIKI KLAHNPYEVV IKKNGLARIG EELKKIGFKK ATKVLVVTNK DVSVHYGKEF IHNLSDNGFN PTLIEIKAGE ERKNLATISD IHNAAYTSRL ERGSLMIALG GGVIGDMTGF AAATWLRGVS FVQVPTTLLA MVDASVGGKT GVNHPKGKNL IGAFHQPKLV LIDPITLKTL PEREFKAGMA EVIKYGVISD KKLFRKLEDA PRLDKLETLT DRFLLEIIQR SVQTKAHIVE LDEREGGIRA VLNYGHTFGH AIEALCGYGT WLHGEAVSMG MIAIGQLALE RNIWNISDLE RQRKVLCQAG LPTIWPRVCA EDVIEILKSD KKVKDGEINF IVPTEIGKVE IIKNFTVNEI KQALQKLASK
|
| |