Gene Information |
Locus tag | PMN2A_0116 |
Symbol | aroB |
ID | 3605467 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL2A |
Kingdom | Bacteria |
Replicon accession | NC_007335 |
Strand | - |
Start bp | 670935 |
End bp | 672041 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 637686972 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_291311 |
Protein GI | 72381956 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
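The coordinate and length fields above are related by simple arithmetic. The sketch below shows one way to cross-check them, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration; they are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for PMN2A_0116
length_bp = gene_length(670935, 672041)
print(length_bp)                  # 1107
print(protein_length(length_bp))  # 368
```

Both results agree with the Gene Length and Protein Length rows of the record.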
|
Plasmid Coverage Information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAATAAAG ACAACCACCA TATTAAAGTC TCTCTAACTA ACAACCCATA CGAAATAGTT ATCGGAAAAA GTAGTCTTGA AAGCATAGGA GATGAGCTAT TTAATATTGG TTTTAGGGAA GGACTAAAAG TATTAGTCGT TTCCAACAAG GAAGTCTCTG ATCACTATGG TGATTGCATA ATCAAAAGTC TGATAAAAAG TAAATTCAAA CCAAAGCTTT TAATAATAAA AGCTGGAGAA GATCAAAAAA ATCAATCTTC TATAGATTTA ATCCACAATG CGGCATATGA AGCGAGGTTA GAGAGAGGAT CATTAATGAT TGCCCTTGGA GGTGGCGTGA TTGGAGACAT GACTGGCTTT GCAGCTGCTA CATGGTTGCG TGGAGTAAAT GTAGTCCAAA TCCCCACCAC GTTACTCGCC ATGGTTGATG CTTCTATTGG TGGTAAAACA GGGATAAATC ATTCAAAAGG TAAAAATCTT ATAGGTGCTT TTCATCAACC TAGATTGGTC TTAATAGACC CTAAAACATT AATTACTCTC CCATCACGAG AGTTCAAAGC AGGTATGGCT GAAATAATAA AGTACGGAGT TATATCAGAC TTAGAACTAT TCGAACTTCT AGAAAGGCAA GAAAATATTT CTGATCTTTC AAACATAAAA GAAAAACTAC TAATAGAAAT AATTAAGCGT TCTGCTAAAT CTAAAGCAGA AATTGTTATA AAAGATGAGA AGGAAAGTGG AGTTAGAGCA TTTTTAAATT ATGGTCACAC ATTTGGCCAC GTAATAGAAA ATCTTTGTGG TTATGGGAAA TGGCTGCATG GTGAGGCAGT TGCAATGGGT ATGGTTGCAG TTGGTCAGTT AGCGGTTCAG AGGGGACTAT GGAAAGAGGA TAACGCGAAA AGGCAGAAAC GATTAATAGA GAAAGCAGGC TTACCTTCTA ATTGGCCTCA GCTTGAGATA GAAAGTGTTC TAAGCTCACT TCAAGGAGAC AAGAAAGTTA AGAACGGCAA GGTGAGTTTC GTTATGCCCT TAAAAATTGG TGATGTAAAA TTATTTAATA ATATTTCTAA TAAAGAAATA CGTGAATGCT TGCAAAAAAT TAGCTAA
|
Protein sequence | MNKDNHHIKV SLTNNPYEIV IGKSSLESIG DELFNIGFRE GLKVLVVSNK EVSDHYGDCI IKSLIKSKFK PKLLIIKAGE DQKNQSSIDL IHNAAYEARL ERGSLMIALG GGVIGDMTGF AAATWLRGVN VVQIPTTLLA MVDASIGGKT GINHSKGKNL IGAFHQPRLV LIDPKTLITL PSREFKAGMA EIIKYGVISD LELFELLERQ ENISDLSNIK EKLLIEIIKR SAKSKAEIVI KDEKESGVRA FLNYGHTFGH VIENLCGYGK WLHGEAVAMG MVAVGQLAVQ RGLWKEDNAK RQKRLIEKAG LPSNWPQLEI ESVLSSLQGD KKVKNGKVSF VMPLKIGDVK LFNNISNKEI RECLQKIS
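The gene sequence begins with GTG, yet the protein sequence begins with M. Under translation table 11 an alternative start codon at the first position is read as methionine. The following is a minimal sketch of that rule, not the database's own pipeline. The names `translate`, `gc_percent`, and `START_CODONS` are hypothetical, and `START_CODONS` lists only a commonly used subset of the table 11 initiation codons, which is a simplifying assumption.

```python
# Standard codon assignments (shared by translation tables 1 and 11),
# laid out in TCAG order for the first, second, and third base.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: only a subset of the table 11 start codons is handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, reading a start codon at position 0 as M."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(seq.count(base) for base in "GC") / len(seq)


# First three 10-base groups of the gene sequence from the record
print(translate("GTGAATAAAG ACAACCACCA TATTAAAGTC"))  # MNKDNHHIKV
```

Passing the full Gene sequence field to `translate` and `gc_percent` gives values that can be compared against the Protein sequence and GC content rows.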