Gene Information |
Locus tag | A9601_07371 |
Symbol | aroB |
ID | 4717442 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 656255 |
End bp | 657346 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 640078451 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_001009130 |
Protein GI | 123968272 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
|
|
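The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon; neither convention is stated in the record, and the function name `lengths_from_coordinates` is hypothetical.

```python
def lengths_from_coordinates(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is not counted in the protein length (both are assumptions).
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    return gene_length, codons - 1  # drop the stop codon


if __name__ == "__main__":
    # Start bp and End bp taken from the record above.
    print(lengths_from_coordinates(656255, 657346))  # (1092, 363)
```

Under these assumptions the result agrees with the Gene Length (1092 bp) and Protein Length (363 aa) fields.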
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAATAAGA GAAAAATATT AGTCCCATTA GGTGATAAGT CATACGAAGT AACTCTAGAA GCAGGGATAC TGAATAACAT TAGCGAAGAA CTCTTAAAAA TTGGAATAAC AAAGAAAAGA AAAATACTTG TGATTTCAAA TGAAGAAATA TCAAATTTGT ATGGTGAGAA ATTCTTAAAT AATTTAAAAG ATAATAAATT TCAGGCCAAA ATGTTCCTTA TCAAGGCTGG AGAATCATAT AAAAACTTAA AAACCTTAAG TGAAATATAT GATGTAGCAT TTGAATTTGG CTTAGATAGA AATTCAATAA TTATTGCCCT TGGAGGAGGA ATTGTTGGAG ATGTAAGTGG TTTTGCAGCT GCTACTTGGC TTAGAGGTAT CGAATATATT CAGATTCCAA CAACATTATT ATCAATGGTT GATTCATCTG TGGGAGGAAA AACAGGAGTA AATCATCCAA AAGGTAAGAA TTTAATTGGA GCTTTCAATC AACCTAAAGC AGTTTTTATT GATCCAGAAA CTTTAAAAAG TTTGCCCAAA AGAGAATTTA GTGCAGGCAT GGCTGAAGTA ATAAAATACG GAGTAATAAG AGATAAAGAA CTTTTCGAAT ACTTAGAAAT TGAAAAAAAC AAAAATGAAC TTATAAATCT CAAAAATGAA TATTTAATTA AAATAATTAA TAGTTCAATT AAAACAAAGT CTAATGTTGT TTCTCAAGAC GAACATGAAA ATGGTGTTAG AGCAATATTG AATTATGGTC ATTCTTTTGG TCACGTTATT GAAAATTTAT GTGGATACGG CAAATTTCTG CATGGTGAGG CAATATCAAT TGGTATGAAT ATTGCGGGGA AAATAGCAAT TGAAAAAGGG TTATGGTCTA AAGAAGAATT AGAGAGACAG CGAATTCTCT TAGAGAGTTA TGATCTTCCT ACCGAGATCC CCAAAATAAA TAAAGAAGAC GTTCTAACAA TACTTATGGG TGATAAAAAA GTTCGTGATG GCAAAATGAG ATTTATATTA CCGAAAGAAA TTGGTGCTGT TGATATATAT GATGACGTGG AAGATTCATT ATTTTTAAAG TTTTTTTCTT AA
|
Protein sequence | MNKRKILVPL GDKSYEVTLE AGILNNISEE LLKIGITKKR KILVISNEEI SNLYGEKFLN NLKDNKFQAK MFLIKAGESY KNLKTLSEIY DVAFEFGLDR NSIIIALGGG IVGDVSGFAA ATWLRGIEYI QIPTTLLSMV DSSVGGKTGV NHPKGKNLIG AFNQPKAVFI DPETLKSLPK REFSAGMAEV IKYGVIRDKE LFEYLEIEKN KNELINLKNE YLIKIINSSI KTKSNVVSQD EHENGVRAIL NYGHSFGHVI ENLCGYGKFL HGEAISIGMN IAGKIAIEKG LWSKEELERQ RILLESYDLP TEIPKINKED VLTILMGDKK VRDGKMRFIL PKEIGAVDIY DDVEDSLFLK FFS
|
| |
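The GC content and protein sequence fields can in principle be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: the helper names `clean`, `gc_content`, and `translate` are hypothetical, the codon table is the standard amino-acid assignment shared by translation tables 1 and 11, and the initiator set is limited to three common bacterial start codons (table 11 lists a few more). It assumes the sequence is given 5' to 3' on the coding strand, as it appears to be here.

```python
from itertools import product

# Standard amino-acid assignments (shared by NCBI tables 1 and 11),
# listed in TCAG order for the first, second, and third codon base.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}

# Assumption: only the most common bacterial initiators are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    bases = clean(seq)
    return (bases.count("G") + bases.count("C")) / len(bases)


def translate(seq: str) -> str:
    """Translate a CDS, reading an initiator codon as M and stopping at a stop."""
    bases = clean(seq)
    protein = []
    for i in range(0, len(bases) - 2, 3):
        codon = bases[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # alternative initiators still encode Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence in the record.
    print(translate("GTGAATAAGA GAAAAATATT AGTCCCATTA"))  # MNKRKILVPL
```

The first 30 bases translate to `MNKRKILVPL`, which matches the first block of the protein sequence; the leading `GTG` is read as methionine because it serves as the initiator. Passing the full gene sequence to `gc_content` and `translate` allows a comparison with the GC content and Protein sequence fields.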