Gene Information |
Locus tag | P9301_07351 |
Symbol | aroB |
ID | 4912439 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | - |
Start bp | 655091 |
End bp | 656182 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 640160317 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_001090959 |
Protein GI | 126696073 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
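The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 655091
end_bp = 656182

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1092
print(protein_length)  # 363
```

Both results agree with the Gene Length (1092 bp) and Protein Length (363 aa) fields.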
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.374407 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTGAATAAGA GAAAAATATT AGTCCCATTA GGTGATAAGT CATATGAAGT AACTCTAGAA GCAGGGATAC TGAATAATAT CAGCGAAGAA CTTTTAAAAA TTGGAATAAC AAAGAATAGA AAAATACTTG TGATTTCAAA TGAAGAAATA TCAAACTTAT ATGGAGAAAA ATTTTTAAAT AATTTAAAAG ATAATAAATT TCAGGCCAAA ATGGTCCTTA TCAAGGCTGG AGAATCATAT AAAAACTTAA AAACCTTAAG TGAAATATAT GATGTAGCAT TTGAATTTGG CTTAGATAGA AATTCAATAA TTATTGCCCT TGGAGGAGGA ATTGTTGGAG ATGTAAGTGG TTTTGCAGCT GCGACTTGGC TGAGAGGTAT CGAATATATT CAGATTCCAA CAACATTATT ATCAATGGTT GATTCATCTG TGGGAGGAAA AACAGGAGTA AATCATCCAA AAGGTAAGAA TTTAATTGGA GCTTTCAATC AACCTAAAGC AGTTTTTATT GATCCAGAAA CTTTAAAAAG TTTGCCCAAA AGAGAATTTA GTGCAGGCAT GGCCGAAGTA ATAAAATACG GAGTAATAAG AGATAAAGAA CTTTTCAAAT ACTTAGAAAT TGAAAAAAAC AAAAATGAAC TTATAAATCT CAAAAATGAA TATCTAATTA AAATAATTAA TAGTTCAATT AAAACAAAGT CTCATATTGT TTCTCAAGAC GAACATGAAA ATGGTGTTAG AGCAATATTG AATTATGGTC ATTCTTTTGG TCACGTTATT GAAAATTTAT GTGGATACGG CAAATTTCTA CATGGTGAGG CAATATCAAT TGGTATGAAT ATTGCAGGTA AAATAGCAAT TGAGAAGGGG TTATGGTCTA AAGAAGAATT AGAAAGGCAG AAAATTCTAT TAGAGAGTTA CGATCTTCCT ACCGAGATCC CCAAAATAAA TAAAGAAGAC GTTCTAACAA TACTTATGGG CGATAAAAAA GTTCGTGATG GCAAAATGAG ATTTATATTA CCGAAAGAAA TTGGTGCTGT TGATATATAT GATGACGTAG AAGATTCATT ATTTTTAAAA TTTTTTTCTT AA
Protein sequence | MNKRKILVPL GDKSYEVTLE AGILNNISEE LLKIGITKNR KILVISNEEI SNLYGEKFLN NLKDNKFQAK MVLIKAGESY KNLKTLSEIY DVAFEFGLDR NSIIIALGGG IVGDVSGFAA ATWLRGIEYI QIPTTLLSMV DSSVGGKTGV NHPKGKNLIG AFNQPKAVFI DPETLKSLPK REFSAGMAEV IKYGVIRDKE LFKYLEIEKN KNELINLKNE YLIKIINSSI KTKSHIVSQD EHENGVRAIL NYGHSFGHVI ENLCGYGKFL HGEAISIGMN IAGKIAIEKG LWSKEELERQ KILLESYDLP TEIPKINKED VLTILMGDKK VRDGKMRFIL PKEIGAVDIY DDVEDSLFLK FFS
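The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the gene sequence is listed 5' to 3' in the coding orientation (it begins with GTG and ends with TAA even though the gene lies on the minus strand), and it uses translation table 11, where GTG is an accepted start codon read as methionine. The function name `translate_cds` and the constants are hypothetical helpers written for this example, not part of any library.

```python
from itertools import product

# Amino-acid assignments in TCAG codon order; table 11 uses the same
# assignments as the standard code and differs only in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            # An initiator codon is read as methionine regardless of its usual meaning.
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence from the record.
print(translate_cds("GTGAATAAGA GAAAAATATT AGTCCCATTA"))  # MNKRKILVPL
```

The output matches the first block of the protein sequence in the record. Passing the full gene sequence in the same way should yield the complete protein, with the trailing TAA ending translation.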