Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_30051 |
Symbol | aroE |
ID | 4778533 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 2662260 |
End bp | 2663228 |
Gene Length | 969 bp |
Protein Length | 322 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640088529 |
Product | shikimate 5-dehydrogenase |
Protein accession | YP_001019000 |
Protein GI | 124024693 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0169] Shikimate 5-dehydrogenase |
TIGRFAM ID | [TIGR00507] shikimate 5-dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.355106 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTTCCCG AACAGCTCGC CCGGCAGGCT GGGCACAGAA CCGATCAAGC AATGGGAATG ATTAGCGGCA CAACCTCCCT AATAGGCCTG CTCGGCCAAC CAGTGCACCA TTCCCTCTCA CCAGTGATGC AAAACGCAGC CCTCACTGCA ATGAATCTCG ACTGGTGCTA CATGGCCATG CCCTGCGAAA CCAACGATCT GGCCAATGTG CTCTGCGCTC TGCGTTCAAT CAACTGCCTT GGCCTGAACA TCACCATCCC CCACAAACAA GACGTGGCCA AAACCTGTCG AGAGCTCAGC CCACTAGCAA AACGACTTAA AGCCGTCAAC ACTCTGATCC CTCATGTCGA CGGCGGCTGG ACAGGCACCA ACACAGACGT GGCCGGTTTT ATTGCACCCC TTCAAGAAAG CAAGTGCGAG TGGCATGGGC GTCGTGCCGT CGTGCTCGGT TGCGGTGGTA GCGCCCGCGC AGTCGTTGCA GGTCTGCAAG ATTTAAAATT GGCTCAAATC ATGGTGGTTG GCCGCCGATC TGATGCGCTG AAGAGATTTC TCGATGATCT CCAGCCCAAC CCAGCCAGCT CCGAATCTGA TTGCCAAGTG CTCTTGCAAG GAATTCTCCA ACAGGACCCT GCTCTGGTTG AACAGCTAAC CAAAGCCGAT CTGGTGGTCA ACACCACACC AGTAGGCATG TCCCAAAACC GTTCGGAAAC ATCAACTCCT AGAGCGCCAA TGCCCCTGGG GAAGAACATT TGGCAAAACC TAAGCCCAAA GACAACTCTC TATGACCTGA TTTACACACC AAAACCAACC GCCTGGCTGA CCTTAGGAAC TGAACATGGC TGCCATTGCA TAGATGGCCT CGAAATGCTT GTTCAACAAG GCGCTGCCTC TCTAAGGCTC TGGAGCGGCA ACAACCAGGT GCCTGTCGAA GAGATGAGAA AAGCTGCTCT GGGCTGGCTC ACGGTTTAG
|
Protein sequence | MVPEQLARQA GHRTDQAMGM ISGTTSLIGL LGQPVHHSLS PVMQNAALTA MNLDWCYMAM PCETNDLANV LCALRSINCL GLNITIPHKQ DVAKTCRELS PLAKRLKAVN TLIPHVDGGW TGTNTDVAGF IAPLQESKCE WHGRRAVVLG CGGSARAVVA GLQDLKLAQI MVVGRRSDAL KRFLDDLQPN PASSESDCQV LLQGILQQDP ALVEQLTKAD LVVNTTPVGM SQNRSETSTP RAPMPLGKNI WQNLSPKTTL YDLIYTPKPT AWLTLGTEHG CHCIDGLEML VQQGAASLRL WSGNNQVPVE EMRKAALGWL TV
|
| |