Gene P9303_30051 details

Gene Information

Locus tag: P9303_30051
Symbol: aroE
ID: 4778533
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 2662260
End bp: 2663228
Gene Length: 969 bp
Protein Length: 322 aa
Translation table: 11
GC content: 55%
IMG OID: 640088529
Product: shikimate 5-dehydrogenase
Protein accession: YP_001019000
Protein GI: 124024693
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0169] Shikimate 5-dehydrogenase
TIGRFAM ID: [TIGR00507] shikimate 5-dehydrogenase
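
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the helper names are hypothetical, and it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2662260, 2663228)
print(length)                     # 969
print(protein_length_aa(length))  # 322
```

Both values agree with the Gene Length and Protein Length fields of this record.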


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.355106
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTTCCCG AACAGCTCGC CCGGCAGGCT GGGCACAGAA CCGATCAAGC AATGGGAATG 
ATTAGCGGCA CAACCTCCCT AATAGGCCTG CTCGGCCAAC CAGTGCACCA TTCCCTCTCA
CCAGTGATGC AAAACGCAGC CCTCACTGCA ATGAATCTCG ACTGGTGCTA CATGGCCATG
CCCTGCGAAA CCAACGATCT GGCCAATGTG CTCTGCGCTC TGCGTTCAAT CAACTGCCTT
GGCCTGAACA TCACCATCCC CCACAAACAA GACGTGGCCA AAACCTGTCG AGAGCTCAGC
CCACTAGCAA AACGACTTAA AGCCGTCAAC ACTCTGATCC CTCATGTCGA CGGCGGCTGG
ACAGGCACCA ACACAGACGT GGCCGGTTTT ATTGCACCCC TTCAAGAAAG CAAGTGCGAG
TGGCATGGGC GTCGTGCCGT CGTGCTCGGT TGCGGTGGTA GCGCCCGCGC AGTCGTTGCA
GGTCTGCAAG ATTTAAAATT GGCTCAAATC ATGGTGGTTG GCCGCCGATC TGATGCGCTG
AAGAGATTTC TCGATGATCT CCAGCCCAAC CCAGCCAGCT CCGAATCTGA TTGCCAAGTG
CTCTTGCAAG GAATTCTCCA ACAGGACCCT GCTCTGGTTG AACAGCTAAC CAAAGCCGAT
CTGGTGGTCA ACACCACACC AGTAGGCATG TCCCAAAACC GTTCGGAAAC ATCAACTCCT
AGAGCGCCAA TGCCCCTGGG GAAGAACATT TGGCAAAACC TAAGCCCAAA GACAACTCTC
TATGACCTGA TTTACACACC AAAACCAACC GCCTGGCTGA CCTTAGGAAC TGAACATGGC
TGCCATTGCA TAGATGGCCT CGAAATGCTT GTTCAACAAG GCGCTGCCTC TCTAAGGCTC
TGGAGCGGCA ACAACCAGGT GCCTGTCGAA GAGATGAGAA AAGCTGCTCT GGGCTGGCTC
ACGGTTTAG
 
Protein sequence
MVPEQLARQA GHRTDQAMGM ISGTTSLIGL LGQPVHHSLS PVMQNAALTA MNLDWCYMAM 
PCETNDLANV LCALRSINCL GLNITIPHKQ DVAKTCRELS PLAKRLKAVN TLIPHVDGGW
TGTNTDVAGF IAPLQESKCE WHGRRAVVLG CGGSARAVVA GLQDLKLAQI MVVGRRSDAL
KRFLDDLQPN PASSESDCQV LLQGILQQDP ALVEQLTKAD LVVNTTPVGM SQNRSETSTP
RAPMPLGKNI WQNLSPKTTL YDLIYTPKPT AWLTLGTEHG CHCIDGLEML VQQGAASLRL
WSGNNQVPVE EMRKAALGWL TV
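
The gene sequence begins with GTG while the protein begins with M: under translation table 11 (the bacterial, archaeal and plant plastid code), GTG can serve as an alternative start codon and is read as methionine at the first position. The following is a minimal sketch of how the protein sequence and GC content could be derived from the gene sequence. The function names are hypothetical, the codon table is the standard one that table 11 shares for elongation, and the set of alternative start codons is deliberately limited to three common ones as a simplifying assumption.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only a subset of the start codons permitted by table 11.
ALT_STARTS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS, reading an alternative start codon as M and
    stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        codon = s[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in ALT_STARTS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGGTTCCCG AACAGCTCGC CCGGCAGGCT"))  # MVPEQLARQA
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to gc_percent gives a value to compare against the GC content field.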