Gene P9211_02471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_02471 
SymbolaroC 
ID5731734 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp237674 
End bp238765 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content42% 
IMG OID641284591 
Productchorismate synthase 
Protein accessionYP_001550132 
Protein GI159902788 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCAGCA GTTTTGGAGA TCTTTTTCGA ATAAGTACTT TTGGAGAGTC TCATGGAGGA 
GGGGTGGGTG TAATTTTGGA GGGTTGCCCT CCTAGGCTTG CAATAGATGT TGATGCAATT
CAGGCAGAAC TGGATAGGAG GAGACCAGGA CAAAGCAAAA TTACAACCCC TAGAAATGAA
GTTGACCAAG TAGAAATCTT GAGTGGGCTT GTTGACAACA AAACTTTAGG CACACCAATA
TCGATGGTCG TTAGGAATAA AGATTTCCGA CCAAATGACT ATGGGGAAAT GCAAAATATT
TTTAGGCCTT CCCATGCAGA TGGAACTTAT CATTTGAAAT ATGGAGTCCA AGCTGCTAGT
GGTGGAGGGA GAGCTTCTGC TCGAGAAACG ATTGGTCGTG TAGCTGCTGG AGCAATTGCG
AAACAATTAC TTAGAAAAGT CAATCAGACG GAAGTTTTGG CATGGGTGAA ACGAATTCAC
ACTATTGAGG CTGATGTGGA TCCAAATTCT GTTCAGATTA AAGATATTGA ATCGAATATT
GTTCGATGTC CTGATCCCAA AATTGCAAAA CTGATGGTTG AAAGAATTGA AGAAGTTAGT
CGGGATGGAG ACTCTTGTGG TGGAGTTATT GAATGCATTG TGCGTAATCC TCCAGCAGGA
TTGGGAATGC CTGTCTTTGA CAAATTAGAA GCTGATTTAG CCAAGGCATT GATGTCATTA
CCTGCAAGCA AGGGCTTTGA AATTGGATCA GGATTTAGTG GCACTTTTCT GAAAGGAAGT
GAACATAATG ACGCTTTTAT TCCTTCAGGT AAAGGCATTT TGAGAACAGC GACTAATAAT
TCTGGAGGCA TACAAGGTGG GATTAGTAAT GGAGAGTTAA TTGTTTTAAG AGTTGCCTTT
AAACCCACAG CCACAATTCG CAAAGATCAA AAGACTGTTG ATTCTGACGG GAAGGAAAGA
ACATTGTCAG CCAAAGGAAG ACATGATCCA TGTGTTTTGC CAAGAGCAGT ACCTATGGTG
GAATCTATGG TGGCATTAGT TTTGGCTGAT CATCTTTTAA GACAGCAAGG TCAATGCGGC
CTTTGGCAGT AA
 
Protein sequence
MGSSFGDLFR ISTFGESHGG GVGVILEGCP PRLAIDVDAI QAELDRRRPG QSKITTPRNE 
VDQVEILSGL VDNKTLGTPI SMVVRNKDFR PNDYGEMQNI FRPSHADGTY HLKYGVQAAS
GGGRASARET IGRVAAGAIA KQLLRKVNQT EVLAWVKRIH TIEADVDPNS VQIKDIESNI
VRCPDPKIAK LMVERIEEVS RDGDSCGGVI ECIVRNPPAG LGMPVFDKLE ADLAKALMSL
PASKGFEIGS GFSGTFLKGS EHNDAFIPSG KGILRTATNN SGGIQGGISN GELIVLRVAF
KPTATIRKDQ KTVDSDGKER TLSAKGRHDP CVLPRAVPMV ESMVALVLAD HLLRQQGQCG
LWQ