Gene P9301_02461 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_02461 
SymbolaroC 
ID4911581 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp228980 
End bp230077 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content36% 
IMG OID640159812 
Productchorismate synthase 
Protein accessionYP_001090470 
Protein GI126695584 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAGTA GTTTTGGAAA AATTTTTCGT GTTAGTACTT TTGGAGAATC ACATGGTGGT 
GCAGTAGGAG TTATCCTTGA TGGATGTCCA CCTAAATTAA AAATAGATAT AAATCTGATA
CAAAATGAAT TAGATAGGCG CAGACCTGGC CAAAGTGACA TTACAACACC CAGAAATGAA
GAAGATAAAA TTGAAATATT AAGTGGAATA AAGGAAGGGT TAACACTTGG AACTCCAATA
GCAATGTTGG TAAGAAATAA GGATCAAAGA CCAGCAGATT ATAATAATAT GGAGCAGGTA
TTTAGACCTT CTCATGCAGA TGGTACATAT CATCTGAAAT ATGGAATTCA GGCAAGTTCT
GGCGGTGGAA GAGCCTCTGC TAGAGAAACA ATTGGGAGAG TAGCTGCTGG TGCTGTAGCA
AAACAATTAT TAAAAACCTT CTGTAACACT GAAATACTAT CTTGGGTAAA GCGTATACAT
GATATTGATT CTGATATAAA TAAAGAGAAG ATTTCTCTCA AAAAAATAGA TTCAAATATT
GTTAGATGTC CTGATGAAAA GGTATCAACA GAAATGATTG AGAGAATTAA GGAATTAAAG
CGTCAAGGAG ACTCTTGCGG CGGTGTTATT GAATGTCTAG TAAGAAATGT TCCCTCTGGT
CTTGGAATGC CTGTTTTTGA TAAATTAGAA GCTGATTTAG CAAAGGCTTT GATGTCTTTG
CCTGCCACGA AAGGCTTTGA AATAGGTTCA GGTTTCTCTG GCACTTATTT AAAAGGAAGC
GAGCATAATG ATTCGTTCAT TAAGTCTGAT GATTCTAGTA AATTAAGAAC AACATCAAAC
AATTCAGGAG GTATACAGGG TGGAATAAGT AATGGTGAAA ATATTGAGAT GAAGATAGCT
TTTAAACCTA CAGCAACTAT CGGGAAAGAA CAAAAAACAG TAAATGCTGA AGGGAAAGAA
GTTTTGATGA AAGCAAAAGG GAGACATGAT CCATGCGTTT TACCAAGAGC AGTTCCCATG
GTTGATGCTA TGGTTGCCTT AGTACTTGCT GATCATTTGC TTCTAAATCA TGCTCAATGT
GACTTAATCA ATAACTAG
 
Protein sequence
MSSSFGKIFR VSTFGESHGG AVGVILDGCP PKLKIDINLI QNELDRRRPG QSDITTPRNE 
EDKIEILSGI KEGLTLGTPI AMLVRNKDQR PADYNNMEQV FRPSHADGTY HLKYGIQASS
GGGRASARET IGRVAAGAVA KQLLKTFCNT EILSWVKRIH DIDSDINKEK ISLKKIDSNI
VRCPDEKVST EMIERIKELK RQGDSCGGVI ECLVRNVPSG LGMPVFDKLE ADLAKALMSL
PATKGFEIGS GFSGTYLKGS EHNDSFIKSD DSSKLRTTSN NSGGIQGGIS NGENIEMKIA
FKPTATIGKE QKTVNAEGKE VLMKAKGRHD PCVLPRAVPM VDAMVALVLA DHLLLNHAQC
DLINN