Gene P9303_23791 details


Gene Information

Locus tag: P9303_23791
Symbol: aroC
ID: 4776524
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 2096588
End bp: 2097676
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 55%
IMG OID: 640087899
Product: chorismate synthase
Protein accession: YP_001018377
Protein GI: 124024070
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0082] Chorismate synthase
TIGRFAM ID: [TIGR00033] chorismate synthase
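
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(2096588, 2097676)
print(length_bp)                  # 1089
print(protein_length(length_bp))  # 362
```

Both values agree with the Gene Length and Protein Length fields of this record.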


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0240204
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCAGCA GCTTTGGGGA TCTATTCCGA ATCAGCACCT TCGGTGAATC TCATGGAGGG 
GGCGTGGGAG TCATCGTGGA AGGCTGCCCA CCAAGGCTTG AGCTTGACCT ACAGAAGATT
CAGGCCGAGC TAGATCGACG CAAACCTGGC CAAAGCAAGA TCAGTACACC CCGCAAGGAA
GAAGATCAAG TCGAAATCCT CAGCGGTCTA CTCAACAACA CAACCCTTGG AACACCAATC
GCCATGGTGG TGCGCAACAA GGATCACAAA CCTGGCGACT ACAAGGAGAT GAATGTTGCA
TTTCGGCCTT CCCATGCCGA TGCCACCTAT CAGGCGAAGT ACGGCATCCA AGCTCGAAGC
GGTGGAGGGC GAGCCTCCGC AAGAGAGACG ATTGCGCGGG TAGCCGCTGG AGCGATTGCC
AAGCAATTAC TGACCAAAGC CCATAACACC GAAGTACTGG CATGGGTCAA ACGCATTCAC
ACCCTGGAGG CCGAAATCAA TGCCCAGGAC GTCAGCATTG ATGACGTCGA AGCAAACATC
GTGCGTTGCC CGAACCAAGT CATGGCAGCG CAAATGGTGG AGCGTATTGA AGCCATCAGC
CGTGAAGGCG ACTCATGCGG TGGTGTGATC GAGTGTGTTG TACGCAATGC CCCAATGGGT
CTGGGGATGC CTGTGTTCGA CAAGCTTGAG GCAGACCTCG CCAAGGCCGT GATGTCACTA
CCTGCCAGCA AGGGCTTTGA GATCGGCTCG GGGTTTGGCG GCACCCTGCT AAAAGGCAGC
GAGCACAACG ACGCTTTCCT CCCCAGCAAT GATGGTCGCC TACGAACAGC CACCAACAAC
TCTGGTGGCA TCCAGGGAGG GATCACTAAC GGTGAATCCA TCGTGATCCG AGTGGCGTTC
AAGCCAACAG CCACCATCCG TAAAGATCAA CAAACAATTG ACGCTGATGG CAACACCACG
ACACTGTCTG CCAAAGGTCG TCATGATCCC TGCGTCCTGC CTAGGGCCGT ACCAATAGTT
GAAGCCATGG TGTCCCTTGT ACTCGCTGAT CACCTCTTAC GCCAACAAGG ACAGTGCAGT
CTCTGGTAA
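
The gene sequence is printed in blocks of 10 bases, so it has to be joined before it can be measured. The sketch below shows one way to do that and to recompute a GC fraction, assuming GC content means the share of G and C among all bases and that the record rounds the value for display. The helper names are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no arguments splits on spaces and newlines alike.
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence above, used as a small example.
sample = clean_sequence("ATGGGCAGCA GCTTTGGGGA")
print(len(sample))          # 20
print(gc_fraction(sample))  # 0.6
```

Passing the full gene sequence through `clean_sequence` should give a length that matches the Gene Length field.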
 
Protein sequence
MGSSFGDLFR ISTFGESHGG GVGVIVEGCP PRLELDLQKI QAELDRRKPG QSKISTPRKE 
EDQVEILSGL LNNTTLGTPI AMVVRNKDHK PGDYKEMNVA FRPSHADATY QAKYGIQARS
GGGRASARET IARVAAGAIA KQLLTKAHNT EVLAWVKRIH TLEAEINAQD VSIDDVEANI
VRCPNQVMAA QMVERIEAIS REGDSCGGVI ECVVRNAPMG LGMPVFDKLE ADLAKAVMSL
PASKGFEIGS GFGGTLLKGS EHNDAFLPSN DGRLRTATNN SGGIQGGITN GESIVIRVAF
KPTATIRKDQ QTIDADGNTT TLSAKGRHDP CVLPRAVPIV EAMVSLVLAD HLLRQQGQCS
LW
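
The protein sequence can be derived from the gene sequence by codon translation. This is a minimal sketch, assuming the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which start codons are permitted). It does not handle alternative start codons, which does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in T, C, A, G order for each position, paired with the
# amino acids in the same order ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGGGCAGCA GCTTTGGGGA TCTATTCCGA"))  # MGSSFGDLFR
print(translate("CTCTGGTAA"))                         # LW
```

The two fragments reproduce the first block and the final two residues of the protein sequence listed above.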