Gene P9515_02561 details

Gene Information

Locus tag: P9515_02561
Symbol: aroC
ID: 4720392
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9515
Kingdom: Bacteria
Replicon accession: NC_008817
Strand:
Start bp: 237312
End bp: 238406
Gene Length: 1095 bp
Protein Length: 364 aa
Translation table: 11
GC content: 36%
IMG OID: 640079919
Product: chorismate synthase
Protein accession: YP_001010572
Protein GI: 123965491
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0082] Chorismate synthase
TIGRFAM ID: [TIGR00033] chorismate synthase
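
The start, end, gene length, and protein length fields above agree with one another if the coordinates are read as 1-based and inclusive. The record does not state its coordinate convention, so that reading is an assumption, and the variable names below are illustrative only. A minimal sketch of the arithmetic:

```python
# Values copied from the record above.
start_bp = 237312
end_bp = 238406

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases; the final codon is the stop codon
# and does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1095 364
```

Under this reading the 1095 bp gene holds 365 codons, one of which is the stop codon, giving the 364 aa reported for the protein.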


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value: 0.46541
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTAGCA TTTTTGGTAA AATTTTCCGG GTCAGTACTT TTGGAGAATC TCATGGAGGT 
GCAGTTGGAG TTATTCTTGA TGGATGTCCA CCAAAGTTAA AAATAAATAT TGATCTCATA
CAAAATGAAT TAGATAGAAG GAGGCCTGGT CAGAGTAAAA TTACTACCCC AAGAAAGGAA
GATGACAAAT TAGAGATATT AAGTGGTTTA AAGGAAGGAA TAACACTTGG GACTCCTATA
GCCATGTTGG TTCGAAATAA AGACCAAAGA CCAGAAGACT ATAATAATCT TGAGCAAGTA
TTTAGACCAT CTCATGCAGA TGGTACATAT CATCTCAAAT ATGGGATTCA AGCTGGTTCA
GGAGGTGGTA GGGCTTCGGC TAGAGAAACT ATAGGAAGAG TTGCTGCAGG CGCTATTGCA
AAACAATTAT TAAAAACCTT ATTCAATACT GAGATTCTTT CTTGGGTAAA ACGTATACAT
GATATCGACT CTCAAGTTAA TAAAAATAAA CTTACTCTAA GTAAAATAGA TTCAAATATT
GTTAGATGTC CTGATGAAAA GGTGGCTACA AAAATGATTC AAAGAATAAA AGAATTACAG
CAAGAGGGTG ATTCTTGCGG AGGTGTTATT GAATGTTTAG TGAAAAATGT TCCCTCAGGA
TTAGGGATGC CTGTTTTTGA TAAATTGGAG GCTGATTTAG CAAAGGCTCT GATGTCCTTG
CCTGCGACTA AAGGTTTTGA AATAGGTTCA GGTTTCTTAG GAACTTATTT AAGAGGTAGT
GAACATAATG ATTCATTTGT TGAGTCTGAT GACATCAATA AGCTTAAAAC AAAATCAAAT
AATTCTGGAG GTATTCAAGG AGGTATAAGT AATGGAGAGA ATATTGAGAT GAAAATAGCT
TTTAAACCGA CAGCAACTAT TGGAAAGGAA CAGAAAACTG TTAACTCTGA TGGTAAGGAA
ATTGTAATGA AGGCAAAAGG TCGACATGAT CCATGTGTTT TACCAAGAGC AGTACCAATG
GTTGATTCAA TGGTTGCACT TGTTTTAGCA GACCATTTGC TTCTTCATCA AGCGCAATGT
TCAATTATTA AATAG
 
Protein sequence
MSSIFGKIFR VSTFGESHGG AVGVILDGCP PKLKINIDLI QNELDRRRPG QSKITTPRKE 
DDKLEILSGL KEGITLGTPI AMLVRNKDQR PEDYNNLEQV FRPSHADGTY HLKYGIQAGS
GGGRASARET IGRVAAGAIA KQLLKTLFNT EILSWVKRIH DIDSQVNKNK LTLSKIDSNI
VRCPDEKVAT KMIQRIKELQ QEGDSCGGVI ECLVKNVPSG LGMPVFDKLE ADLAKALMSL
PATKGFEIGS GFLGTYLRGS EHNDSFVESD DINKLKTKSN NSGGIQGGIS NGENIEMKIA
FKPTATIGKE QKTVNSDGKE IVMKAKGRHD PCVLPRAVPM VDSMVALVLA DHLLLHQAQC
SIIK
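
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is not the database's own pipeline. It assumes the sequence is given in the coding orientation, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs in which codons may act as start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon-to-amino-acid map in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon.

    Alternative start codons (e.g. GTG, TTG) are not special-cased here;
    this gene starts with ATG, so that does not matter for this record.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence above.
first_line = "ATGAGTAGCA TTTTTGGTAA AATTTTCCGG GTCAGTACTT TTGGAGAATC TCATGGAGGT"
last_line = "TCAATTATTA AATAG"

print(translate(first_line))  # MSSIFGKIFRVSTFGESHGG
print(translate(last_line))   # SIIK
```

The two outputs match the first 20 residues and the last 4 residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives the value that can be compared against the GC content field.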