Gene A9601_02451 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_02451 
SymbolaroC 
ID4716929 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp228016 
End bp229113 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content38% 
IMG OID640077944 
Productchorismate synthase 
Protein accessionYP_001008640 
Protein GI123967782 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.642724 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAGTA GTTTTGGAAA AATTTTTCGT GTTAGTACTT TTGGAGAATC ACATGGTGGT 
GCAGTAGGAG TTATCCTTGA TGGATGTCCC CCTAAGTTAA AAGTAGATAT AAATCTGATA
CAAAATGAAT TAGATAGGCG AAGGCCTGGC CAAAGTGACA TTACAACGCC CAGAAATGAA
GAAGATAAAA TTGAAATATT AAGTGGGATA AAGGAAGGGT TCACACTTGG AACTCCAATA
GCGATGTTGG TAAGAAACAA GGATCAAAGA CCAGGAGACT ATGATAATTT GGAACAAGTA
TTTAGGCCTT CTCATGCAGA TGGTACATAT CATCTGAAAT ATGGAATTCA GGCAAGTTCT
GGCGGTGGAA GAGCCTCTGC TAGAGAAACA ATTGGGAGAG TAGCTGCTGG TGCTGTAGCA
AAACAATTAT TAAAAACCTT CTGTAACACT GAAATACTAT CTTGGGTAAA GCGTATACAT
GATATTGAGT CTGATATAAA TAAAGAGAAG ATTTCTCTCA AACAAATAGA TTCTAATATT
GTTAGATGTC CAGATGAAAA GGTATCAACA GAAATGATCG AGAGAATTAA GGAATTAAAG
CGTCAAGGAG ACTCTTGCGG CGGTGTTATT GAATGTCTAG TAAGGAATGT TCCCTCGGGT
CTTGGAATGC CAGTTTTTGA TAAATTAGAA GCTGATTTAG CAAAGGCTTT GATGTCTTTG
CCTGCCACGA AAGGCTTTGA AATAGGTTCA GGTTTCTCTG GAACTTATTT AAAAGGAAGC
GAACATAATG ATGCATTCAT CAAGTCTGAT GATATTAGTA AGTTAAGAAC AACATCAAAC
AATTCAGGAG GTATACAGGG CGGAATAAGT AATGGTGAAA ATATCGAGAT GAAGATAGCT
TTTAAACCTA CAGCAACCAT CGGGAAAGAA CAGAAAACAG TAAATGCTGA AGGTAAAGAA
GTACTTATGA AAGCAAAAGG GAGGCACGAT CCATGCGTTC TACCAAGAGC AGTTCCCATG
GTTGATGCTA TGGTAGCTCT AGTACTTGCT GATCATTTGC TTCTAAATCA TGCTCAATGT
GACTTAATAA ATAAGTAG
 
Protein sequence
MSSSFGKIFR VSTFGESHGG AVGVILDGCP PKLKVDINLI QNELDRRRPG QSDITTPRNE 
EDKIEILSGI KEGFTLGTPI AMLVRNKDQR PGDYDNLEQV FRPSHADGTY HLKYGIQASS
GGGRASARET IGRVAAGAVA KQLLKTFCNT EILSWVKRIH DIESDINKEK ISLKQIDSNI
VRCPDEKVST EMIERIKELK RQGDSCGGVI ECLVRNVPSG LGMPVFDKLE ADLAKALMSL
PATKGFEIGS GFSGTYLKGS EHNDAFIKSD DISKLRTTSN NSGGIQGGIS NGENIEMKIA
FKPTATIGKE QKTVNAEGKE VLMKAKGRHD PCVLPRAVPM VDAMVALVLA DHLLLNHAQC
DLINK