Gene NATL1_03041 details

Gene Information

Locus tag: NATL1_03041
Symbol: aroC
ID: 4780216
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 281855
End bp: 282940
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 40%
IMG OID: 640083569
Product: chorismate synthase
Protein accession: YP_001014133
Protein GI: 124025017
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0082] Chorismate synthase
TIGRFAM ID: [TIGR00033] chorismate synthase
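
The coordinate and length fields above can be checked against each other with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon; the record itself states neither convention. The function names are hypothetical and exist only for this illustration.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_len = feature_length(281855, 282940)
print(gene_len)                  # 1086
print(protein_length(gene_len))  # 361
```

Under these assumptions the values agree with the Gene Length and Protein Length fields of the record.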


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.418255
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGAAGTA GTTTCGGGAA ACTTTTTACT ATCAGCACCT TTGGCGAATC TCACGGAGGA 
GGTGTTGGTG TAATTATTGA TGGATGTCCT CCAAGGTTGG AGCTCGACAT TAACGAAATA
CAAAATGATC TCAATCGAAG GAGGCCAGGA CAAAGCAAAA TAACTACTCC AAGAAACGAA
AGTGATGAAG TTGAAATTCT TAGTGGTCTT TTAGGTAATA AAACCTTAGG AACACCAATT
GCCATGGTTG TAAGAAATAA AGATCATCGG CCTAAGGATT ATTCTGAAAT TAAAAAAACT
TTTAGGCCAT CTCATGCTGA TGCTACATAT CAGAAAAAAT ACGGAATTCA GGCTTCAAGT
GGCGGCGGGC GTGCATCAGC AAGAGAAACT ATAGGTAGAG TTGCTGCGGG TTCTGTTGCA
AAGCAACTTC TAACTAAGTT TGCTAAAACG GAAATACTCG CTTGGGTAAA GAGAATTCAT
GATATTGAGG CTGAGATTCA TCCGAGTGAA GTTACTTTTG ATGAGATTGA GAAAAATATT
GTTCGATGTC CAAATCAGTC AGCTGCTGAT TTAATGATTC AGAGAGTGGA GGCTTTTGGT
AAAGAAGGAG ACTCCTGTGG TGGAGTCATA GAATGTGTTG TTCGGAATCC GCCCATAGGA
CTTGGTATGC CTGTTTTTGA TAAATTAGAA GCTGATTTAG CAAAGGCATT AATGTCTTTG
CCTGCCACTA AAGGTTTTGA GGTGGGATCT GGTTTCGGAG GTACTTATTT GAAAGGCAGC
GAACATAATG ATCCTTTTTT GCCATCCGAT TCCAATCAAT TGAAAACTGC CACTAACAAT
TCAGGCGGAA TTCAAGGAGG TATCAGTAAT GGTGAGGATA TAGTACTAAG AGTAGGTTTT
AAACCAACAG CAACTATTAG GAAAAGTCAA AAGACAATTG ATGAGGATGG TAATGCAATA
ACTCTCAAGG CGACAGGAAG ACATGATCCT TGTGTTTTGC CAAGGGCAGT TCCAATGGTT
GAAGCAATGG TTGCGCTAGT TTTAGCTGAT CATTTACTAA GGCAAAGAGG CCAATGTACT
GACTAA
 
Protein sequence
MGSSFGKLFT ISTFGESHGG GVGVIIDGCP PRLELDINEI QNDLNRRRPG QSKITTPRNE 
SDEVEILSGL LGNKTLGTPI AMVVRNKDHR PKDYSEIKKT FRPSHADATY QKKYGIQASS
GGGRASARET IGRVAAGSVA KQLLTKFAKT EILAWVKRIH DIEAEIHPSE VTFDEIEKNI
VRCPNQSAAD LMIQRVEAFG KEGDSCGGVI ECVVRNPPIG LGMPVFDKLE ADLAKALMSL
PATKGFEVGS GFGGTYLKGS EHNDPFLPSD SNQLKTATNN SGGIQGGISN GEDIVLRVGF
KPTATIRKSQ KTIDEDGNAI TLKATGRHDP CVLPRAVPMV EAMVALVLAD HLLRQRGQCT
D
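
The relationship between the gene sequence and the protein sequence can be spot-checked with a short script. This is a minimal sketch, not the database's own pipeline: it builds the codon table from the standard NCBI amino-acid string (translation table 11 uses the same codon assignments as the standard code and differs in its permitted start codons), strips the spacing used in the listing, and translates until the first stop codon. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are used as input.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base varies slowest, third base fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


first_line = "ATGGGAAGTA GTTTCGGGAA ACTTTTTACT ATCAGCACCT TTGGCGAATC TCACGGAGGA"
print(translate(first_line))  # MGSSFGKLFTISTFGESHGG
print(translate("GACTAA"))    # D
```

The two printed strings match the first twenty residues and the final residue of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.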