Gene P9301_18831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_18831 
Symbol 
ID4912710 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp1610087 
End bp1611403 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content29% 
IMG OID640161489 
Productputative p-aminobenzoate synthetase 
Protein accessionYP_001092107 
Protein GI126697221 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.264202 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATAA AAAAATTAAT TCTAGAAAAA TGGATAGATC CAGCACTGAT TACGCATCAT 
CTAACAAAAA AATTCGGAGA TAAAGGGTTA GCTTGGCTAG ACAGTGATGG TAAAGAAAAT
GGGGAATGGT CAATAATAGG AATAAAACCT AAAAAAATAA TACAATCAAG AGATATCAAT
AACTTAGACA AAACTAATAA TCCATTTAAC AATTTAAAAA ATATTGAAAA AGGATTTTGG
ATCGGATGGT TAAGTTATGA AGCTGGAGTT TACATAGAAC CCAAAAACCC ATGGCGAAAA
TCTAATATGG CAACTTTATG GATTGCGTCA TATGATCCAA TCATTAAATG TAATCTAATA
AAAAAAGAAA TAATTATCGA AGGCACAAAC TCATCTGAAC TGATGAAATT TAAAAATACA
ATAGACAATA TAAAAAATAT TGAAGAAGAA AATATTATTA AAACAAATTT AAATTTTGAT
TTTTCAAAAA TAAATTTGGA CGAAATGGCT GAAAAATTTC AGAAAAATAT TCTAAAATTA
AAGAAATTAA TTTCTGAAGG GGATATATTT CAAGCAAACC TAACAACTAA ATGCGAGATT
GAATCTTCCA AAAATTATAA TCCTTTAGAT ATTTATTTAA AAATAAGAAG GAAATTAAGA
GCTCCATTTG GAGGAATAAT AATCAATAAT GATAATCATA ATGAGGCGGT ATTATCTACC
TCCCCAGAAA GATTTATAAA AATAGATAAT AAAAGTTTTG TAGAATCAAG ACCTATTAAA
GGAACTAGAT CCAGAGATAA GGATTTAAAT CAAGACGCAC TAAATGCTAT CGATTTAATA
ACGAACGAAA AAGATAGAGC CGAAAATATT ATGATTGTTG ACCTAATAAG AAATGATTTA
AGTAAAGTTT GCGAAACAGG AAGTATTATG GTGCCAGAAA TATTAAAACT TGAAAGTTTC
TTAAAAGTTC ATCATCTAAC TTCAGTAATC AGAGGCAAAT TAAAAAAAAA CAAGAACTGG
ATTGATTTAC TAAAAGCTTG TTGGCCAGGG GGCTCAATAA CTGGAGCACC TAAATTAAGA
TCATGCCAGA GACTTTTTGA ATTAGAAGAA TATGAACGCG GACCATACTG TGGCTCGTTT
TTGAAGCTTG ACTGGAATGG AGAGTTTGAC AGCAATATAC TAATAAGATC ATTTTTAATT
AAAGACAAAA AAATCAGTAT ATATGCTGGT TGCGGAATAG TTATTGACTC AAACCTTGAA
GAGGAAACTA ATGAACTAAA GTGGAAACTT TTACCGTTAA TTGATTCACT AAAATGA
 
Protein sequence
MKIKKLILEK WIDPALITHH LTKKFGDKGL AWLDSDGKEN GEWSIIGIKP KKIIQSRDIN 
NLDKTNNPFN NLKNIEKGFW IGWLSYEAGV YIEPKNPWRK SNMATLWIAS YDPIIKCNLI
KKEIIIEGTN SSELMKFKNT IDNIKNIEEE NIIKTNLNFD FSKINLDEMA EKFQKNILKL
KKLISEGDIF QANLTTKCEI ESSKNYNPLD IYLKIRRKLR APFGGIIINN DNHNEAVLST
SPERFIKIDN KSFVESRPIK GTRSRDKDLN QDALNAIDLI TNEKDRAENI MIVDLIRNDL
SKVCETGSIM VPEILKLESF LKVHHLTSVI RGKLKKNKNW IDLLKACWPG GSITGAPKLR
SCQRLFELEE YERGPYCGSF LKLDWNGEFD SNILIRSFLI KDKKISIYAG CGIVIDSNLE
EETNELKWKL LPLIDSLK