Gene A9601_19021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_19021 
Symbol 
ID4718641 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp1638093 
End bp1639406 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content29% 
IMG OID640079637 
Productputative p-aminobenzoate synthetase 
Protein accessionYP_001010292 
Protein GI123969434 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0758384 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATAA AAAAAATAAT TCTAGAAAAA TGGATAGATC CAGCACTGAT TACGCATCAT 
CTAACAAAAA AATTTGGAGA TCAAGGATTA GCTTGGCTAG ACAGCGATGG CAAAGAAAAT
GGGGAATGGT CAATAATAGG AATTAAACCT AAAAAAATAA TCCAATCAAG AGATATCAAT
AACTTAGACA AAACTAATAA TCCATTTAAC AATTTAAGAA ATATTGAAAA AGGATTTTGG
ATCGGATGGT TAAGTTATGA AGCCGGAGTT TACATAGAAC CAAAAAACCC ATGGAAAAAA
TCTAATATGG CAACTTTATG GATTGCATCA TATGATCCAA TCATTAAATG TAATCTAATA
AAAAAAGAAA TAATTATCGA AGGCACAAAC TCATCTGAAC TGATGAATTA TAAAAACATA
ATCAACAATA TAAAAAATAT CGAAGAAGAA AATATTATTA AAACAAAGTT GAATTTTGAT
TTTTCAAAAA TAAATTTGGA CGAAATGGCT GAAAAATTTC AGAAAAATAT TTTAAAATTG
AAAAAATTAA TTTCCTTAGG AGATATATTT CAAGCAAACC TAACAACTAA ATGCGAAATT
GAATCTTCCA AAAACTATAA TCCTCTAGAT ATTTATTTGA AAATAAGAAG GAAATTAAGA
GCTCCCTTTG GAGGAATAAT AATAAATAAT AATTATAAAG AGGCTGTATT ATCTACCTCG
CCAGAAAGGT TTTTAAAGAT AGATAATAAA AATTTTGTAG AATCAAGACC TATCAAAGGA
ACTAGATCCA GAGATAAAGA TTTAAATCAA GACGCACTTA ATGCTATCGA TTTAATAACT
AATGAGAAAG ATAGAGCCGA AAATATTATG ATTGTTGACC TAATAAGAAA TGATTTAAGT
AAAGTTTGCG AAACAGGAAG TATTATGGTG CCAGAAATAT TAAAACTTGA AAGTTTCTTA
AAAGTTCATC ATCTAACTTC AGTAATCAGA GGCAAATTAA AAAAAGACAC GAACTGGATT
GATTTACTAA AAGCTTGTTG GCCTGGGGGC TCTATAACTG GAGCACCTAA ATTAAGATCA
TGCCAGAGAC TTTTTGAATT AGAAAAATGT GAACGCGGAC CATACTGTGG GTCATTTTTG
AAGCTTGACT GGAATGGAGA GTTTGACAGC AATATACTAA TAAGATCATT TTTAGTTAAA
GACAAAAAAA TCAATATATA TGCTGGTTGC GGAATAGTTA TTGACTCAGA CCCGGAGGAA
GAAACTGATG AACTAAAGTG GAAACTTTTA CCATTAATTG ATTCACTAAA ATGA
 
Protein sequence
MKIKKIILEK WIDPALITHH LTKKFGDQGL AWLDSDGKEN GEWSIIGIKP KKIIQSRDIN 
NLDKTNNPFN NLRNIEKGFW IGWLSYEAGV YIEPKNPWKK SNMATLWIAS YDPIIKCNLI
KKEIIIEGTN SSELMNYKNI INNIKNIEEE NIIKTKLNFD FSKINLDEMA EKFQKNILKL
KKLISLGDIF QANLTTKCEI ESSKNYNPLD IYLKIRRKLR APFGGIIINN NYKEAVLSTS
PERFLKIDNK NFVESRPIKG TRSRDKDLNQ DALNAIDLIT NEKDRAENIM IVDLIRNDLS
KVCETGSIMV PEILKLESFL KVHHLTSVIR GKLKKDTNWI DLLKACWPGG SITGAPKLRS
CQRLFELEKC ERGPYCGSFL KLDWNGEFDS NILIRSFLVK DKKINIYAGC GIVIDSDPEE
ETDELKWKLL PLIDSLK