Gene P9211_18251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_18251 
Symbol 
ID5731561 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp1652235 
End bp1653563 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content39% 
IMG OID641286212 
Productputative p-aminobenzoate synthetase 
Protein accessionYP_001551710 
Protein GI159904366 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.240913 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTATTC CAATTAGAAA GCTTTGCGAG TGGTGTGATC CTGTTTACGT TGCTGAAAAT 
TTAATTGCTA ATTTTGGGGA AGATGGATTC ATCTGGCTAG ATAGTGATGG AAGCAAAATA
GGAAGGTGGA TTGTTTTAGC AGCGGAGCCT ATAGATCAAA TTTGCTCTAG GGGCTTGCCT
AGCCAATATT GTAATGCTAA TCCTTTTGAT TCTTTAAGAA GTTTAGAACC TGGCCATTGG
ACTGGCTGGT TAAGTTATGA AGCTGGTGCA TGGATAGAGC CAAATAATCC TTGGAAAGAA
GACTCGATGG CAACTTTGTG GATAGCAAGG CATGATCCTG TATTAAAATT TGATCTGAAA
GAGCAAAAAC TTTGGATAGA AGGTTGTGAT CCCAAACGAT TATTAAAATT ATTTAACTGG
ATAAAAGACC TTAAGAATAA TGAGGCGAAG CAAACCTCAG TACAATCAAA AAGTCCCATA
CGTATTCCAC TTCGGTCTTG GGAATGGTTA ACCAATGAAA AAGAATACGC CGAAAAGGTT
GAGATAATTC AAGAGTGGAT CAAAAAAGGT GATATTTTTC AAGCAAGCCT CTCTGCTTGC
TGTAAAGGTA AAAAGCCACA AAATATGCTT GCCATTGATA TATTCAAAAA ATTGAGGCAT
CATTGTCCCG CCCCATTCTC AGGAATTATT ATTGCGTCAG GAGAAGCAAG CGGTGAAGGC
GTAATATCTA CCTCCCCTGA GCGATTTCTC AAAGTACTAC CTAATGGAAC AGTAGAAACA
CGTCCTATTA AAGGAACTCG CCCCCGTCAA AGCAATGCAC AGAGGGATGC TGATATGGCA
GCTGATTTAA TATGTAGTCA AAAAGATAGA GCCGAAAATG TCATGATTGT GGACCTTCTA
AGAAATGATC TAGGTAAAGT TTGTCAACCA GGTAGTATCC AAGTCACAAA ATTAGTTGGA
CTAGAAAGCT ACTCTCAAGT ACATCATCTA ACATCTGTAA TAAGTGGAAC TCTTAGAGAC
GGCAAAACAT GGGTAGATCT CCTTGAATCA TGCTGGCCAG GAGGTTCAAT CAGTGGTGCA
CCTAAGCTAA GAGCATGTCA AAGATTATAT GAACTTGAAC CTATTGCACG AGGTCCATAC
TGTGGCTCAT TCATACATGT TGATTGGGAT GGTCAATTTG ATAGCAATAT TTTAATTCGA
TCTCTCATGA TTAATAAATC GAATCTTCGT GTAAATGCAG GTTGCGGGAT CGTTGCAGAT
TCAGATGCTA ACAATGAAGC GGAAGAACTG ACCTGGAAAT TATTGCCTTT ATTAAAAGCA
TTGGATTGA
 
Protein sequence
MIIPIRKLCE WCDPVYVAEN LIANFGEDGF IWLDSDGSKI GRWIVLAAEP IDQICSRGLP 
SQYCNANPFD SLRSLEPGHW TGWLSYEAGA WIEPNNPWKE DSMATLWIAR HDPVLKFDLK
EQKLWIEGCD PKRLLKLFNW IKDLKNNEAK QTSVQSKSPI RIPLRSWEWL TNEKEYAEKV
EIIQEWIKKG DIFQASLSAC CKGKKPQNML AIDIFKKLRH HCPAPFSGII IASGEASGEG
VISTSPERFL KVLPNGTVET RPIKGTRPRQ SNAQRDADMA ADLICSQKDR AENVMIVDLL
RNDLGKVCQP GSIQVTKLVG LESYSQVHHL TSVISGTLRD GKTWVDLLES CWPGGSISGA
PKLRACQRLY ELEPIARGPY CGSFIHVDWD GQFDSNILIR SLMINKSNLR VNAGCGIVAD
SDANNEAEEL TWKLLPLLKA LD