Gene A9601_03911 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_03911 
Symbol 
ID4717086 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp348325 
End bp349275 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content30% 
IMG OID640078100 
Productproline iminopeptidase 
Protein accessionYP_001008786 
Protein GI123967928 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID[TIGR01249] proline iminopeptidase, Neisseria-type subfamily 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.480314 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGATC AAGTCTTGTT CCCGAAAATT GAAGTACGTG AAAAGGGTTT TTTACAAGTT 
AGTGATATTC ATACTATTTA TTGGGAAAGA TCTGGCAATC CAAATGGCAA AAAAATACTT
GTTATTCATG GAGGCCCAGG AGGAGGAAGT CAACCAAGAT ATAGAAGATA CTTTGACCCA
GATAAATTCG ATATTATTCA ATTTGACCAA AGAGGTTGCG GTTCTTCAAC TCCTTTCTCC
GAATTAAAAG AAAATACGAC TAATCATTTA GTAGATGATA TTGAGAAATT AAGGATTCTT
TTTAAAATAG ATACTTGGCA TTTGTTTGGT GGATCTTGGG GCTCAACACT TTCACTAATA
TATGCAATTA AAAATCCCTC AAGAGTTATA AGCTTAACTT TGCGAGGAAT ATTTTTATGT
AGAAAGTTTG AATTATTATG GTTCTATCAA TATGGTGCAA GTGAGATATT CCCTGATGTA
TTTGAAGAAT ATATTTCTGT AATACCAAAA GAAGAAAGAA ATGATTTGAT AAGTTCTTTT
TACAAATATC TAACATCTAC AGATTCGAAT CTTAGATCAA AAGCTGCAGC AGCTTGGACA
AAATGGGAAC TCTCAACTAG TCATTTAATA AATAAAAAAT TCGATTTTGA TAAGTCCCAA
GTTAATTCTT TTTCAGATGC CTTTGCGAGG ATAGAATGTC ATTATTTTAT TAATAATATT
TTCTTAGAAG ATGATTTTAT TTTGAAAAAT TTGAAAACAA TAGAATCGAT TCCAACAAAA
ATAATTCAAG GTAGGTATGA CGTTGTATGT CCTGTTAGGA GTGCATGGGA TCTAAATAAG
AAATTAAAGA ATTCTGAATT AATTATTGTT AATGATGCAG GTCATTCAAT GAGTGAGGAA
GGCATTAGTA TCGAATTAAT AAAAGCTGTA AAAGGAATTC AAAATCTCTA A
 
Protein sequence
MKDQVLFPKI EVREKGFLQV SDIHTIYWER SGNPNGKKIL VIHGGPGGGS QPRYRRYFDP 
DKFDIIQFDQ RGCGSSTPFS ELKENTTNHL VDDIEKLRIL FKIDTWHLFG GSWGSTLSLI
YAIKNPSRVI SLTLRGIFLC RKFELLWFYQ YGASEIFPDV FEEYISVIPK EERNDLISSF
YKYLTSTDSN LRSKAAAAWT KWELSTSHLI NKKFDFDKSQ VNSFSDAFAR IECHYFINNI
FLEDDFILKN LKTIESIPTK IIQGRYDVVC PVRSAWDLNK KLKNSELIIV NDAGHSMSEE
GISIELIKAV KGIQNL