Gene P9301_03901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_03901 
Symbol 
ID4911826 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp346265 
End bp347215 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content30% 
IMG OID640159966 
Productproline iminopeptidase 
Protein accessionYP_001090614 
Protein GI126695728 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID[TIGR01249] proline iminopeptidase, Neisseria-type subfamily 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0624161 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGATC AAGTCTTGTT TCCTAAAATT GAAGTTCGTG AAAAGGGTTT TTTACAAGTA 
AGTGATATTC ATACGATTTA TTGGGAAAGA TCAGGCAACC CAAATGGTAA AAAAATACTA
GTTATTCATG GAGGCCCAGG AGGAGGAAGT CAACCAAGAT ATAGAAGATA CTTTGATCCA
GATAAATTCG ATATTATTCA ATTTGACCAA AGAGGTTGCG GTTCTTCAAC TCCTTTCTCA
GAATTAAAAG AAAATTCGAC TAATCATTTA GTTGAGGATA TTGAGAAATT AAGGATTTTA
TTAAAAATAG ATAGTTGGCA TTTGTTTGGT GGATCTTGGG GCTCAACACT TTCACTTATA
TATGCAATTA AAAATCCCTC TAGAGTTATA AGCTTAACTT TGCGAGGAAT ATTTTTATGT
AGAAAGTTTG AATTATTGTG GTTCTATCAA TATGGTGCAA GTGAGATATT CCCTGATGAA
TTTGAAGAAT ATATTTCTGT GATACCAAAA CAAGAAAGAA ATGATTTGAT AACTTCTTTT
TATAAATATC TAACATCATC AGATGTAAAT CTTAGATCAA AAGCAGCAGC AGCTTGGACA
AAATGGGAAC TCTCAACAAG TCATTTAATA AATAAAAAAT TTGATTTTGA TAAGTCTGAA
GTTAATTCTT TTTCAGATGC CTTTGCAAGA ATCGAATGTC ATTATTTTAT TAATAATATT
TTCTTAGAGG ATGATTTTAT TTTGAAAAAT ATAAAAACAA TAGAATCGAT TCCAACAAAA
ATAATTCAGG GTAGGTATGA CGTTGTGTGT CCTGTTAGGA GTGCTTGGGA TCTAAATAAA
AAATTAAAGA ATTCTGAATT AATTATTGTT GATGATGCTG GTCATTCAAT GAGTGAAAAG
GGCATTACTA AAGAATTAAT AAAAGCTATA AAAGGAATTC AAAATCTCTA A
 
Protein sequence
MKDQVLFPKI EVREKGFLQV SDIHTIYWER SGNPNGKKIL VIHGGPGGGS QPRYRRYFDP 
DKFDIIQFDQ RGCGSSTPFS ELKENSTNHL VEDIEKLRIL LKIDSWHLFG GSWGSTLSLI
YAIKNPSRVI SLTLRGIFLC RKFELLWFYQ YGASEIFPDE FEEYISVIPK QERNDLITSF
YKYLTSSDVN LRSKAAAAWT KWELSTSHLI NKKFDFDKSE VNSFSDAFAR IECHYFINNI
FLEDDFILKN IKTIESIPTK IIQGRYDVVC PVRSAWDLNK KLKNSELIIV DDAGHSMSEK
GITKELIKAI KGIQNL