Gene P9515_03961 details


Gene Information

Locus tag: P9515_03961
Symbol:
ID: 4719584
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9515
Kingdom: Bacteria
Replicon accession: NC_008817
Strand:
Start bp: 359937
End bp: 360878
Gene Length: 942 bp
Protein Length: 313 aa
Translation table: 11
GC content: 31%
IMG OID: 640080069
Product: proline iminopeptidase
Protein accession: YP_001010712
Protein GI: 123965631
COG category: [R] General function prediction only
COG ID: [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)
TIGRFAM ID: [TIGR01249] proline iminopeptidase, Neisseria-type subfamily
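
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the reported gene length) and that the CDS is complete and ends in a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a complete CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(359937, 360878)
print(length)                     # 942
print(protein_length_aa(length))  # 313
```

Both values agree with the Gene Length and Protein Length fields in the record.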


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGAAA AAAAATTATT CCCCATAATA GAACCAAGAG AAAAGGGTTT TTTGCAAGTA 
AGTAGAATTC ATTCTATCTA TTGGGAAAGA TCAGGAAATC AAAATGGTAA AAAGATACTA
GTAATTCATG GCGGTCCTGG TGGAGGAAGT CAGCCAAGAT ATAGAAGATA TTTTAATCCT
GAGAAATTTG ATATTATTCA ATTTGATCAA AGAGGTTGCG GTGCCTCAAG ACCTTTTTCA
GAATTAAGAG AAAATACAAC AAAAGATCTA GTAGATGATA TTGAAAAATT AAGATTAAAT
CTTAATATAG ATTCTTGGCA TTTGTTTGGA GGATCATGGG GATCTACGTT AGCTTTGATT
TATGCAATAA AACATCCATC TAGGGTCAAA AGTATGACCT TAAGAGGAAT ATTTTTATGT
AGAAAATTTG AACTCTCATG GTTCTACCAA TATGGTGCAA GTGAAATTTT CCCAGAAGAG
TTTGAAAAAT ATATCTCAGT TATTCCTAAA GATGAAAGGT CTGATTTGGT ATCTTCTTTT
TATAAATATT TAACATCTCC TAATATCGAA CTTAGATCAA AAGCTGCGGC TGCGTGGACA
ACTTGGGAAC TTTCTACAAG TCATCTTATT AAGAGAGATA TTGATGTTGG TAAAAGTAAA
ACTAATTCCT TTTCAGATGC TTTCGCAAGA ATAGAATGTC ATTATTTCAT AAACCATATC
TTCTTAGAGG AGGATTTTAT TATGAAAAAT ATAAAGACTA TAGAATCTAT TCCAACTAAA
ATCATTCAAG GAAGATATGA CGTGGTCTGT CCTGTGAGAA GTGCTTGGGA TCTAAATAAA
AAATTAAAAA ATTCAGAATT AATAATCATT GATGAAGCTG GTCATTCTAT GAGTGAAAAG
GGAATCACAT TGAAATTATT AGAATTAGTT GAAAAGTTAT AA
 
Protein sequence
MKEKKLFPII EPREKGFLQV SRIHSIYWER SGNQNGKKIL VIHGGPGGGS QPRYRRYFNP 
EKFDIIQFDQ RGCGASRPFS ELRENTTKDL VDDIEKLRLN LNIDSWHLFG GSWGSTLALI
YAIKHPSRVK SMTLRGIFLC RKFELSWFYQ YGASEIFPEE FEKYISVIPK DERSDLVSSF
YKYLTSPNIE LRSKAAAAWT TWELSTSHLI KRDIDVGKSK TNSFSDAFAR IECHYFINHI
FLEEDFIMKN IKTIESIPTK IIQGRYDVVC PVRSAWDLNK KLKNSELIII DEAGHSMSEK
GITLKLLELV EKL
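
The protein sequence is the translation of the gene sequence under translation table 11. As a minimal sketch, the snippet below builds a codon table in the standard NCBI base order (T, C, A, G); table 11 assigns the same amino acids as the standard code and differs only in which codons may act as starts. The `translate` helper is hypothetical, strips the whitespace used in the 10-base blocks above, and stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (first base varies slowest, third fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAAAGAAA AAAAATTATT CCCCATAATA"))  # MKEKKLFPII
```

Passing the full gene sequence in place of the 30-base excerpt gives the complete translation for comparison with the protein sequence listed above.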