Gene PMN2A_0523 details

Gene Information

Locus tag: PMN2A_0523
Symbol:
ID: 3605897
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL2A
Kingdom: Bacteria
Replicon accession: NC_007335
Strand:
Start bp: 1032356
End bp: 1033306
Gene Length: 951 bp
Protein Length: 316 aa
Translation table: 11
GC content: 34%
IMG OID: 637687382
Product: prolyl aminopeptidase
Protein accession: YP_291717
Protein GI: 72382362
COG category: [R] General function prediction only
COG ID: [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)
TIGRFAM ID: [TIGR01249] proline iminopeptidase, Neisseria-type subfamily
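
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information record.
START_BP = 1032356
END_BP = 1033306
GENE_LENGTH_BP = 951
PROTEIN_LENGTH_AA = 316


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
lengths_agree = span_length(START_BP, END_BP) == GENE_LENGTH_BP
protein_agrees = expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, 1033306 - 1032356 + 1 gives 951 bp, and 951 / 3 - 1 gives 316 aa, matching the listed values.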


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.870437
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTATTTC CGGAAATAGA ACCTAACGAA AAAGGGATGT TAAATGTCAG TCCACTACAT 
TCCATCTATT GGGAAAGAAG TGGTAATCCC AATGGTTTGT CTGTCTTAAT TATTCATGGC
GGCCCAGGAG GAGGTAGCAG TCCTTCTTAT CGAAGATATT TTGATCCAAA AAAATTTAAT
ATTGTTCAAT TTGATCAAAG AGGATGTGGT AGATCTACTC CACATTCTGA GTTAGAGGAG
AACACCACTC ATCATTTAAT TGAAGATATT GAAAAGATAA GACAGCTCTT GAAAATTGAG
TCATGGCATG TTTTTGGAGG CTCATGGGGG TCAACCCTAA GCTTGATTTA TGCCATTCAA
CACACAGAAA AAGTTTTAAG TCTTACTCTT AGGGGGATAT TTTTATGCAG ACAACACGAA
TTAACTTGGT TCTATCAAAA AGGTGCAAGT GAGATTTTCC CTGAAGAATT CGATCTTTAT
CAATCTGTCA TTCCGCTAAA TGAGAGAGGG AATTTGATTA ATGCTTTTCA TAAGAGATTA
ACAAGTCAAG ATAGGTCTGA GAGAACTCAA GCGGCACATG CTTGGACGAG ATGGGAAATG
TCAACTAGCT ATCTCAAGCC AAAAGAATTA TCAATTAATA AAGCTACTAA TGATAATTTC
TCAGACTCTT TTGCGCGTAT AGAATGTCAT TATTTTATTA ATAATATTTT TTTAGAGGAG
AACTATATTC TAAAAAATAT TAATAAGCTA AAAGGTATTC CTGTTTCGAT TGTTCAGGGA
AGATATGACG TTGTTTGTCC AATGAGAAGT GCATGGGACC TTAATAAAGC ATTGCCCACT
TCTAAACTTT ATGTAATCGA TAATGCAGGA CATTCAATGA AAGAAATTGG AATTTCTAAA
AGATTAATTG AATTGACAAA TGAGTTAGCC AATTCTTTCT CTAATCTCTA A
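
The GC content listed in the record could be recomputed from the gene sequence with a helper like the one below. This is a minimal sketch: `gc_content` is a hypothetical name, and it assumes the sequence is pasted in as shown here, with spaces and line breaks separating the 10-base blocks.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (block separators, line breaks) is ignored and the
    comparison is case-insensitive.
    """
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# Example on the first line of the gene sequence only; pass the full
# 951 bp sequence to compare against the GC content in the record.
first_line = "ATGTTATTTC CGGAAATAGA ACCTAACGAA AAAGGGATGT TAAATGTCAG TCCACTACAT"
first_line_gc = gc_content(first_line)
```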
 
Protein sequence
MLFPEIEPNE KGMLNVSPLH SIYWERSGNP NGLSVLIIHG GPGGGSSPSY RRYFDPKKFN 
IVQFDQRGCG RSTPHSELEE NTTHHLIEDI EKIRQLLKIE SWHVFGGSWG STLSLIYAIQ
HTEKVLSLTL RGIFLCRQHE LTWFYQKGAS EIFPEEFDLY QSVIPLNERG NLINAFHKRL
TSQDRSERTQ AAHAWTRWEM STSYLKPKEL SINKATNDNF SDSFARIECH YFINNIFLEE
NYILKNINKL KGIPVSIVQG RYDVVCPMRS AWDLNKALPT SKLYVIDNAG HSMKEIGISK
RLIELTNELA NSFSNL
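
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds the codon table by hand rather than using a library; it relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The `translate` helper is illustrative, not part of the database's tooling.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
# Table 11 shares these codon-to-amino-acid assignments with the
# standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cleaned = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        amino_acid = CODON_TABLE[cleaned[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGTTATTTC CGGAAATAGA ACCTAACGAA"))  # MLFPEIEPNE
```

Passing the full gene sequence to `translate` should yield a string that can be compared against the protein sequence once the spaces between its 10-residue blocks are removed.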