Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PMN2A_0523 |
Symbol | |
ID | 3605897 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL2A |
Kingdom | Bacteria |
Replicon accession | NC_007335 |
Strand | - |
Start bp | 1032356 |
End bp | 1033306 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 637687382 |
Product | prolyl aminopeptidase |
Protein accession | YP_291717 |
Protein GI | 72382362 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.870437 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTATTTC CGGAAATAGA ACCTAACGAA AAAGGGATGT TAAATGTCAG TCCACTACAT TCCATCTATT GGGAAAGAAG TGGTAATCCC AATGGTTTGT CTGTCTTAAT TATTCATGGC GGCCCAGGAG GAGGTAGCAG TCCTTCTTAT CGAAGATATT TTGATCCAAA AAAATTTAAT ATTGTTCAAT TTGATCAAAG AGGATGTGGT AGATCTACTC CACATTCTGA GTTAGAGGAG AACACCACTC ATCATTTAAT TGAAGATATT GAAAAGATAA GACAGCTCTT GAAAATTGAG TCATGGCATG TTTTTGGAGG CTCATGGGGG TCAACCCTAA GCTTGATTTA TGCCATTCAA CACACAGAAA AAGTTTTAAG TCTTACTCTT AGGGGGATAT TTTTATGCAG ACAACACGAA TTAACTTGGT TCTATCAAAA AGGTGCAAGT GAGATTTTCC CTGAAGAATT CGATCTTTAT CAATCTGTCA TTCCGCTAAA TGAGAGAGGG AATTTGATTA ATGCTTTTCA TAAGAGATTA ACAAGTCAAG ATAGGTCTGA GAGAACTCAA GCGGCACATG CTTGGACGAG ATGGGAAATG TCAACTAGCT ATCTCAAGCC AAAAGAATTA TCAATTAATA AAGCTACTAA TGATAATTTC TCAGACTCTT TTGCGCGTAT AGAATGTCAT TATTTTATTA ATAATATTTT TTTAGAGGAG AACTATATTC TAAAAAATAT TAATAAGCTA AAAGGTATTC CTGTTTCGAT TGTTCAGGGA AGATATGACG TTGTTTGTCC AATGAGAAGT GCATGGGACC TTAATAAAGC ATTGCCCACT TCTAAACTTT ATGTAATCGA TAATGCAGGA CATTCAATGA AAGAAATTGG AATTTCTAAA AGATTAATTG AATTGACAAA TGAGTTAGCC AATTCTTTCT CTAATCTCTA A
|
Protein sequence | MLFPEIEPNE KGMLNVSPLH SIYWERSGNP NGLSVLIIHG GPGGGSSPSY RRYFDPKKFN IVQFDQRGCG RSTPHSELEE NTTHHLIEDI EKIRQLLKIE SWHVFGGSWG STLSLIYAIQ HTEKVLSLTL RGIFLCRQHE LTWFYQKGAS EIFPEEFDLY QSVIPLNERG NLINAFHKRL TSQDRSERTQ AAHAWTRWEM STSYLKPKEL SINKATNDNF SDSFARIECH YFINNIFLEE NYILKNINKL KGIPVSIVQG RYDVVCPMRS AWDLNKALPT SKLYVIDNAG HSMKEIGISK RLIELTNELA NSFSNL
|
| |