Gene Information |
Locus tag | A9601_03911 |
Symbol | |
ID | 4717086 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 348325 |
End bp | 349275 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 640078100 |
Product | proline iminopeptidase |
Protein accession | YP_001008786 |
Protein GI | 123967928 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
|
|
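The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon; both assumptions fit the values in this record but are not stated by it. The function names are hypothetical and chosen for illustration only.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(348325, 349275)   # 951
length_aa = protein_length(length_bp)     # 316
```

With these assumptions, the coordinates 348325 to 349275 give the listed 951 bp, and 951 bp gives the listed 316 aa.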
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.480314 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGATC AAGTCTTGTT CCCGAAAATT GAAGTACGTG AAAAGGGTTT TTTACAAGTT AGTGATATTC ATACTATTTA TTGGGAAAGA TCTGGCAATC CAAATGGCAA AAAAATACTT GTTATTCATG GAGGCCCAGG AGGAGGAAGT CAACCAAGAT ATAGAAGATA CTTTGACCCA GATAAATTCG ATATTATTCA ATTTGACCAA AGAGGTTGCG GTTCTTCAAC TCCTTTCTCC GAATTAAAAG AAAATACGAC TAATCATTTA GTAGATGATA TTGAGAAATT AAGGATTCTT TTTAAAATAG ATACTTGGCA TTTGTTTGGT GGATCTTGGG GCTCAACACT TTCACTAATA TATGCAATTA AAAATCCCTC AAGAGTTATA AGCTTAACTT TGCGAGGAAT ATTTTTATGT AGAAAGTTTG AATTATTATG GTTCTATCAA TATGGTGCAA GTGAGATATT CCCTGATGTA TTTGAAGAAT ATATTTCTGT AATACCAAAA GAAGAAAGAA ATGATTTGAT AAGTTCTTTT TACAAATATC TAACATCTAC AGATTCGAAT CTTAGATCAA AAGCTGCAGC AGCTTGGACA AAATGGGAAC TCTCAACTAG TCATTTAATA AATAAAAAAT TCGATTTTGA TAAGTCCCAA GTTAATTCTT TTTCAGATGC CTTTGCGAGG ATAGAATGTC ATTATTTTAT TAATAATATT TTCTTAGAAG ATGATTTTAT TTTGAAAAAT TTGAAAACAA TAGAATCGAT TCCAACAAAA ATAATTCAAG GTAGGTATGA CGTTGTATGT CCTGTTAGGA GTGCATGGGA TCTAAATAAG AAATTAAAGA ATTCTGAATT AATTATTGTT AATGATGCAG GTCATTCAAT GAGTGAGGAA GGCATTAGTA TCGAATTAAT AAAAGCTGTA AAAGGAATTC AAAATCTCTA A
|
Protein sequence | MKDQVLFPKI EVREKGFLQV SDIHTIYWER SGNPNGKKIL VIHGGPGGGS QPRYRRYFDP DKFDIIQFDQ RGCGSSTPFS ELKENTTNHL VDDIEKLRIL FKIDTWHLFG GSWGSTLSLI YAIKNPSRVI SLTLRGIFLC RKFELLWFYQ YGASEIFPDV FEEYISVIPK EERNDLISSF YKYLTSTDSN LRSKAAAAWT KWELSTSHLI NKKFDFDKSQ VNSFSDAFAR IECHYFINNI FLEDDFILKN LKTIESIPTK IIQGRYDVVC PVRSAWDLNK KLKNSELIIV NDAGHSMSEE GISIELIKAV KGIQNL
|
| |
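The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a minimal sketch in plain Python. It assumes the sequences are pasted as shown here, in space-separated blocks of 10, and it builds the codon table from the compact NCBI-style amino-acid string; the amino-acid assignments of translation table 11 are the same as those of the standard code (the tables differ in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, not part of any database tool.

```python
# Codons are enumerated in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of G and C bases in a non-empty DNA sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)
```

For example, `translate("ATGAAAGATC AAGTCTTGTT CCCGAAAATT")` returns `"MKDQVLFPKI"`, the first ten residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` and multiplying by 100 gives a value that can be compared with the GC content field.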