Gene Information |
Locus tag | P9301_03901 |
Symbol | |
ID | 4911826 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | + |
Start bp | 346265 |
End bp | 347215 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 640159966 |
Product | proline iminopeptidase |
Protein accession | YP_001090614 |
Protein GI | 126695728 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
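
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions agree with the values in this record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(346265, 347215)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 951 316
```

Both results match the Gene Length and Protein Length fields listed in the record.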
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0624161 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAAAGATC AAGTCTTGTT TCCTAAAATT GAAGTTCGTG AAAAGGGTTT TTTACAAGTA AGTGATATTC ATACGATTTA TTGGGAAAGA TCAGGCAACC CAAATGGTAA AAAAATACTA GTTATTCATG GAGGCCCAGG AGGAGGAAGT CAACCAAGAT ATAGAAGATA CTTTGATCCA GATAAATTCG ATATTATTCA ATTTGACCAA AGAGGTTGCG GTTCTTCAAC TCCTTTCTCA GAATTAAAAG AAAATTCGAC TAATCATTTA GTTGAGGATA TTGAGAAATT AAGGATTTTA TTAAAAATAG ATAGTTGGCA TTTGTTTGGT GGATCTTGGG GCTCAACACT TTCACTTATA TATGCAATTA AAAATCCCTC TAGAGTTATA AGCTTAACTT TGCGAGGAAT ATTTTTATGT AGAAAGTTTG AATTATTGTG GTTCTATCAA TATGGTGCAA GTGAGATATT CCCTGATGAA TTTGAAGAAT ATATTTCTGT GATACCAAAA CAAGAAAGAA ATGATTTGAT AACTTCTTTT TATAAATATC TAACATCATC AGATGTAAAT CTTAGATCAA AAGCAGCAGC AGCTTGGACA AAATGGGAAC TCTCAACAAG TCATTTAATA AATAAAAAAT TTGATTTTGA TAAGTCTGAA GTTAATTCTT TTTCAGATGC CTTTGCAAGA ATCGAATGTC ATTATTTTAT TAATAATATT TTCTTAGAGG ATGATTTTAT TTTGAAAAAT ATAAAAACAA TAGAATCGAT TCCAACAAAA ATAATTCAGG GTAGGTATGA CGTTGTGTGT CCTGTTAGGA GTGCTTGGGA TCTAAATAAA AAATTAAAGA ATTCTGAATT AATTATTGTT GATGATGCTG GTCATTCAAT GAGTGAAAAG GGCATTACTA AAGAATTAAT AAAAGCTATA AAAGGAATTC AAAATCTCTA A
Protein sequence | MKDQVLFPKI EVREKGFLQV SDIHTIYWER SGNPNGKKIL VIHGGPGGGS QPRYRRYFDP DKFDIIQFDQ RGCGSSTPFS ELKENSTNHL VEDIEKLRIL LKIDSWHLFG GSWGSTLSLI YAIKNPSRVI SLTLRGIFLC RKFELLWFYQ YGASEIFPDE FEEYISVIPK QERNDLITSF YKYLTSSDVN LRSKAAAAWT KWELSTSHLI NKKFDFDKSE VNSFSDAFAR IECHYFINNI FLEDDFILKN IKTIESIPTK IIQGRYDVVC PVRSAWDLNK KLKNSELIIV DDAGHSMSEK GITKELIKAI KGIQNL
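
The sequences above are displayed in blocks of 10 characters separated by spaces, so the spaces have to be removed before any computation. The following is a minimal sketch, with hypothetical helper names, of how the displayed text might be cleaned and how a GC fraction could be computed. It uses only the first two blocks of the gene sequence as input, so the value it produces describes that fragment and not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the display whitespace between 10-character blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence shown above (a fragment only).
fragment = clean_sequence("ATGAAAGATC AAGTCTTGTT")
print(len(fragment), gc_fraction(fragment))
```

Applying the same two functions to the full Gene sequence field would give the length and GC fraction to compare against the Gene Length and GC content fields.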