Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9515_03961 |
Symbol | |
ID | 4719584 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9515 |
Kingdom | Bacteria |
Replicon accession | NC_008817 |
Strand | + |
Start bp | 359937 |
End bp | 360878 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 640080069 |
Product | proline iminopeptidase |
Protein accession | YP_001010712 |
Protein GI | 123965631 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGAAA AAAAATTATT CCCCATAATA GAACCAAGAG AAAAGGGTTT TTTGCAAGTA AGTAGAATTC ATTCTATCTA TTGGGAAAGA TCAGGAAATC AAAATGGTAA AAAGATACTA GTAATTCATG GCGGTCCTGG TGGAGGAAGT CAGCCAAGAT ATAGAAGATA TTTTAATCCT GAGAAATTTG ATATTATTCA ATTTGATCAA AGAGGTTGCG GTGCCTCAAG ACCTTTTTCA GAATTAAGAG AAAATACAAC AAAAGATCTA GTAGATGATA TTGAAAAATT AAGATTAAAT CTTAATATAG ATTCTTGGCA TTTGTTTGGA GGATCATGGG GATCTACGTT AGCTTTGATT TATGCAATAA AACATCCATC TAGGGTCAAA AGTATGACCT TAAGAGGAAT ATTTTTATGT AGAAAATTTG AACTCTCATG GTTCTACCAA TATGGTGCAA GTGAAATTTT CCCAGAAGAG TTTGAAAAAT ATATCTCAGT TATTCCTAAA GATGAAAGGT CTGATTTGGT ATCTTCTTTT TATAAATATT TAACATCTCC TAATATCGAA CTTAGATCAA AAGCTGCGGC TGCGTGGACA ACTTGGGAAC TTTCTACAAG TCATCTTATT AAGAGAGATA TTGATGTTGG TAAAAGTAAA ACTAATTCCT TTTCAGATGC TTTCGCAAGA ATAGAATGTC ATTATTTCAT AAACCATATC TTCTTAGAGG AGGATTTTAT TATGAAAAAT ATAAAGACTA TAGAATCTAT TCCAACTAAA ATCATTCAAG GAAGATATGA CGTGGTCTGT CCTGTGAGAA GTGCTTGGGA TCTAAATAAA AAATTAAAAA ATTCAGAATT AATAATCATT GATGAAGCTG GTCATTCTAT GAGTGAAAAG GGAATCACAT TGAAATTATT AGAATTAGTT GAAAAGTTAT AA
|
Protein sequence | MKEKKLFPII EPREKGFLQV SRIHSIYWER SGNQNGKKIL VIHGGPGGGS QPRYRRYFNP EKFDIIQFDQ RGCGASRPFS ELRENTTKDL VDDIEKLRLN LNIDSWHLFG GSWGSTLALI YAIKHPSRVK SMTLRGIFLC RKFELSWFYQ YGASEIFPEE FEKYISVIPK DERSDLVSSF YKYLTSPNIE LRSKAAAAWT TWELSTSHLI KRDIDVGKSK TNSFSDAFAR IECHYFINHI FLEEDFIMKN IKTIESIPTK IIQGRYDVVC PVRSAWDLNK KLKNSELIII DEAGHSMSEK GITLKLLELV EKL
|
| |