Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9515_16701 |
Symbol | |
ID | 4720473 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9515 |
Kingdom | Bacteria |
Replicon accession | NC_008817 |
Strand | - |
Start bp | 1464257 |
End bp | 1465387 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640081362 |
Product | trypsin-like serine protease |
Protein accession | YP_001011984 |
Protein GI | 123966903 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGATTTT TAAAAAATAA ATTTATTTAT TTATTTAAAT TGGGCATCGT GCTATTTGCT TTTTTAATTA ATTTTTTGCC TTTGTCTGAA GTTTTTGCTT TAAATTCTCT CGATGGGCAT AATTTCGTAT CGGACGCAGT TAAAAATGTA GGTCCTGCAG TGGTTAGAAT TGATACTGAA AGATTAGTAG AAAGACAACA GTTTGATCCA ACTTTATTAG ATCCATTATT AAGAGATTTA CTAGGGGAAC CCGGAATGGC TCCTGACAGA GAAAGAGGTC AAGGTTCAGG TGTGATAATT AACAAAAATG GTTTGGTTTT AACAAATGCT CATGTTGTAG AAAGAGTTGA TAATGTGTCA GTGACGTTGG CGGATGGAAC TAATTGTGAT GGGAAAGTAT TGGGAACCGA TTCGATTACT GATTTAGCGT TAGTTAAAAT CGAACAACTT ATTGATTCAA GTTATGCTCC TTTAGGAGAT TCAGAGAAAC TTGAAGTTGG GGATTGGGCA ATAGCTCTTG GTACGCCGTA TGGCCTTGAG AAAACAGTTA CTCTTGGCAT AGTTAGCAGT CTGCATAGAG ATATCAATTC ACTAGGTTTT TCTGATAAAA GGCTTGATCT AATTCAAACA GATGCCGCAA TTAACCCAGG TAATTCTGGA GGTCCGCTCA TAAATTCTAA TGGCCAGGTT ATTGGCATAA ATACACTCGT TAGAAGTGGA CCTGGAGCTG GCCTAGGTTT TGCAATACCT ATAAATTTAG CTAAAAATGT TTCTGACCAA TTATTAGAGA ATGGTGAAGT TATTCATCCT TATTTAGGAG TACAATTAAT ATCCTTAAAT CCTAAAATGG CTAAACAACA CAACGAAGAT CCTAATGCAA TTGTTCAATT ACCCGAGAGG TCCGGAGCTT TAATTCAGTC TATAGTTCCA AATAGTCCTG CAGAAAAAGC AGGTTTGAAA AGAGGTGACT TAGTAATTGC AGCTGAAAAT ATATCAATAG AAGAACCAAA AACTCTTTTA GATGAAGTAG AAAAAGCTCA AATTGGAAAA GTATTTCTTT TAAATGTTGT GAGGGATAAT AAAGAAATCA AAGTTAATAT TAAACCTGAA GCACTTCCAG GTTTGACATA A
|
Protein sequence | MRFLKNKFIY LFKLGIVLFA FLINFLPLSE VFALNSLDGH NFVSDAVKNV GPAVVRIDTE RLVERQQFDP TLLDPLLRDL LGEPGMAPDR ERGQGSGVII NKNGLVLTNA HVVERVDNVS VTLADGTNCD GKVLGTDSIT DLALVKIEQL IDSSYAPLGD SEKLEVGDWA IALGTPYGLE KTVTLGIVSS LHRDINSLGF SDKRLDLIQT DAAINPGNSG GPLINSNGQV IGINTLVRSG PGAGLGFAIP INLAKNVSDQ LLENGEVIHP YLGVQLISLN PKMAKQHNED PNAIVQLPER SGALIQSIVP NSPAEKAGLK RGDLVIAAEN ISIEEPKTLL DEVEKAQIGK VFLLNVVRDN KEIKVNIKPE ALPGLT
|
| |