Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PMT9312_1583 |
Symbol | |
ID | 3766400 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9312 |
Kingdom | Bacteria |
Replicon accession | NC_007577 |
Strand | - |
Start bp | 1478941 |
End bp | 1480071 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 637798120 |
Product | trypsin-like serine protease |
Protein accession | YP_398079 |
Protein GI | 78779967 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTTTC TCAAGATTAA ATTAATTAAT TTAATCCAAA TATTCATTAT TGTTTGTTTT TGTTTAGTTA ATTTCTCTCA AAGAGCTGAA GTTTTAGCCT TGACTTCTTC TGATAGTCAC AATTTCGTAT CATCTGCGGT TAGAAATGTT GGACCTGCAG TGGTAAAAAT TGATACTGAG CGATTCGTTG AGAGGCAACA ATTTGATCCA ACTTTACTGG ACCCTTTATT GAGGGATTTG CTTGGGGAGC AAGGAATTAC ACCTGAGAGA GAGAGAGGTC AAGGTTCTGG AGTAATTATT AATGAGAATG GTTTGGTTCT TACTAATGCT CATGTCGTAG ACAGAGTTGA TGATGTTTTA GTGACTTTGG CAGATGGAAG TATTTGCGAT GGCCAAGTTT TGGGAACAGA TGCAGTAACT GACCTGGCTT TAGTAAAAAT TGAGGAATCT ACATTTTCTA GTTTTGCTCC CCTTGGAAAT TCTGAAGATC TTCAAGTTGG AGATTGGGCA ATAGCACTAG GTACTCCCTA TGGTCTAGAA AAAACAGTTA CCTTAGGAAT TGTAAGTAGT TTACATAGGG ATATTAATAG TCTAGGGTTT TCAGATAAAA GGCTTGATCT TATTCAGACT GATGCGGCAA TAAATCCTGG GAATTCTGGA GGACCACTTA TTAATTCTAA TGGTGAAGTA ATAGGAATCA ATACATTGGT AAGAAGTGGC CCTGGAGCTG GTCTAGGTTT CGCAATACCT ATTAATCTAG CTAAAAGTGT TTCTGATCAG CTTCTCAATA ATGGTGAAGT TATTCATCCA TATTTAGGGG TTCAATTAAT TTCTTTGAAT CCTAGAATTG CTAAAGAACA TAATCAAGAC CCCAATTCAT TAGTCCAATT ACCTGAACGA AATGGGGCTC TTATTCAATC AGTCATACCT AATAGTCCTG CTGAAAAAGC TGGTTTAAGA AGAGGTGATT TAGTTATAGC AGCCGAAAAC ATCTCTATAG AAGAACCTAA AGCTTTGCTA GATGAAGTTG AAAAAGCTCA GATAGGAAAA GTATTCCTTT TAAATGTTTT AAGAGATAAT AAAGAGATAA AGATAAATAT CAAACCAGAA CCTCTACCAG GTTTGACATA A
|
Protein sequence | MKFLKIKLIN LIQIFIIVCF CLVNFSQRAE VLALTSSDSH NFVSSAVRNV GPAVVKIDTE RFVERQQFDP TLLDPLLRDL LGEQGITPER ERGQGSGVII NENGLVLTNA HVVDRVDDVL VTLADGSICD GQVLGTDAVT DLALVKIEES TFSSFAPLGN SEDLQVGDWA IALGTPYGLE KTVTLGIVSS LHRDINSLGF SDKRLDLIQT DAAINPGNSG GPLINSNGEV IGINTLVRSG PGAGLGFAIP INLAKSVSDQ LLNNGEVIHP YLGVQLISLN PRIAKEHNQD PNSLVQLPER NGALIQSVIP NSPAEKAGLR RGDLVIAAEN ISIEEPKALL DEVEKAQIGK VFLLNVLRDN KEIKINIKPE PLPGLT
|
| |