Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9301_16811 |
Symbol | |
ID | 4912094 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | - |
Start bp | 1412773 |
End bp | 1413903 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640161278 |
Product | trypsin-like serine protease |
Protein accession | YP_001091905 |
Protein GI | 126697019 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATTTA TCAAGATTAA ATTTATTAAT TTAATCCAAA TTTTCATTAT TTTTTGTTTT TGTTTAGTCA ATTTCTCTCA AAAAGCTGAA GTTTTAGCTT TAACATCTTC AGAAAGTCAT AATTTCGTAT CATCCGCAGT TAAAAATATT GGCCCTGCAG TTGTAAAAAT TGACACTGAG CGCTTGGTAG AGAGGCAACA ATTTGATCCT ACTTTACTTG ACCCATTATT AAGGGATTTA CTTGGTGAGC AAGGCATTAC TCCTGAAAGG GAGAGAGGAC AAGGCTCTGG GGTTATCATT AATGAAAATG GTTTGGTTCT TACAAACGCT CATGTCGTAG AAAGAGTCGA TAATGTTTCA GTTACTTTGG CAGATGGATC TATTTGTGAT GGTGAAGTTT TGGGGACGGA TACAGTAACT GATCTTGCTT TAGTAAAAAT TGATGAAGAT GCTTATTCTG GTTTTGCTCC ACTTGGAAAT TCTGAAGATC TTGAAGTTGG GGATTGGGCA ATAGCTCTTG GTACTCCTTA TGGTCTTGAA AAAACAGTTA CCTTAGGGAT TGTAAGCAGC CTGCATAGAG ATATTAATAG TTTAGGATTT TCAGATAAAA GGTTGGATCT TATTCAGACT GATGCGGCAA TAAATCCAGG AAATTCTGGG GGACCACTCA TAAATTCCAA TGGCGAGGTA ATTGGAATCA ATACATTAGT AAGAAGTGGC CCTGGAGCAG GTCTAGGTTT TGCGATTCCC ATCAATCTGG CTAAAAGTGT TTCTGATCAG CTACTCAAAA ATGGGGAAGT GATTCATCCA TATTTAGGGG TACAATTAAT TTCTTTAAAT CCTAGAATTG CTAAAGAACA TAATCGAGAT CCCAATTCTT TAGTTCAATT ACCCGAAAGA AACGGAGCTC TAATTCAATC AGTAATACCT AATAGCCCCG CTGAAAAAGC TGGTTTAAGA AGAGGAGATT TAGTAATAGC AGCCGAAAAT ATCTCTATAA ATGAGCCTAA GACTTTATTA GATGAAGTAG AAAAAGCTCA GATAGGAAAA GTATTTCTTT TAAATATTTT GAGAGATAAT AAAGAGATAC AGATAAATAT CAAACCAGAA CCTCTCCCAG GTTTGACATA A
|
Protein sequence | MKFIKIKFIN LIQIFIIFCF CLVNFSQKAE VLALTSSESH NFVSSAVKNI GPAVVKIDTE RLVERQQFDP TLLDPLLRDL LGEQGITPER ERGQGSGVII NENGLVLTNA HVVERVDNVS VTLADGSICD GEVLGTDTVT DLALVKIDED AYSGFAPLGN SEDLEVGDWA IALGTPYGLE KTVTLGIVSS LHRDINSLGF SDKRLDLIQT DAAINPGNSG GPLINSNGEV IGINTLVRSG PGAGLGFAIP INLAKSVSDQ LLKNGEVIHP YLGVQLISLN PRIAKEHNRD PNSLVQLPER NGALIQSVIP NSPAEKAGLR RGDLVIAAEN ISINEPKTLL DEVEKAQIGK VFLLNILRDN KEIQINIKPE PLPGLT
|
| |