Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_01011 |
Symbol | |
ID | 5731564 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 105523 |
End bp | 106665 |
Gene Length | 1143 bp |
Protein Length | 380 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641284444 |
Product | serine protease |
Protein accession | YP_001549986 |
Protein GI | 159902642 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.336111 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCTAA AGAAACTCAT TGCTGAATCA GGATTTGGTG TATTACTTGT TGGGGCCTAT GTTTTTACTA ATGGCACTCA GCAAATTTCA GCTGCAACAA ATTTCAAATT AGCTCAATTA ACAGTTAGAT CTCAAAACTC TTTTGTTACT GAAGCCATCA ACGAAAGTGG ACCTGCTGTC GTTACTGTGG AGACTCAAAG ACAAGTAGTT TCCAGAAATA ATCTTTTCCC TCCCAATTTT TTTATAGATC CTTCTTTAGA GAGGTTTTTC AATCAGCCAA AATTAAAAAT GCCCAAATCT AGATTGCAGC TAGGTCATGG GAGTGGAGTG ATTTTTTCTT CTAAAGGTCT GGTATTAACT AACGCACATG TGATTGAAAA TACTGACAAA TTAGTTGTTG GCTTGTCAGA TGGAAGAAGA TTCCCAGCCA GGGTGATCGG TCAAGACGCC CTCACAGATC TAGCCGTGAT AGGTATAGAA GGAAAGGGTC CATGGCCAAT TGCAAAATTA GGCGATTCCG ACAAACTTGT TGTAGGTGAA TGGGCTATTG CCGTTGGAAG TCCTTTTGGT CTAGAAAAAA CAGTGACACT AGGGATTATT AGTAACCTTA ATAGAAATGT TTCTCAGCTA GGTATTGCAG ACAAAAGGTT AAAGCTTATA CAAACTGATG CAGCAATCAA TCCAGGCAAT TCTGGTGGTC CATTACTAAA CTCTAATGGA GAAGTAATAG GAATTAATAC ATTAGTCAGA TCTGGCCCAG GGGCAGGCCT AGGTTTTGCA ATACCTATCA ATCAAGCAAT TCAGATTGCA AGTCAATTAG TAGCAAGAGG CAAAGCCATC CATCCAATGA TTGGAGTAAA CCTTACTTAT TTAATAAATC AACCTGAAGA CAACTATATC TCTACAAAAG GGGCACAAAT TATAAATATT CTTCCTGGAA GTCCAGCTGA GAAAGAAGGT CTAAAGGTTA ATGATATTAT TCTTGCAATT AATGGTATAA AAGTTGATGG TCCTCAAGAT GTAGTTGACA AAATTAATAA AAATGGATTG AGTAAGAGGC TAAGATTGAC GCTTGTCAGA AACAAAAGGA GGATAACTGT CTCTATACTC CCAGTAGATA TAAGCAATTT CAAAAAAGAT TAA
|
Protein sequence | MSLKKLIAES GFGVLLVGAY VFTNGTQQIS AATNFKLAQL TVRSQNSFVT EAINESGPAV VTVETQRQVV SRNNLFPPNF FIDPSLERFF NQPKLKMPKS RLQLGHGSGV IFSSKGLVLT NAHVIENTDK LVVGLSDGRR FPARVIGQDA LTDLAVIGIE GKGPWPIAKL GDSDKLVVGE WAIAVGSPFG LEKTVTLGII SNLNRNVSQL GIADKRLKLI QTDAAINPGN SGGPLLNSNG EVIGINTLVR SGPGAGLGFA IPINQAIQIA SQLVARGKAI HPMIGVNLTY LINQPEDNYI STKGAQIINI LPGSPAEKEG LKVNDIILAI NGIKVDGPQD VVDKINKNGL SKRLRLTLVR NKRRITVSIL PVDISNFKKD
|
| |