Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_04311 |
Symbol | |
ID | 4776209 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 432194 |
End bp | 433282 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640085935 |
Product | trypsin-like serine protease |
Protein accession | YP_001016448 |
Protein GI | 124022141 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGTGC TGTTCCCTCT TGCAGCCGGG GTTCAGCCTG TTTGGGCTTT GTCTGGTTTA GATGGGACGA CTAGCCATAA CTTCGTTGCT GATGCAGTAA GTCAGGTGGC GCCCGCAGTG GTTCGCATCG ATACGGAACG CACTGTGCAA CGTCAGCCCT TTGATCCCAC GCTGATTGAT CCCTTGCTCA GAGATCTCTT GGGCGAGCCA GGAATTGGGC CAGAGCGTGA GCGGGGTCAG GGTTCGGGTG TCGTGATCGA TGACCAGGGG TTGGTGCTGA CCAATGCCCA CGTGGTTGAA CGGGTGGATG CGGTCAGCGT CACCCTTGCC GATGGAGATC AACACGATGG TTCGGTTGTT GGGACGGATC CTGTTACTGA TCTGGCTCTT GTGCGACTGG ATGGGGGCAC ACGTCCTGAG GCCGCCCCTC TTGGAGATTC TGATGCGCTT GAGGTAGGCG ATTGGGCGAT CGCTCTTGGT ACTCCCTATG GCCTTGAACG CACCGTCACC CTTGGCATTG TTAGCAGCCT GCATCGCAAT ATCAGCAGCC TTGGCTTCTC TGATAAACGT CTGGATTTGA TTCAGACCGA TGCCGCGATT AACCCTGGTA ATTCCGGTGG TCCACTGGTG AATGGTCGTG GTGAGGTGAT CGGTATCAAC ACACTGGTTC GTTCTGGTCC AGGCGCTGGT TTGGGATTTG CTATTCCGAT CAATTTGGCT CGACATGTTT CTGAGCAGCT TTTGACCAGT GGGGAGGTGG TGCATCCTTA TTTGGGTGTC CAATTGGTGC CGCTGACAGC TCGTATTGCC AGGGAGCACA ATCGTGATCC GAATTCGCTG GTGGAATTAC CCGAACGCTT GGGGGCGCTT GTGCAGAGTG TTTTGCCGGA TAGCCCGGCG GAACGAGCTG GTTTGCGGCG TGGTGATCTT GTGATTGCGG CAGCTGAAAC ATCAGTCTCT GATCCACAAA TGCTGCTTAA ACAGGTTGAT CAGGCTGAGA TCGGTGTCCC CTTCTCATTA AGGATCATGC GCAATGGTCA AGAGATGAGC CTTTCGGTTA ATCCAGCCGC ATTACCTGGC CTTAGTTGA
|
Protein sequence | MSVLFPLAAG VQPVWALSGL DGTTSHNFVA DAVSQVAPAV VRIDTERTVQ RQPFDPTLID PLLRDLLGEP GIGPERERGQ GSGVVIDDQG LVLTNAHVVE RVDAVSVTLA DGDQHDGSVV GTDPVTDLAL VRLDGGTRPE AAPLGDSDAL EVGDWAIALG TPYGLERTVT LGIVSSLHRN ISSLGFSDKR LDLIQTDAAI NPGNSGGPLV NGRGEVIGIN TLVRSGPGAG LGFAIPINLA RHVSEQLLTS GEVVHPYLGV QLVPLTARIA REHNRDPNSL VELPERLGAL VQSVLPDSPA ERAGLRRGDL VIAAAETSVS DPQMLLKQVD QAEIGVPFSL RIMRNGQEMS LSVNPAALPG LS
|
| |