Gene Information |
Locus tag | P9303_02461 |
Symbol | |
ID | 4778226 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 259014 |
End bp | 260201 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640085750 |
Product | serine protease |
Protein accession | YP_001016266 |
Protein GI | 124021959 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGCGT TGCTTGAGGA ATCCTCCGCA GAGTTGCGGT TATCCAGCCA ATCTTCAGGT TTGAGCAGGG TTTTCCGTGG TGGTCTTCTG CTTGGCACTG GATTGGTTTG TCTGCCTGTG TTGTCGGGAC TTTCCCCTCA ACCGCTGCAG GCGGCATCAG CCGCGACAGC CTTATCCCGG CAGTCGTTTG TAGCGGATGC GGTGGCTCGC AGCGGCCCTG CTGTGGTGAC TCTTGAGACG AGTCGGACGG TCCGCTCTAT GGGCATGGCT GGTCTGCCTC AGGGACTGTT AGCAGACCCG CTGTTCCAAC ATTTCTTTGG TTTACCTGGC AGGGTCGCTC CCCGCGCGCG GATAGAACGA GGGCAGGGAA GTGGTGTGAT TTTTTCGGCG GAAGGATTGG TTCTCACCAA TGCTCATGTC GTGGAGAAAA CCGATCAGCT GATGGTGGGA TTGCCAGATG GTCGAAGGGT GTCTGGGCGA CTGGTTGGTC AGGACACGAT TACGGATCTG GCTGTAGTGC AGTTGGATGG CTCTGGGCCT TGGCCCACAG CTCCATTGGG AGACTCGGAC CAGCTTCGGG TCGGGGATTG GGCCATAGCC GTTGGCAACC CCTTTGGCCT TGAAAATACG GTGACGTTGG GGATCGTCAG CAACCTCAAC CGCAACGTCT CTCAGCTAGG GATCTCTGGC AAGCGGTTGG ATTTGATTCA GACCGATGCT GCTATCAATC CAGGCAACTC AGGAGGACCG TTGTTGAATT CTGAGGGCAA TGTGGTGGGT ATCAATACAC TCGTTCGCTC TGGGCCAGGC GCTGGGCTGG GTTTTGCTAT TCCTATTAAT CGGGCCAGGA CCATCGCCCA GCAGCTGGTG GAGCGAGGAC GGGCCAGTCA TCCGATGGTG GGAGTAGGTC TCTCGCCGGT GCCATCTGCT CGTTCTGGGG AAGCCAATTC TCCTGGTGCT GTGATTCGTT CCGTGGTGCC GGGTGGGCCT GCAGCAAGTG CCGGTTTGAA GGTTGATGAT GTGATCGTTT CGGTTGAAGG GTTACCGATT GATGGGCCTG CTGAGGTAGT GAGCGCGATT GATCGTCATG GAGTTGGGAG TCCAATCACT CTTGGATTAA TCCGGGGCGA CAGTCGGATT GAGCTGGCGG TTACACCAGT GGAGCTGACG GCGATGCAGG CACCTTGA
|
Protein sequence | MSALLEESSA ELRLSSQSSG LSRVFRGGLL LGTGLVCLPV LSGLSPQPLQ AASAATALSR QSFVADAVAR SGPAVVTLET SRTVRSMGMA GLPQGLLADP LFQHFFGLPG RVAPRARIER GQGSGVIFSA EGLVLTNAHV VEKTDQLMVG LPDGRRVSGR LVGQDTITDL AVVQLDGSGP WPTAPLGDSD QLRVGDWAIA VGNPFGLENT VTLGIVSNLN RNVSQLGISG KRLDLIQTDA AINPGNSGGP LLNSEGNVVG INTLVRSGPG AGLGFAIPIN RARTIAQQLV ERGRASHPMV GVGLSPVPSA RSGEANSPGA VIRSVVPGGP AASAGLKVDD VIVSVEGLPI DGPAEVVSAI DRHGVGSPIT LGLIRGDSRI ELAVTPVELT AMQAP
|
| |
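The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon not counted in the protein length. The variable and function names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 259014
end_bp = 260201
gene_length_bp = 1188
protein_length_aa = 395

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the final codon is a stop
# codon that does not appear in the protein sequence.
codons = span_bp // 3
expected_protein_aa = codons - 1


def record_is_consistent():
    """Return True if coordinates, gene length, and protein length agree."""
    return (
        span_bp == gene_length_bp
        and span_bp % 3 == 0
        and expected_protein_aa == protein_length_aa
    )


print(span_bp, codons, expected_protein_aa, record_is_consistent())
```

Under these assumptions the 1188 bp span corresponds to 396 codons, that is, 395 amino acids plus the terminal stop codon, which agrees with the listed Protein Length.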