Gene Information |
Locus tag | P9303_08291 |
Symbol | |
ID | 4776690 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 754100 |
End bp | 755254 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640086338 |
Product | trypsin-like serine protease |
Protein accession | YP_001016845 |
Protein GI | 124022538 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCTTG CCCTATCTGC CCACAGGCTG CATCCGATCA GATGGCTTGG CTTGGTCCTC ATCAGCATCA ACCTAAGTGG CTGCAACGAA GGGTTACGTC AACGCATCGG CATTGGTTCT AAGACAAGCC CAGACAACAC CCCTGTCGTC AGTGATCCAC CCAATTCAGC GCCTCTGCAA CCTGGCACCA ATGTGATCGT GACTGCTGTG GAACAAGTAG GTCCGGCAGT GGTGCGCATC GACACGGTGA AACGGATCGC AAACCCCCTT GGCAACCTCT TCGGCGGCGG ACCTCCCATC CAACGGCAAG CAGGCCAGGG GTCAGGTTTC ATCACACGCT CTGACGGGCT GATCTTCACC AATGCTCATG TGGTTGATGG GGCAGAACAG GTATCGGTAA CCCTTCCAGA TGGCCGCAGT TACAGCGGCA AAGTGCTTGG TGGTGATCCC CTTACAGATG TCGCCGTGGT CAAAGTCGTG GCGAAGAAGC TTCCCGTGGC CCCCCTCGGC AATTCCAACA ACATCAAGCC TGGGCAATGG GCAATCGCTA TCGGCAATCC TCTTGGACTC AACAACACCG TGACTGCAGG CATCATCAGC TCCGTCGACC GCACCAACGC CTTAGGGGGG GGGCAACGAG TTCCTTACAT CCAAACTGAC GCCGCCGTAA ACCCTGGCAA TAGCGGAGGA CCACTCATCA ATGCCTCAGG ACAGGTGATC GGAATCAATA CTGCCATCAA AGTTGCACCG GGAGGCGGGC TGAGTTTTGC AGTACCGATC AACCTGGCCA AACGCATTGC CCAACAAATC GTGGGGAGAG GGCAAGCTTC TCATCCCTAT ATCGGGGTAA GGCTTCAGAG CCTTACCCCC CAGCTAGCCA AAGAAATCAA CGCAACAGGA GGGCAATGCC AGGTGCCTGA AGTCAATGCT GTTCTCGTTG TCGAAGTGAT GTCTCGCAGC CCTGCAGACA AAGCCGGCGT GCGCCAATGC GACTTAATTA GTGAGGTCAA TGGTGAGGTC GTCCGCGACC CTTCGCAAGT ACAACTTGCC GTTGATCGTG GGGAGGTTGG CAAGCCCATG CCGCTCACCC TTGAACGAAA CGACAAGACG ATCGAATTAA TTGTGAAACC AGCAGAGCTA CCCCGGCAGG GGTGA
|
Protein sequence | MTLALSAHRL HPIRWLGLVL ISINLSGCNE GLRQRIGIGS KTSPDNTPVV SDPPNSAPLQ PGTNVIVTAV EQVGPAVVRI DTVKRIANPL GNLFGGGPPI QRQAGQGSGF ITRSDGLIFT NAHVVDGAEQ VSVTLPDGRS YSGKVLGGDP LTDVAVVKVV AKKLPVAPLG NSNNIKPGQW AIAIGNPLGL NNTVTAGIIS SVDRTNALGG GQRVPYIQTD AAVNPGNSGG PLINASGQVI GINTAIKVAP GGGLSFAVPI NLAKRIAQQI VGRGQASHPY IGVRLQSLTP QLAKEINATG GQCQVPEVNA VLVVEVMSRS PADKAGVRQC DLISEVNGEV VRDPSQVQLA VDRGEVGKPM PLTLERNDKT IELIVKPAEL PRQG
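The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes the terminal stop codon (the gene sequence above ends in TGA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Protein length in aa, assuming the CDS includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which is not translated.
    return cds_length_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length_bp = gene_length(754100, 755254)
print(length_bp)                  # 1155, matches "Gene Length"
print(protein_length(length_bp))  # 384, matches "Protein Length"
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.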