Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9301_01041 |
Symbol | |
ID | 4911170 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | - |
Start bp | 105099 |
End bp | 106220 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 640159669 |
Product | serine protease |
Protein accession | YP_001090328 |
Protein GI | 126695442 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.741511 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAAAGAT ACATCCATAA GCTATTAGTA ATCGTCTCAT TAATTCCAAT AGGCATTACA TATCCTGCAA AAGTAATATC CCAACCATTA AGCAAATCAA ACATTAATGA AGTTAGTTTA TTTTCAAACA AATCTTTCAT AACAAAAGCT GTAGAGAGAA CCGGTGCAGC TGTGGTGACA ATTGATACTC AAAGATATGT TAAAAAAAGA AAATTTCCAA GAAATTCTCA ACTATTTATA GACCCATATT TTGAAAGATT TTTTGGATTA GATTTGCCTA ACGAAAACCG ACCAAGGATA GAGCAAAACC AAGGCAGTGG ATTTATATTT GCAGATGGAC TTGTCATGAC CAATGCTCAT GTAGTGAATG GATCAGATAA GGTAATTGTT GGTTTAACCA ACGGCAAAAA ATTAAACGCC CAACTGATAG GTCAAGACTC TTTTACTGAT TTAGCTGTGT TAAAGATTGA AGGGAAAGGG CCTTGGCCAA AAGCAAAATT GGGCGATTCT GCAAAGATTA AAGTTGGTGA TTGGGCTATA GCAGTTGGAA ATCCATTCGG ACTGGAAAAC ACAGTTACTC TTGGTATTAT TAGTAATCTA AATAGAAACG TAAATCAATT AGGAATATAT GATAAAAAAC TTGAACTGAT ACAAACAGAC GCTGCTATTA ATCCTGGCAA TTCTGGAGGT CCACTGTTGA ATAGCGATGG TGAAGTAATT GGTATTAATA CGTTGATAAG ATCAGGTCCA GGAGCGGGTT TGAGTTTTGC AATCCCAATT AATAAAGCTA AGGAAATTGC CTATCAACTT TTACAAAATG GGAAAGTAAT ACATCCTATG ATTGGAATTA GCCTAATAGA AGAAAGTATT TCTGAGAGAA AAAATAAGGT TGTAAAAGTT GGATATGTAG TACCAAACAG TCCAGCTGAA AAAAGTGGAA TCAAGATAGA TGATATTTTA ATTAAAATAG GAAATAAAGA TATTGAAACA GCATCAGACG TAATAGAACA AATTAGTAAA AATGGTATCA AAAAACAAGT AAATATATTA TTGAAGCGTA AAAATAAATT TATTAAATTA AAAGTAATAC CAACTGATAT TACTAATCTA CAAAATAACT AA
|
Protein sequence | MKRYIHKLLV IVSLIPIGIT YPAKVISQPL SKSNINEVSL FSNKSFITKA VERTGAAVVT IDTQRYVKKR KFPRNSQLFI DPYFERFFGL DLPNENRPRI EQNQGSGFIF ADGLVMTNAH VVNGSDKVIV GLTNGKKLNA QLIGQDSFTD LAVLKIEGKG PWPKAKLGDS AKIKVGDWAI AVGNPFGLEN TVTLGIISNL NRNVNQLGIY DKKLELIQTD AAINPGNSGG PLLNSDGEVI GINTLIRSGP GAGLSFAIPI NKAKEIAYQL LQNGKVIHPM IGISLIEESI SERKNKVVKV GYVVPNSPAE KSGIKIDDIL IKIGNKDIET ASDVIEQISK NGIKKQVNIL LKRKNKFIKL KVIPTDITNL QNN
|
| |