Gene Information |
Locus tag | P9515_01021 |
Symbol | |
ID | 4720237 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9515 |
Kingdom | Bacteria |
Replicon accession | NC_008817 |
Strand | - |
Start bp | 104759 |
End bp | 105910 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 640079764 |
Product | serine protease |
Protein accession | YP_001010418 |
Protein GI | 123965337 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.425044 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTTTTA AGGCATTAAG AAAAATAAAA ACAAGGAAAT ACTCTTTTAA TAGACTAATG TTTATATCCT ATTTAACTTT TGGTTTACTA AATAAACCGC AAGTAATATC ACAGCCGATA AGAAATCAAA CTCTTGATGA AAAATCTAAA ATTGCAAATA TATCGTTCAT AACAAAAGCA ATTAAGAAAA CAGGGGCATC TGTAGTAACT ATTGACACTC AAAGATTAGT AAAAAATAAA CAATTTTCCA GAGATTCAGG AATTTTCATT GACCCATATT TCGAAAGATT TTTTGGATTA CAATTACCTC GCGAATATCA ACCAAGAATA GAACAGAGTC AGGGAAGTGG ATTTATATTC GAAGATGGTC TAGTAATGAC AAATGCGCAT GTTGTTAATG GCTCTAAAAA AGTAATAGTT GGCTTATCAA ATGGGACAAA ATATGAAGGG AAGTTAATAG GACAAGATTC ACTTACTGAT CTAGCTGTGA TTAAGCTCCA AGGTCGAGGT CCTTGGCCAA AAGCAAAATT AGGTGACTCT TCAAAAATAG AAGTCGGTGA TTGGGCCATA GCTGTGGGAA ACCCTTTTGG ACTCGAAAAT ACAGTCACTC TAGGAATAAT TAGTAATTTA AATAGAAATG TCTCAGAATT AGGAATATAC GATAAAAAAT TTGAATTAAT CCAGACAGAT GCTGCAATTA ACCCCGGCAA TTCTGGAGGA CCATTATTAA ATAGTGCTGG AGAGGTCATT GGAATTAATA CTTTGATTAG ATCAGGACCT GGAGCAGGTC TCAGTTTTGC CATTCCAATA AATAAAGCTA AAGATATCGC TTCACAACTA ATCAACAATG GGCGAGTAAT TCATCCTATG ATTGGAATTA ATTTAATAGA TCAAAATTCT TTTGAGATTA AAAAAAATAT TGTGAAAGTA GGATATGTTG TCCCAAATAG TCCTGCCGAT AAAAGTGGAT TCTATATTAA TGACGTAATT ATTAAAGTGG GTAAAAAAGA TGTTCAAAAT TCTTCAGATG TTATAAATGA AATAACTAAT AATGGAATTA ATAATTATTT AAATATAATC ATCAAAAGAA AAAATAAACT TATTAAATTA AAAGTTAAAC CAACTGATAT AAGTAATTTA TCAGGAAAAT AA
|
Protein sequence | MFFKALRKIK TRKYSFNRLM FISYLTFGLL NKPQVISQPI RNQTLDEKSK IANISFITKA IKKTGASVVT IDTQRLVKNK QFSRDSGIFI DPYFERFFGL QLPREYQPRI EQSQGSGFIF EDGLVMTNAH VVNGSKKVIV GLSNGTKYEG KLIGQDSLTD LAVIKLQGRG PWPKAKLGDS SKIEVGDWAI AVGNPFGLEN TVTLGIISNL NRNVSELGIY DKKFELIQTD AAINPGNSGG PLLNSAGEVI GINTLIRSGP GAGLSFAIPI NKAKDIASQL INNGRVIHPM IGINLIDQNS FEIKKNIVKV GYVVPNSPAD KSGFYINDVI IKVGKKDVQN SSDVINEITN NGINNYLNII IKRKNKLIKL KVKPTDISNL SGK
|
| |
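The coordinate, length, and GC content fields in this record are related by simple arithmetic, which the following minimal Python sketch illustrates. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the last codon of the gene is a stop codon; both assumptions are consistent with the values listed above but are not stated by the record itself. The function names (`gene_length`, `protein_length`, `gc_fraction`) are hypothetical helpers introduced for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(104759, 105910)
print(length)                  # 1152, matching the Gene Length field
print(protein_length(length))  # 383, matching the Protein Length field

# First 10-base block of the gene sequence, used only as a small demo input.
print(gc_fraction("ATGTTTTTTA"))
```

Passing the full Gene sequence value to `gc_fraction` (the space-separated blocks can be pasted as-is) gives a figure to compare against the GC content field.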