Gene Information |
Locus tag | Mmwyl1_1102 |
Symbol | |
ID | 5366786 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Marinomonas sp. MWYL1 |
Kingdom | Bacteria |
Replicon accession | NC_009654 |
Strand | + |
Start bp | 1230907 |
End bp | 1232316 |
Gene Length | 1410 bp |
Protein Length | 469 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640803443 |
Product | protease Do |
Protein accession | YP_001339968 |
Protein GI | 152995133 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
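The length fields above follow from the coordinates. The sketch below is a minimal illustration, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1230907, 1232316)
print(length, protein_length_aa(length))  # 1410 469
```

Both values agree with the Gene Length and Protein Length rows of the record.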
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.432426 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.334771 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAGAT TGCTTAAACA AGTTTGCATG GTTGTCGTTA GTAGTTTTAT GATGGCTTCG ATGCTTACTC ATGCGGCTTC ACTCCCTGAC TTTACTGAGT TAGTGGAAAA AGCCTCCCCC GCTGTTGTGA ATATCAGTAC GGAACAAACT GTTACCACAA AAACGGCTAA CGAAGGTGGT CAGCAATTAG GGCCAAATAG TGAAGAACTC AATGAGTTCT TTAAGCACTT CTTTGGTCAA CAGCCTTTCG GTCAACAAGC ACCGCCACAG CAGGGACAGC GTAGTTCATT GGGATCGGGT TTTATCATTT CCCATGATGG TTATGTGTTA ACCAATAATC ACGTTATTGA TGGCGCGGAT GTGATTCATG TTCGCCTTAA TGATAGACGT GAATATGTCG CAAAATTGGT TGGTACGGAT CCTAGAACCG ATTTAGCTCT ACTTAAAATC GAGGCGGATG ACCTGCCTAT TGTCAAAATG GGTGATTCAG ACAAGCTTAA GCCGGGTCAG TGGGTATTAG CGATTGGTTC TCCATTCGGC TTTGACTACA CGGTGACCGC AGGTATCGTC AGTGCAACAG GACGAAGCTT ACCATCTGAT AACTATGTAC CATTTATCCA AACGGACGTA GCGATCAATC CAGGTAACTC TGGTGGTCCT TTGTTCAACC TAGACGGTGA AGTCGTTGGT ATTAACTCAC AAATTTACAC TCGTTCTGGT GGTTTTATGG GCGTGTCGTT TGCGATTCCC TCTAAAGTTG CCATGTCGGT TGTGGATCAA CTGAAGAGCG ATGGTAAAGT ATCTCGCGCT TGGCTGGGTG TGTTGATTCA GGATGTAAAT AATGAGTTGG CTGAGTCATT TGGCTTAGAC AGATCAAATG GTGCATTGAT AAGCCGTGTA TTACCTGATT CTCCAGCTGA AAAAGCAGGC CTAAAGTCAG GCGATATTAT CCTAGAGTTC AACGGACAAT CCATTGCCCA TTCTGGTGAG CTGCCTTATA TAGTTGGGCA AATGAAAGCG GATGAAAAAG TGGATGCTAA AGTGTATCGA GATGGTAAAG AGCAAACCAT TTCTGTGACG CTAGAGGCAC GGCCAAATGA CCCGAAAGTA GTCGCTCAAT CTCAGCAGGA TCAAAACCGT TTAGGTATGA TTGTAGGAGA AGTGCCTGCG GACATGGCGA AAAAATTTGA AATTGACAAT GGTGTTGTTA TCGAGCAAGT ACTTGGCGGT ACAGCGGCCC GTAATGGTTT GCAGCAAGGT GATGTGATCA CCATGTTAAA TGGCAAGCGC ATTACCAGTG TTGCGGAGTT TGCCAAAATT GCCAAAGATA TTCCAAGTGG TCGTTCAGTA CCAATGCGAG TCATTAGACA GGGCTACCCA ATGTTTATTC CATTCAAAAT AATGGATTAA
|
Protein sequence | MNRLLKQVCM VVVSSFMMAS MLTHAASLPD FTELVEKASP AVVNISTEQT VTTKTANEGG QQLGPNSEEL NEFFKHFFGQ QPFGQQAPPQ QGQRSSLGSG FIISHDGYVL TNNHVIDGAD VIHVRLNDRR EYVAKLVGTD PRTDLALLKI EADDLPIVKM GDSDKLKPGQ WVLAIGSPFG FDYTVTAGIV SATGRSLPSD NYVPFIQTDV AINPGNSGGP LFNLDGEVVG INSQIYTRSG GFMGVSFAIP SKVAMSVVDQ LKSDGKVSRA WLGVLIQDVN NELAESFGLD RSNGALISRV LPDSPAEKAG LKSGDIILEF NGQSIAHSGE LPYIVGQMKA DEKVDAKVYR DGKEQTISVT LEARPNDPKV VAQSQQDQNR LGMIVGEVPA DMAKKFEIDN GVVIEQVLGG TAARNGLQQG DVITMLNGKR ITSVAEFAKI AKDIPSGRSV PMRVIRQGYP MFIPFKIMD
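The sequences are shown in space-separated blocks of 10 characters. The sketch below is one possible way to strip that layout, compute a GC fraction, and translate a coding sequence. It assumes only the bases A, C, G, T are present. It uses the standard amino-acid assignments, which translation table 11 shares (table 11 differs only in which codons may act as starts). All helper names are illustrative.

```python
from itertools import product

# NCBI amino-acid string for the standard code, codons enumerated over "TCAG"
# with the first base varying slowest. Table 11 uses the same assignments.
_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product("TCAG", repeat=3), _AAS)}


def clean(seq: str) -> str:
    """Drop the whitespace between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0; strip trailing stop(s)."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore an incomplete final codon
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))
    return protein.rstrip("*")


head = "ATGAACAGAT TGCTTAAACA"   # first two blocks of the gene sequence
tail = "ATAATGGATTAA"            # last four codons of the gene sequence
print(translate(head))    # MNRLLK
print(translate(tail))    # IMD
print(gc_fraction(head))  # 0.3
```

The two fragments reproduce the first six and last three residues of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein sequence rows to be rechecked in the same way.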