Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Spro_4352 |
Symbol | |
ID | 5604335 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Serratia proteamaculans 568 |
Kingdom | Bacteria |
Replicon accession | NC_009832 |
Strand | + |
Start bp | 4823910 |
End bp | 4825280 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640939914 |
Product | protease Do |
Protein accession | YP_001480574 |
Protein GI | 157372585 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.208163 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00128029 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAATAAAA AGTCGTTAAT TCTTAGTGCA TTGGCAATGA GTATTGGCAT CACACTGACC TCCGTACCGG CAGCCAACGC GGCGATGCCT GTCGCCGTGC AGGGGCAGCA GTTGCCAAGC CTGGCGCCGA TGTTGGAAAA AGTATTGCCC GCTGTCGTCA GCGTGCATGT GGAAGGCACT CAGGTGCAGC GCCAGCAGGT ACCGGAAGAG CTCAAACGCT TCTTCGGCCC CGACGCGCCA GGACAACAGC AAAGCTCACG CCCGTTTGAA GGCTTGGGTT CTGGCGTGAT CATCGACGCC GCCAAGGGCT ATGTGTTGAC CAACAACCAC GTGATCAACA ATGCCGACAA AATCAGCGTA CAGTTGAATG ACGGGCGTGA AGTGGATGCC AAGTTGTTGG GACGTGACGA ACAGTCCGAC ATTGCCCTGC TGCAGCTCAG CGATGTGAAA AACCTGACCG CGATAAAAAT GGCGGACTCC GATCAACTGC GGGTCGGCGA CTTTGCCGTG GCCGTCGGCA ACCCATTCGG CCTCGGCCAG ACGGCGACCT CCGGTATTAT CTCTGCGCTG GGTCGTAGCG GTCTGAACCT GGAAGGGTTG GAAAACTTCA TTCAGACCGA TGCTTCCATC AACCGCGGTA ACTCCGGTGG CGCACTGGTT AACCTCAACG GTGAGTTAAT CGGTATCAAC ACCGCCATTT TGGCGCCAAG CGGCGGCAAC GTGGGCATCG GCTTTGCCAT TCCGAGCAAC ATGGCGCAGA ACCTCAGCCA GCAGTTGATT GAATTTGGCG AAGTGAAACG CGGCCTGCTG GGCATCAAAG GCAGTGAAAT GACCGCCGAT ATGGCCAAGG CGTTTAACAC CGACGCCCAA CGCGGTGCCT TCGTCAGCGA AGTGTTGCCG AAATCTGCCG CCGCCAAGGC CGGCATCAAA GCCGGTGACA TTCTGGTTTC CGTCGACGGC AAGCCCGTCA GCAGCTTCGC CGAACTGCGT GCCAAGGTCG GTACCACCGC GCCGGGCAAA ACCATCAAGG TTGGCTTGCT GCGCGACGGT AAGCCACTGG AAGTTTCCGT CACCCTGGAT AACAGTGAAG GCGCCTCCAC CAACGCCGAA ACGCTTTCTC CGGCTCTGCA GGGCGTGTCA CTGAGCAACG GTGCCATTCC AAGTGGTGAC AAAGGCGTGA AGATCGATAG AGTCGATAAA GGCTCCGTAG CCGCGCAGAT CGGGTTGCAG AAGGACGATG TGATCATTGG CGTCAACCGT CAGCGCGTTG AAAACATCAC GACCTTACGC AAGGTGCTGG AAGCCAAGCC ACCGGTGATG GCACTGAACA TCGTGCGCGG CGGCGAAACC ATTTATCTGC TGTTGCGTTA A
|
Protein sequence | MNKKSLILSA LAMSIGITLT SVPAANAAMP VAVQGQQLPS LAPMLEKVLP AVVSVHVEGT QVQRQQVPEE LKRFFGPDAP GQQQSSRPFE GLGSGVIIDA AKGYVLTNNH VINNADKISV QLNDGREVDA KLLGRDEQSD IALLQLSDVK NLTAIKMADS DQLRVGDFAV AVGNPFGLGQ TATSGIISAL GRSGLNLEGL ENFIQTDASI NRGNSGGALV NLNGELIGIN TAILAPSGGN VGIGFAIPSN MAQNLSQQLI EFGEVKRGLL GIKGSEMTAD MAKAFNTDAQ RGAFVSEVLP KSAAAKAGIK AGDILVSVDG KPVSSFAELR AKVGTTAPGK TIKVGLLRDG KPLEVSVTLD NSEGASTNAE TLSPALQGVS LSNGAIPSGD KGVKIDRVDK GSVAAQIGLQ KDDVIIGVNR QRVENITTLR KVLEAKPPVM ALNIVRGGET IYLLLR
|
| |