Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Spro_0789 |
Symbol | |
ID | 5604065 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Serratia proteamaculans 568 |
Kingdom | Bacteria |
Replicon accession | NC_009832 |
Strand | + |
Start bp | 878203 |
End bp | 879633 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640936300 |
Product | serine endoprotease |
Protein accession | YP_001477023 |
Protein GI | 157369034 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.290182 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAA CCGCATTAGT TCTGAGCGCA TTGGCATTCA GTATTGGTAT GGCGATGGGT CCGATGACGG CCAGCGCCGC AGAAACCGCT TCTTCCAGCA CTCAACAATT GCCAAGCCTG GCACCGATGC TGGAAAAAGT GATGCCTTCC GTGGTGAGCA TTAATGTTGA AGGCAGCACC ACGGTCAATA CGGCGAAGAT GCCGCCGCAG TTCCAACAGT TCTTCGGTGA AGATTCACCG TTCTGCCAGG ACGGTTCGCC GTTCCAGGCC TCGCCGATGT GTCAGGGTGG CGAGCCGGGT GCACCGGGTC AGGGGACCCA GCAGAAATTC CAGGCGCTGG GCGCCGGGGT GGTGATTGAT GCCGCTAAAG GCTACGTGGT GACCAACAAT CACGTGGTGG ATAACGCCAA TAAAATTCAG GTCCAATTGA ACGACGGCCG TAAATTTGAC GCCAAGGTGA TCGGTAAGGA TCCACGCTCC GATATCGCGC TGATCCAACT GAAAGACTTT AAAAATCTGA CGGCGATCAA AATGGCGGAC TCCGAACAAC TGCGCGTAGG TGATTACACC GTCGCGATCG GCAACCCGTA TGGGCTGGGT GAAACCGCCA CCTCTGGTAT TGTTTCTGCA CTGGGCCGCA GCGGCTTGAA TATCGAAAAC TACGAGAACT TCATCCAGAC CGATGCGGCG ATTAACCGCG GTAACTCCGG TGGTGCGTTG GTTAACCTGA ACGGTGAACT GATCGGCATT AACACCGCCA TTTTGGCACC GGACGGCGGC AACATCGGCA TCGGCTTTGC CATCCCGAGC AACATGGTGA AAAACCTGAC GGCGCAGATG GTCGAATATG GCCAGGTGAA ACGCGGCGAA TTGGGCATTA TGGGTACCGA ACTGAACTCT GAGCTGGCGA AAGCGATGAA AGTGGACGCG CAGCGCGGGG CCTTTGTCAG CCAGGTGATG CCGAAATCCT CTGCCGCCAA GGCGGGTATC AAGGCCGGTG ATGTGATTGT TACCATGAAC GGTAAAGCCA TCTCCAGCTT CGCCTCGTTC CGTGCGGAAA TCGGTACTTT GCCGGTCGGC AGCAAAATGT CGCTGGGCAT TATCCGCGAC GGCAAACCGG TGACCGTGGA CGTCACGCTG GAGCAAAGCG CCCAGACTCA GGTTGAATCC GGCAATATCT ACACCGGTAT TGAAGGCGCC GAACTGAGCA ATGGTCAGGC TGGCGCTCAG AAAGGCGTGA AGGTCGATAA CGTCAAGGCC GGCAGCGCTG CCGCACGTAT CGGTCTGAAA AAAGGCGACT TTATCCTTGG GGTTAACCAG CAGCCGATCC AGAACCTGGG CGAACTGCGT AAAATCCTCG ACAGCAAACC GTCGGTACTG GCGCTGAATA TCCTGCGCGG TGATACCACG CTGTATCTGC TGATGCAATA A
|
Protein sequence | MKKTALVLSA LAFSIGMAMG PMTASAAETA SSSTQQLPSL APMLEKVMPS VVSINVEGST TVNTAKMPPQ FQQFFGEDSP FCQDGSPFQA SPMCQGGEPG APGQGTQQKF QALGAGVVID AAKGYVVTNN HVVDNANKIQ VQLNDGRKFD AKVIGKDPRS DIALIQLKDF KNLTAIKMAD SEQLRVGDYT VAIGNPYGLG ETATSGIVSA LGRSGLNIEN YENFIQTDAA INRGNSGGAL VNLNGELIGI NTAILAPDGG NIGIGFAIPS NMVKNLTAQM VEYGQVKRGE LGIMGTELNS ELAKAMKVDA QRGAFVSQVM PKSSAAKAGI KAGDVIVTMN GKAISSFASF RAEIGTLPVG SKMSLGIIRD GKPVTVDVTL EQSAQTQVES GNIYTGIEGA ELSNGQAGAQ KGVKVDNVKA GSAAARIGLK KGDFILGVNQ QPIQNLGELR KILDSKPSVL ALNILRGDTT LYLLMQ
|
| |