Gene Spro_0789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_0789 
Symbol 
ID5604065 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp878203 
End bp879633 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content56% 
IMG OID640936300 
Productserine endoprotease 
Protein accessionYP_001477023 
Protein GI157369034 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.290182 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAA CCGCATTAGT TCTGAGCGCA TTGGCATTCA GTATTGGTAT GGCGATGGGT 
CCGATGACGG CCAGCGCCGC AGAAACCGCT TCTTCCAGCA CTCAACAATT GCCAAGCCTG
GCACCGATGC TGGAAAAAGT GATGCCTTCC GTGGTGAGCA TTAATGTTGA AGGCAGCACC
ACGGTCAATA CGGCGAAGAT GCCGCCGCAG TTCCAACAGT TCTTCGGTGA AGATTCACCG
TTCTGCCAGG ACGGTTCGCC GTTCCAGGCC TCGCCGATGT GTCAGGGTGG CGAGCCGGGT
GCACCGGGTC AGGGGACCCA GCAGAAATTC CAGGCGCTGG GCGCCGGGGT GGTGATTGAT
GCCGCTAAAG GCTACGTGGT GACCAACAAT CACGTGGTGG ATAACGCCAA TAAAATTCAG
GTCCAATTGA ACGACGGCCG TAAATTTGAC GCCAAGGTGA TCGGTAAGGA TCCACGCTCC
GATATCGCGC TGATCCAACT GAAAGACTTT AAAAATCTGA CGGCGATCAA AATGGCGGAC
TCCGAACAAC TGCGCGTAGG TGATTACACC GTCGCGATCG GCAACCCGTA TGGGCTGGGT
GAAACCGCCA CCTCTGGTAT TGTTTCTGCA CTGGGCCGCA GCGGCTTGAA TATCGAAAAC
TACGAGAACT TCATCCAGAC CGATGCGGCG ATTAACCGCG GTAACTCCGG TGGTGCGTTG
GTTAACCTGA ACGGTGAACT GATCGGCATT AACACCGCCA TTTTGGCACC GGACGGCGGC
AACATCGGCA TCGGCTTTGC CATCCCGAGC AACATGGTGA AAAACCTGAC GGCGCAGATG
GTCGAATATG GCCAGGTGAA ACGCGGCGAA TTGGGCATTA TGGGTACCGA ACTGAACTCT
GAGCTGGCGA AAGCGATGAA AGTGGACGCG CAGCGCGGGG CCTTTGTCAG CCAGGTGATG
CCGAAATCCT CTGCCGCCAA GGCGGGTATC AAGGCCGGTG ATGTGATTGT TACCATGAAC
GGTAAAGCCA TCTCCAGCTT CGCCTCGTTC CGTGCGGAAA TCGGTACTTT GCCGGTCGGC
AGCAAAATGT CGCTGGGCAT TATCCGCGAC GGCAAACCGG TGACCGTGGA CGTCACGCTG
GAGCAAAGCG CCCAGACTCA GGTTGAATCC GGCAATATCT ACACCGGTAT TGAAGGCGCC
GAACTGAGCA ATGGTCAGGC TGGCGCTCAG AAAGGCGTGA AGGTCGATAA CGTCAAGGCC
GGCAGCGCTG CCGCACGTAT CGGTCTGAAA AAAGGCGACT TTATCCTTGG GGTTAACCAG
CAGCCGATCC AGAACCTGGG CGAACTGCGT AAAATCCTCG ACAGCAAACC GTCGGTACTG
GCGCTGAATA TCCTGCGCGG TGATACCACG CTGTATCTGC TGATGCAATA A
 
Protein sequence
MKKTALVLSA LAFSIGMAMG PMTASAAETA SSSTQQLPSL APMLEKVMPS VVSINVEGST 
TVNTAKMPPQ FQQFFGEDSP FCQDGSPFQA SPMCQGGEPG APGQGTQQKF QALGAGVVID
AAKGYVVTNN HVVDNANKIQ VQLNDGRKFD AKVIGKDPRS DIALIQLKDF KNLTAIKMAD
SEQLRVGDYT VAIGNPYGLG ETATSGIVSA LGRSGLNIEN YENFIQTDAA INRGNSGGAL
VNLNGELIGI NTAILAPDGG NIGIGFAIPS NMVKNLTAQM VEYGQVKRGE LGIMGTELNS
ELAKAMKVDA QRGAFVSQVM PKSSAAKAGI KAGDVIVTMN GKAISSFASF RAEIGTLPVG
SKMSLGIIRD GKPVTVDVTL EQSAQTQVES GNIYTGIEGA ELSNGQAGAQ KGVKVDNVKA
GSAAARIGLK KGDFILGVNQ QPIQNLGELR KILDSKPSVL ALNILRGDTT LYLLMQ