Gene Spro_3601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_3601 
Symbol 
ID5605848 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp3980822 
End bp3981847 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content58% 
IMG OID640939152 
Productpeptidase M4 thermolysin 
Protein accessionYP_001479825 
Protein GI157371836 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3227] Zinc metalloprotease (elastase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.238334 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.290182 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGACCC TGACAGCGCG TTCGGTCATT CCCCCTTATA TGCTGCGTCG GATCATTGAG 
CACGGCAGCC TGCTGCAGCG CGACTGCGCA TTACACACCC TTAACCACGT TCAAAGCCTG
CTCGGCAACA AGCCGTTACG CGCCCCCGGG GCGAAAACCT CGACCGGTGG CGAAGTCATC
CGCGATATTT TTGATGCCGA AAACGGCACC CAACTGCCGG GTAAACAGGT GCGTAATGAG
GGCCAGGCCA GTAATCATGA CGTGGCGGTG GATGAAGCCT ATGACTACCT CGGCGTCACC
TACGATTTCT TCTGGCAGGC ATTCAAACGC AACTCGCTGG ACAATCAAGG CCTGCCGCTG
ACCGGCAGCG TGCATTACGG CAAGGAATAC CAGAACGCCT TTTGGAACGG CCAGCAAATG
GTCTTCGGCG ATGGTGACGG CGAAATCTTT AACCGTTTTA CCATCGCCAT CGACGTGGTT
GGCCACGAAC TGGCACACGG CGTCACCGAA AGCGAGGCCG GACTAATTTA CTTCCAACAG
GCCGGTGCGC TGAATGAGTC GCTGTCTGAC GTGTTCGGTT CTCTGGTCAA ACAGTTCCAC
CTCAAGCAAA CCGCGGATAA GGCCGACTGG CTGATTGGCG AAGGCCTGCT GGCGAAAGGC
ATCAACGGCA AGGGCCTGCG TTCGATGTCG GCACCCGGTA CCGCCTACAA CGATCCGCTG
CTGGGGAAAG ATCCGCAGCC GGCCGACATG AAAGACTACA TTCAGACCAA AGAGGATAAC
GGCGGCGTCC ACCTCAACTC CGGCATTCCC AACCGCGCCT TCTATCTGGC GGCCACGGCT
CTGGGCGGCT TTGCCTGGGA GAAAGCCGGT TACATCTGGT ACGACACGCT TTGCGACAAG
ACACTGCCGC AGGACGCTGA CTTCGCCACC TTTGCCCGTA CCACGGTGAA ACATGCCAAA
CAGCGCTTCG ACAGTAAAGT GGCGGATAAG GTACAGCAGG CCTGGCATCA GGTAGGGGTG
GCGTAA
 
Protein sequence
MPTLTARSVI PPYMLRRIIE HGSLLQRDCA LHTLNHVQSL LGNKPLRAPG AKTSTGGEVI 
RDIFDAENGT QLPGKQVRNE GQASNHDVAV DEAYDYLGVT YDFFWQAFKR NSLDNQGLPL
TGSVHYGKEY QNAFWNGQQM VFGDGDGEIF NRFTIAIDVV GHELAHGVTE SEAGLIYFQQ
AGALNESLSD VFGSLVKQFH LKQTADKADW LIGEGLLAKG INGKGLRSMS APGTAYNDPL
LGKDPQPADM KDYIQTKEDN GGVHLNSGIP NRAFYLAATA LGGFAWEKAG YIWYDTLCDK
TLPQDADFAT FARTTVKHAK QRFDSKVADK VQQAWHQVGV A