Gene Spro_3663 details


Gene Information

Locus tag: Spro_3663
Symbol:
ID: 5606382
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Serratia proteamaculans 568
Kingdom: Bacteria
Replicon accession: NC_009832
Strand:
Start bp: 4056597
End bp: 4058135
Gene length: 1539 bp
Protein length: 512 aa
Translation table: 11
GC content: 47%
IMG OID: 640939214
Product: sulfatase
Protein accession: YP_001479887
Protein GI: 157371898
COG category: [R] General function prediction only
COG ID: [COG2194] Predicted membrane-associated, metal-dependent hydrolase
TIGRFAM ID:
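
The length fields above follow from the coordinates by simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts a terminal stop codon that is not part of the protein. Both assumptions are consistent with the listed values but are not stated in the record. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Length in aa, assuming the last codon is a stop codon (an assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_length_bp // 3 - 1


length_bp = gene_length(4056597, 4058135)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the coordinates reproduce the listed 1539 bp and 512 aa.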


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAACAC ATACAGCAAT TTTAAATAAA ACCCACATAC TCGCAACATC ACCCGTGGCC 
CGTCATTTTT TGCTTTTCGT TGTCGTTTCG TTGATATTAA TAAAAGCGAT GGGATATAGC
GGCGCCAAAG GGATTGATAT CCTGTTTGTT TCACTTAGCC TGTTATTATT ATCTGCAATC
CCCCTCACTC GATATTTTCT GGTTATACCC TATATTATAT TCTGTGCCAT TTATGCCCCC
GTCGGCGTGA TCTACGGCCC ACCTTCCGTT CCCGTGGTGT CGGCCCTGTT TCAAACCAAC
CGTGCGGAAG CCATTGAGTT TCTGCACGCT ATACCTGCCG GCTGCTACCT GTTACCCAGC
GCCACCCTGC TCACGTTGTT TATTCTTGCT CGCTATAGCT GGCAGCGGCC GCTCCCGTTA
AAAAAAATAT TGCCATTTTT GGTGATTTTT ATCGTTTTTC TGTTCGCCAG AATACTCAGT
GGGGGAGTGG AAAATCTTAA ATTAGTGAAT TTCTTTTCCT CGCTAATTTC TTCGTACCAG
AGCTACAACC AACAAATAGC AGAAATTGAA GCCGGTACGC ATTCCCAACC CAGCTGGAAC
GTCGATAGCG CAAAGAGTAA CGAGGCTAAC TATGTCATTG TGGTTGGCGA AAGCATGCGC
AGGGACTATA TGTCATTATT TGGTTATCCG ACGCCAACGA CCCCTTTTCT GGATCACGTG
AACGGGACAT TTTACAGTAA TTATATTTCC ACCGCGCCGA ACACCTTCGA ATCCTTGCCA
AGAACCCTGG CATTGAGCGA CGGAAAAACA CATCATATTG CGGATAATAT TATCACCCTG
GCAAAAGTCG CAGGACTGCA TACCTATTGG TTTTCCAACC AAGGGTTGAT TGGCCAATTT
GATACGCCGA TCTCCAAAAT AGCCATGTTT AGTGATGAGC ATCAGTTCCT GAAAAAAGGA
GATTACCAAT CACGCAATAC CGATGACGAT GAACTGCTGC CGCTGTTGCA AACGGCACTG
GCCCATAATG GTGTGGGGAA CCTGTATGTT CTGCATATTA TGGGATCGCA TGCCGATTTC
TGCGAACGAC TGGGCGGTGA GCCACCGGCT TTCACCAGCG ATAATTCCGA GCTAGGTTGT
TACCTGTCGA CCTACCGCAA GACAGATCGA TTTATCGAGC ATGCTTATCA AATGCTGCAA
GAGACTCACT CTCCCTTTAA ATTATTCTAT TTCTCTGACC ACGGTCTTTC TCACAGAGAT
ATTGACGGCA AGCTGTATCT ACGCCATGGC GGCAATAATC GGCAGAATTA TGAGGTGCCA
CTGCTGGTGC TTTCTGATTC CGACCGACAG CGAACCCTTA TTGAGGATCC GCACAGTGCC
TTTGACTTTC TCAGTCTGTT TGCCCAACAG GCTGGGATTA CCCTCACCCA ACCACAATTA
CACGCAGCGG TGACTGCGCA AGGCACACGG CATGTCTTCA ATGGTCAGGA AATGGTTGAT
TTTGATCAGT TGGCCAACGA CCCGCCAGAG TTACTATGA
 
Protein sequence
MKTHTAILNK THILATSPVA RHFLLFVVVS LILIKAMGYS GAKGIDILFV SLSLLLLSAI 
PLTRYFLVIP YIIFCAIYAP VGVIYGPPSV PVVSALFQTN RAEAIEFLHA IPAGCYLLPS
ATLLTLFILA RYSWQRPLPL KKILPFLVIF IVFLFARILS GGVENLKLVN FFSSLISSYQ
SYNQQIAEIE AGTHSQPSWN VDSAKSNEAN YVIVVGESMR RDYMSLFGYP TPTTPFLDHV
NGTFYSNYIS TAPNTFESLP RTLALSDGKT HHIADNIITL AKVAGLHTYW FSNQGLIGQF
DTPISKIAMF SDEHQFLKKG DYQSRNTDDD ELLPLLQTAL AHNGVGNLYV LHIMGSHADF
CERLGGEPPA FTSDNSELGC YLSTYRKTDR FIEHAYQMLQ ETHSPFKLFY FSDHGLSHRD
IDGKLYLRHG GNNRQNYEVP LLVLSDSDRQ RTLIEDPHSA FDFLSLFAQQ AGITLTQPQL
HAAVTAQGTR HVFNGQEMVD FDQLANDPPE LL
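
The gene and protein sequences above can be cross-checked against each other. The following is a minimal sketch, not the database's own method: it uses the standard codon assignments, which translation table 11 shares (table 11 differs mainly in which start codons it permits, and alternative start codons are not handled here). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Alternative start codons (allowed by table 11) are NOT converted to M.
    """
    seq = clean(dna)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 nucleotides of the gene sequence listed above.
print(translate("ATGAAAACAC ATACAGCAAT TTTAAATAAA"))
```

Passing the full gene sequence to `translate` and comparing the result with the listed protein sequence (after `clean`) is one way to check the record for transcription errors; `gc_percent` can be compared with the listed GC content in the same way.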