Gene Spro_4084 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_4084 
Symbol 
ID5606981 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp4539525 
End bp4542710 
Gene Length3186 bp 
Protein Length1061 aa 
Translation table11 
GC content44% 
IMG OID640939645 
ProductHsdR family type I site-specific deoxyribonuclease 
Protein accessionYP_001480307 
Protein GI157372318 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACCT TTAAAACAGA AGCGCAATTT GAGCAGGCCT TTATTGAGGT TCTCACCAAC 
AAAGGTTGGG AGCCGGAGAT ACTCAAAAAC AAAACCGAAG CAGATTTACT GCAAAACTGG
GCAAACATTT TGTTTGAAAA TAATCGCCAG CAGGATCGTT TAAACGATGT TCCGCTTACC
GCGACTGAAA TGCAGCAAAT TATTGAGCAA ATCAAAGAGA TTAAAACCCC GCTCAAGCTC
AACGGTTTAA TTAACGGCAA AACCGTGGCG ATTAAGCGCG ATAATCCAGC CGATACTTTG
CATGTAGGTA AAGAAATCAG CCTAAAAATT TATGATCGCC AGGAAATTGC TGCCGGCCAA
AGCCGTTACC AAATTGTACA GCAACCCAAA TTTGAACGCG GTAGCCCTTT GCGTAATGAC
AGACGTGGCG ATGTGCTATT ACTGATCAAC GGTATGCCGG TGATCCATGT AGAACTAAAG
CGCAGCGGTA TTCCGGTTAG CCAAGCAGTA AACCAAATTG AAAAGTACAG TAAAGAGGGT
GTATTTAGCG GCCTGTTTTC GCTCATCCAA GTGTTTGTAG CCATGGAGCC CAACGAGGCC
AAATACTTTG CTAACCCCGG GCTAGATGGC AAATTTAACC CTGATTATCA ATTTAACTGG
GCCGATTTTA ATAACGAACC CATGAACCAC TGGAAAGACA TTGCCTCCAC CCTGCTTTCT
ATTCCTATGG CGCACCAGTT AATTGGCTTT TATACCGTTG CTGACGATAC CGATGGCGTG
CTTAAGGTAA TGCGCAGTTA TCAGTATTAC GCCGCCAATG CGATATCCGA CAAAGTTGCT
AAAACCAATT GGCAGCAGCT GGGTAGTGCG GCCAATAATC CCGATCGCCT CGGGGGATAT
GTGTGGCATA CCACGGGTTC GGGTAAAACC ATGACCAGCT TTAAATCGGC GCAGTTGATC
GCACAATCCA AAGATGCCGA TAAAGTGATT TTTTTAATGG ATCGTATCGA GCTGGGTACC
CAATCTCTCC TCGAGTATCA AGGTTTCGCT GACGATAGAG ATTCCGTTCA AGCCACCGAA
AATACTCACG TACTCATTAC CAAATTAAAA AGCACCGCCC CCGCCGATAC CTTAATTGTT
AGCTCCATTC AAAAAATGAG TAATATTTTT GAAGCGGTAG ATGATGAAGG CCCGGCCACA
AACACCGCGG ATATTGAAAA AATTCGCGCT AAGCGTTTGG TGTTCATTGT CGATGAGGCA
CACCGCTCTA CCTTTGGCGA TATGCTGATT ATTATTAAAC GTACCTTTCC ACGTGCCTTA
TTTTTTGGCT TTACCGGTAC GCCGATTCAA GAAGAAAACG AAAAAAACGG CAATACCACC
AGTACCGTAT TTGGCAACGA GCTACACCGC TATAGCATTG CCGATGGCAT ACGCGATGGC
AACGTACTCG GTTTTGACCC TTACAAAGTG CCAACCTTTA GAGACAGCGA CCTAAGACAA
GCGGTGGCGT TAGAGCAAGC CAAGGTCGAC TCGGTGATAG AAGCCATGGA TAACCCGGCC
AAAAAGAAAA AATTTAACCA CTTTATGAAT GATGTGCCTA TGGCTGGCCA CAAAGATGCC
TCGGGGAAAT ACCACAGAGG TATTGAAGAC TATGTGCCTA AAAGTCAATA TTTAAGCGAC
ACCCACCAAG CTAAGGTAGT GGAAGATATT CTGTATAAAT GGGATGTGCT CAGCCAAGGC
ACTAAATTCC ACGCCATTTT GGCCACTAAC AGCATTGCCG AAGCCATTGA CTATTACCGC
CGTTTAAAAG CCGCCAAACC TGAACTTAAA GTATCGGCCC TATTTGACCC GAACATTGAT
AACGACGGCA GTGGTGACCG TGGTCCCACC TTTAAAGGTG ATGGCCTGGA CGAAATTATG
GCCGACTATA ACGCGCGTTA TGGCCAGGAC TTTGACTTTG CCCGCCACGC GGCTTTTAAA
AAAGATTTAG CGGCACGCCT TGCCCATAAA AAGCCCTATG AGCGCATTCA TACCGAGCCT
TCAAAGCAAT TAGATTTACT GATTGTTGTA GATCAAATGC TCACTGGCTT TGACTCTAAA
TGGCTTAATA CCTTGTATTT AGATAAGGTG ATTAAATACC AAAATATTAT TCAAGCGTTC
TCGCGCACCA ATCGCTTGTT TGGCCAGGAC AAACCCCACG GCATCATCCG TTATTACCGC
TATCCACACA CCATGGAGCA ACATATCAAT GATGCGGTAA AACTCTATTC AGGCGACAGA
CCTATCGGCT TGTTTGTTGA TAGGTTAGAA AGCAACCTTA AAGCCATGAA TGAGCTAGTG
GTGGACATTA CCGAGCTGTT CGTCAGTGCG GGTGTTGAGA ATTTTGAAAA ACTGCCAGAC
GATATAGAAA TCTGTGCCCA ATTCGCCAAA TTATTTAACT CCTTTAGCCA ACACCTAGAA
GCGGCTAAAG TACAAGGTTT GCATTGGGAA CAGTCGACCT ATTCCTTTAC TGAAAATGAT
GTAGAACATG AGGTAACGCT GTCCATAGAC GAACAAACTT ACCTGAGCCT AGTTCTGCGT
TACAAAGAGT TGGTAGCCAA AGGTGATGGC AGTGGCACAG GTGGCGGCGA TGTGCCTTTT
GATATCAGTG GCTATTTAAC TGAGATAGAT ACCGGAAAAA TCGATGCCGA CTACATGAAC
AGCCGCTTTG ATAAATATTT AAAAGAGCTG AACAAGCATC AAGACTCTGC GAACATTGAA
ATCACATTAA ATGAGCTGCA CAAGTCGTTT GCATCACTCA CCCAAAGCGA GCAAAAGTAC
GCCAAGCTCT TCTTACACGA CTTGCAGCGC GGTGATGCGC AATTAATTGA AGGCCATACT
TTTAGAGACT ATATCAATAC CTACAAAGAT AACGCTGAAA ATGCACAATT AAACGCAGTT
GTTGATGCTC TTGGTTTAGA TAAAAAACTA CTCATAGCAT TAATGGCTGA TAGTGTTAAT
AATAAGAATC TCAACGACTT TGGTCGTTTC GACGCATTAA AAGAAACGGT AGATAAAACG
AAAGCTAAGG CCTACTTTGA AAAACAAGAC GGCATAACCA TACCTCTATT TAAGCTGAAT
ATTCGCATTG ATCAGTTTTT AAAGCAGTTT ATTTTTGCAC AAATGGATGA TTTCTTAAGT
GACTAA
 
Protein sequence
MTTFKTEAQF EQAFIEVLTN KGWEPEILKN KTEADLLQNW ANILFENNRQ QDRLNDVPLT 
ATEMQQIIEQ IKEIKTPLKL NGLINGKTVA IKRDNPADTL HVGKEISLKI YDRQEIAAGQ
SRYQIVQQPK FERGSPLRND RRGDVLLLIN GMPVIHVELK RSGIPVSQAV NQIEKYSKEG
VFSGLFSLIQ VFVAMEPNEA KYFANPGLDG KFNPDYQFNW ADFNNEPMNH WKDIASTLLS
IPMAHQLIGF YTVADDTDGV LKVMRSYQYY AANAISDKVA KTNWQQLGSA ANNPDRLGGY
VWHTTGSGKT MTSFKSAQLI AQSKDADKVI FLMDRIELGT QSLLEYQGFA DDRDSVQATE
NTHVLITKLK STAPADTLIV SSIQKMSNIF EAVDDEGPAT NTADIEKIRA KRLVFIVDEA
HRSTFGDMLI IIKRTFPRAL FFGFTGTPIQ EENEKNGNTT STVFGNELHR YSIADGIRDG
NVLGFDPYKV PTFRDSDLRQ AVALEQAKVD SVIEAMDNPA KKKKFNHFMN DVPMAGHKDA
SGKYHRGIED YVPKSQYLSD THQAKVVEDI LYKWDVLSQG TKFHAILATN SIAEAIDYYR
RLKAAKPELK VSALFDPNID NDGSGDRGPT FKGDGLDEIM ADYNARYGQD FDFARHAAFK
KDLAARLAHK KPYERIHTEP SKQLDLLIVV DQMLTGFDSK WLNTLYLDKV IKYQNIIQAF
SRTNRLFGQD KPHGIIRYYR YPHTMEQHIN DAVKLYSGDR PIGLFVDRLE SNLKAMNELV
VDITELFVSA GVENFEKLPD DIEICAQFAK LFNSFSQHLE AAKVQGLHWE QSTYSFTEND
VEHEVTLSID EQTYLSLVLR YKELVAKGDG SGTGGGDVPF DISGYLTEID TGKIDADYMN
SRFDKYLKEL NKHQDSANIE ITLNELHKSF ASLTQSEQKY AKLFLHDLQR GDAQLIEGHT
FRDYINTYKD NAENAQLNAV VDALGLDKKL LIALMADSVN NKNLNDFGRF DALKETVDKT
KAKAYFEKQD GITIPLFKLN IRIDQFLKQF IFAQMDDFLS D