Gene Sputcn32_3986 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSputcn32_3986 
Symbol 
ID5081469 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella putrefaciens CN-32 
KingdomBacteria 
Replicon accessionNC_009438 
Strand
Start bp4634093 
End bp4637155 
Gene Length3063 bp 
Protein Length1020 aa 
Translation table11 
GC content50% 
IMG OID640501198 
ProductHsdR family type I site-specific deoxyribonuclease 
Protein accessionYP_001185488 
Protein GI146295064 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTAACG AACAAACCGT CACCGAAAAT GGAATTATCG ACCGCTTAAA GACTTTAGGC 
GGTGTCAAAT GGATATGCTG CCACGGCGAG AACCTGCCCA AGCAAGCGCA GGATATCTTC
GTTGACGAGT GGCTAAAAGA CGCCTTGTGT TCGTTAAACC CCGATATTGC CAAACAGCCG
GATTACGCCG ATGAGGTGAT TTATAAGCTG CGTGGTGTGG TGCTGGAAGC GCGTCACACC
GGCTTGGCAA AAGCCAATGA GAACTTCCAT GAATGGTTAA TGGCCGAGAA AACCTTGCCT
TTTGGCGAGA ATGGCGACCA TATCACCATC AACCTCATTG ATTTCGACAA CATAGCCAAC
AACCACTTTG TGGTGGCCCA GCAGGTGCAC TATATCGCGG CGACCGAAGT CTATTTTGAT
ATTGTGCTCT ATGTAAACGG GATTCCCTTG GTTGTGGGTG AAGTGAAAAC CGCTACCCGC
CCCAGTGTCA CCTGGCAAGA TGGCGCCGCC GATTTTATGG GCGGTAAAAA GCACTACTGG
AAGAACGTCG AACCCTACTT TGTGCCCAAC CTGCTCTGTT TTGCCAGTGA AGGAAAGACC
TTTGCTTATG GCGCCATTAA CGCCCGGGTA AAAGATTGGG GCCCTTGGCA CAATACCGAC
CTGCGTGATG ACATTTTGCC CGGTTTGGCA TCGGTACTGG ACAGCTGTGA GGGCTTGTTA
AACCCGCAAA CCTTATTGCA GTTACTGGAA TCTTTCGCGC TGTTTTCAAC GGTGAAAACC
GGTAAAAACA CCCCACCCAA GCGCATTAAA ATTCTGCCGC GTTACCCACA GTTTGAAGCA
GCCAAGCAAA TAGTAGGCCG TGTGCGCAAA GGCTACCCCA AAAAAGGCCT GATCTGGCAT
TTCCAGGGTT CCGGTAAATC CTTATTGATG CTCTACGCCG CTAAAATGTT GCGTGGTGAC
AATACCCTAA AGAACCCCAC CGTATTGATT GTGGTTGATC GCCGCGACCT AGACACGCAA
ATTAGCGAAA CCTTTGGTGG TGCAGACGTT AAAAACCTTA TCAAGGTACA AAGCTGCAAA
AAACTCGGCC AACATATCGA GCAAGACAGC CGTGGCATTT TAATTACCAC CATCTTTAAG
TTTAAAGATG TCGAGATCGA CGATAGTAAC CCCAATGGCC TTAACAACCG CGACAACATC
ATTGTGTTGG TGGACGAAGC CCACCGCACC CAAGAAGGCG GCTTGGGCGA AAAAATGCGC
TGGGCCCTGC CACAAGCGCA CTTCTATGGA TTAACCGGTA CACCGATTTC CGGTATCGAC
CGCAATACCT TCAAGTTATT TGGCGCCGAA GAAGACCCCG GCCGCTATAT GAACCGCTAC
AGCTACAAGC AGTCGATCCG CGATGGCGCC ACTAACCCAG TGAAGTTCGA ACCACGTTTG
GCGGAGCTAC GGGTAGACCG CGATGCCATC AACCAAGAAT TTGAGCAGCT CGCCAAAGAT
AATAACCTCG ACGACGAAGA AAAAGCGGCG CTCTCTAAAC GCGCGGGCAA ATTAGCGGTG
ATGCTTAAAG CGCCTAAACG CATGGCCGCC GTCAGCAATG ATATTGCCGA GCACTTTACC
AGCCACGTAA AGCCGAAAAA GATGAAAGGC ATGGTGGTGG TTTATGACCG TGATGCCTGC
GTTCAGATGT ATTACTTGCT GGGTGAAAAG CTAGGATTTG ATGCCGTTGA AGTGGTGATG
AACGTCGATC AGGCACCTGT AAAAGCCGAA GAAGGCGGTA AAAAAGACAA GCTCAATAAA
GACTGGCGTA AATGGAAGGA AGAGTTAGAC CTGCCGATTA AACAAGCAGA TTTTGAGCGC
TGGCAGCACA TAGATGCAGA AGACCAAACC CAGAAAGACT TAATCGAAAG CTTTAAAGAC
CCCAAACATC CGTTGCAGCT GATCATCGTC ACCGCTAAGC TACTTACCGG CTTTGACGCG
CCGATCTGCT ACTGCATGTA CTTAGATAAA CCGCTGCGCG ACCACACGCT GCTACAGGCC
ATGTGCCGCA CTAACCGTTT GTACGAAACA GACGATGTGC GCAAAGACAT GGGCCTGATC
ATCGACTACC TAGGCGTGTT TGAAAATCTG CGTACCGCGC TTGCTTATAA TCCTGATGAG
ATCGAAGGTG TTGTCGAAGG CATTGAAGCC TTTAAAGAAT TATTACCCGC CCAGTTAGAC
AAGTGTTTGT CCTTTTTCCC GGGTGTTGAC CGCACCTTAG AAGGCTTCGA AGGCATTATG
GCGGCGCAAG AGTGCTTGCC AACTAATGAA AAACGCGATG AATTTGCCGC CAGCTTTGGT
GTGTTAGCCA AGCTTTGGTC GGCCATTAAC CCAGACACCT TCTTAGGGCC GTATCGCAAA
GACTATAAAT GGTTGGCACA GATTTATGAA TCGGTGCGCC CAGTCGGCCA GACCGGTGCT
TTGGTGTGGG CGGCACTCGG TCCTGAAACC ATCAAAATGA TCCATGAACA TACCGACATC
AATCGCATTC GCGACGATAT CGACGAGCTG ATCATGGATG AGCACGCCAT TTTTACCCTG
ACTGACAAAG AACAAGAACA ACGCGCCAAA CGCCTCGAAA TTGACTTGAT GGGCCGTCTG
CGTGGCAGCA ACGACCCTAA GTTTGTCGAG TTAGGTGAAC GCTTAGAAAA ACTGCGCCAA
GACTATGAGT CAGGCGTTAT CAAAGCGATC GATTGGTTAA AAGGCCTGTT AGATGCGGCT
AAAGACACGG TGCAAGCCGA GCGCGAAACC GGCGAGCATG TGGTGACCGA AGAAGACAAC
AAGCAGGCCT TAACCAAACT CTTCTTGGAA ACCCGCCCAG AAACCACGCC GAAACTAATC
GGCGATGTGG TAGAGCAGAT CGACAAGATC GTAAAAGCCA CCCGGTTTGA AGGCTGGCAA
AACTCCAACA GCGGCCCGCG CGAAATCCAA AAGGCGCTGC TGGTGACATT GGCCCAGTTT
GGCTTGGGTA AAGATAAAGA GCTGTTTGCC AAGGCGTATG GGTATATAGA AGAGCATTAT
TGA
 
Protein sequence
MFNEQTVTEN GIIDRLKTLG GVKWICCHGE NLPKQAQDIF VDEWLKDALC SLNPDIAKQP 
DYADEVIYKL RGVVLEARHT GLAKANENFH EWLMAEKTLP FGENGDHITI NLIDFDNIAN
NHFVVAQQVH YIAATEVYFD IVLYVNGIPL VVGEVKTATR PSVTWQDGAA DFMGGKKHYW
KNVEPYFVPN LLCFASEGKT FAYGAINARV KDWGPWHNTD LRDDILPGLA SVLDSCEGLL
NPQTLLQLLE SFALFSTVKT GKNTPPKRIK ILPRYPQFEA AKQIVGRVRK GYPKKGLIWH
FQGSGKSLLM LYAAKMLRGD NTLKNPTVLI VVDRRDLDTQ ISETFGGADV KNLIKVQSCK
KLGQHIEQDS RGILITTIFK FKDVEIDDSN PNGLNNRDNI IVLVDEAHRT QEGGLGEKMR
WALPQAHFYG LTGTPISGID RNTFKLFGAE EDPGRYMNRY SYKQSIRDGA TNPVKFEPRL
AELRVDRDAI NQEFEQLAKD NNLDDEEKAA LSKRAGKLAV MLKAPKRMAA VSNDIAEHFT
SHVKPKKMKG MVVVYDRDAC VQMYYLLGEK LGFDAVEVVM NVDQAPVKAE EGGKKDKLNK
DWRKWKEELD LPIKQADFER WQHIDAEDQT QKDLIESFKD PKHPLQLIIV TAKLLTGFDA
PICYCMYLDK PLRDHTLLQA MCRTNRLYET DDVRKDMGLI IDYLGVFENL RTALAYNPDE
IEGVVEGIEA FKELLPAQLD KCLSFFPGVD RTLEGFEGIM AAQECLPTNE KRDEFAASFG
VLAKLWSAIN PDTFLGPYRK DYKWLAQIYE SVRPVGQTGA LVWAALGPET IKMIHEHTDI
NRIRDDIDEL IMDEHAIFTL TDKEQEQRAK RLEIDLMGRL RGSNDPKFVE LGERLEKLRQ
DYESGVIKAI DWLKGLLDAA KDTVQAERET GEHVVTEEDN KQALTKLFLE TRPETTPKLI
GDVVEQIDKI VKATRFEGWQ NSNSGPREIQ KALLVTLAQF GLGKDKELFA KAYGYIEEHY