Gene Information |
Locus tag | Sputcn32_3986 |
Symbol | |
ID | 5081469 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella putrefaciens CN-32 |
Kingdom | Bacteria |
Replicon accession | NC_009438 |
Strand | + |
Start bp | 4634093 |
End bp | 4637155 |
Gene Length | 3063 bp |
Protein Length | 1020 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640501198 |
Product | HsdR family type I site-specific deoxyribonuclease |
Protein accession | YP_001185488 |
Protein GI | 146295064 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
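The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a gene span that includes the terminal stop codon; both assumptions match the values in this record but are not stated by it. The function names are hypothetical and chosen for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose span includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(4634093, 4637155)
print(length)                     # 3063
print(protein_length_aa(length))  # 1020
```

Both results agree with the Gene Length (3063 bp) and Protein Length (1020 aa) rows.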
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTAACG AACAAACCGT CACCGAAAAT GGAATTATCG ACCGCTTAAA GACTTTAGGC GGTGTCAAAT GGATATGCTG CCACGGCGAG AACCTGCCCA AGCAAGCGCA GGATATCTTC GTTGACGAGT GGCTAAAAGA CGCCTTGTGT TCGTTAAACC CCGATATTGC CAAACAGCCG GATTACGCCG ATGAGGTGAT TTATAAGCTG CGTGGTGTGG TGCTGGAAGC GCGTCACACC GGCTTGGCAA AAGCCAATGA GAACTTCCAT GAATGGTTAA TGGCCGAGAA AACCTTGCCT TTTGGCGAGA ATGGCGACCA TATCACCATC AACCTCATTG ATTTCGACAA CATAGCCAAC AACCACTTTG TGGTGGCCCA GCAGGTGCAC TATATCGCGG CGACCGAAGT CTATTTTGAT ATTGTGCTCT ATGTAAACGG GATTCCCTTG GTTGTGGGTG AAGTGAAAAC CGCTACCCGC CCCAGTGTCA CCTGGCAAGA TGGCGCCGCC GATTTTATGG GCGGTAAAAA GCACTACTGG AAGAACGTCG AACCCTACTT TGTGCCCAAC CTGCTCTGTT TTGCCAGTGA AGGAAAGACC TTTGCTTATG GCGCCATTAA CGCCCGGGTA AAAGATTGGG GCCCTTGGCA CAATACCGAC CTGCGTGATG ACATTTTGCC CGGTTTGGCA TCGGTACTGG ACAGCTGTGA GGGCTTGTTA AACCCGCAAA CCTTATTGCA GTTACTGGAA TCTTTCGCGC TGTTTTCAAC GGTGAAAACC GGTAAAAACA CCCCACCCAA GCGCATTAAA ATTCTGCCGC GTTACCCACA GTTTGAAGCA GCCAAGCAAA TAGTAGGCCG TGTGCGCAAA GGCTACCCCA AAAAAGGCCT GATCTGGCAT TTCCAGGGTT CCGGTAAATC CTTATTGATG CTCTACGCCG CTAAAATGTT GCGTGGTGAC AATACCCTAA AGAACCCCAC CGTATTGATT GTGGTTGATC GCCGCGACCT AGACACGCAA ATTAGCGAAA CCTTTGGTGG TGCAGACGTT AAAAACCTTA TCAAGGTACA AAGCTGCAAA AAACTCGGCC AACATATCGA GCAAGACAGC CGTGGCATTT TAATTACCAC CATCTTTAAG TTTAAAGATG TCGAGATCGA CGATAGTAAC CCCAATGGCC TTAACAACCG CGACAACATC ATTGTGTTGG TGGACGAAGC CCACCGCACC CAAGAAGGCG GCTTGGGCGA AAAAATGCGC TGGGCCCTGC CACAAGCGCA CTTCTATGGA TTAACCGGTA CACCGATTTC CGGTATCGAC CGCAATACCT TCAAGTTATT TGGCGCCGAA GAAGACCCCG GCCGCTATAT GAACCGCTAC AGCTACAAGC AGTCGATCCG CGATGGCGCC ACTAACCCAG TGAAGTTCGA ACCACGTTTG GCGGAGCTAC GGGTAGACCG CGATGCCATC AACCAAGAAT TTGAGCAGCT CGCCAAAGAT AATAACCTCG ACGACGAAGA AAAAGCGGCG CTCTCTAAAC GCGCGGGCAA ATTAGCGGTG ATGCTTAAAG CGCCTAAACG CATGGCCGCC GTCAGCAATG ATATTGCCGA GCACTTTACC AGCCACGTAA AGCCGAAAAA GATGAAAGGC ATGGTGGTGG TTTATGACCG TGATGCCTGC GTTCAGATGT ATTACTTGCT GGGTGAAAAG CTAGGATTTG ATGCCGTTGA AGTGGTGATG AACGTCGATC AGGCACCTGT AAAAGCCGAA GAAGGCGGTA AAAAAGACAA GCTCAATAAA 
GACTGGCGTA AATGGAAGGA AGAGTTAGAC CTGCCGATTA AACAAGCAGA TTTTGAGCGC TGGCAGCACA TAGATGCAGA AGACCAAACC CAGAAAGACT TAATCGAAAG CTTTAAAGAC CCCAAACATC CGTTGCAGCT GATCATCGTC ACCGCTAAGC TACTTACCGG CTTTGACGCG CCGATCTGCT ACTGCATGTA CTTAGATAAA CCGCTGCGCG ACCACACGCT GCTACAGGCC ATGTGCCGCA CTAACCGTTT GTACGAAACA GACGATGTGC GCAAAGACAT GGGCCTGATC ATCGACTACC TAGGCGTGTT TGAAAATCTG CGTACCGCGC TTGCTTATAA TCCTGATGAG ATCGAAGGTG TTGTCGAAGG CATTGAAGCC TTTAAAGAAT TATTACCCGC CCAGTTAGAC AAGTGTTTGT CCTTTTTCCC GGGTGTTGAC CGCACCTTAG AAGGCTTCGA AGGCATTATG GCGGCGCAAG AGTGCTTGCC AACTAATGAA AAACGCGATG AATTTGCCGC CAGCTTTGGT GTGTTAGCCA AGCTTTGGTC GGCCATTAAC CCAGACACCT TCTTAGGGCC GTATCGCAAA GACTATAAAT GGTTGGCACA GATTTATGAA TCGGTGCGCC CAGTCGGCCA GACCGGTGCT TTGGTGTGGG CGGCACTCGG TCCTGAAACC ATCAAAATGA TCCATGAACA TACCGACATC AATCGCATTC GCGACGATAT CGACGAGCTG ATCATGGATG AGCACGCCAT TTTTACCCTG ACTGACAAAG AACAAGAACA ACGCGCCAAA CGCCTCGAAA TTGACTTGAT GGGCCGTCTG CGTGGCAGCA ACGACCCTAA GTTTGTCGAG TTAGGTGAAC GCTTAGAAAA ACTGCGCCAA GACTATGAGT CAGGCGTTAT CAAAGCGATC GATTGGTTAA AAGGCCTGTT AGATGCGGCT AAAGACACGG TGCAAGCCGA GCGCGAAACC GGCGAGCATG TGGTGACCGA AGAAGACAAC AAGCAGGCCT TAACCAAACT CTTCTTGGAA ACCCGCCCAG AAACCACGCC GAAACTAATC GGCGATGTGG TAGAGCAGAT CGACAAGATC GTAAAAGCCA CCCGGTTTGA AGGCTGGCAA AACTCCAACA GCGGCCCGCG CGAAATCCAA AAGGCGCTGC TGGTGACATT GGCCCAGTTT GGCTTGGGTA AAGATAAAGA GCTGTTTGCC AAGGCGTATG GGTATATAGA AGAGCATTAT TGA
Protein sequence | MFNEQTVTEN GIIDRLKTLG GVKWICCHGE NLPKQAQDIF VDEWLKDALC SLNPDIAKQP DYADEVIYKL RGVVLEARHT GLAKANENFH EWLMAEKTLP FGENGDHITI NLIDFDNIAN NHFVVAQQVH YIAATEVYFD IVLYVNGIPL VVGEVKTATR PSVTWQDGAA DFMGGKKHYW KNVEPYFVPN LLCFASEGKT FAYGAINARV KDWGPWHNTD LRDDILPGLA SVLDSCEGLL NPQTLLQLLE SFALFSTVKT GKNTPPKRIK ILPRYPQFEA AKQIVGRVRK GYPKKGLIWH FQGSGKSLLM LYAAKMLRGD NTLKNPTVLI VVDRRDLDTQ ISETFGGADV KNLIKVQSCK KLGQHIEQDS RGILITTIFK FKDVEIDDSN PNGLNNRDNI IVLVDEAHRT QEGGLGEKMR WALPQAHFYG LTGTPISGID RNTFKLFGAE EDPGRYMNRY SYKQSIRDGA TNPVKFEPRL AELRVDRDAI NQEFEQLAKD NNLDDEEKAA LSKRAGKLAV MLKAPKRMAA VSNDIAEHFT SHVKPKKMKG MVVVYDRDAC VQMYYLLGEK LGFDAVEVVM NVDQAPVKAE EGGKKDKLNK DWRKWKEELD LPIKQADFER WQHIDAEDQT QKDLIESFKD PKHPLQLIIV TAKLLTGFDA PICYCMYLDK PLRDHTLLQA MCRTNRLYET DDVRKDMGLI IDYLGVFENL RTALAYNPDE IEGVVEGIEA FKELLPAQLD KCLSFFPGVD RTLEGFEGIM AAQECLPTNE KRDEFAASFG VLAKLWSAIN PDTFLGPYRK DYKWLAQIYE SVRPVGQTGA LVWAALGPET IKMIHEHTDI NRIRDDIDEL IMDEHAIFTL TDKEQEQRAK RLEIDLMGRL RGSNDPKFVE LGERLEKLRQ DYESGVIKAI DWLKGLLDAA KDTVQAERET GEHVVTEEDN KQALTKLFLE TRPETTPKLI GDVVEQIDKI VKATRFEGWQ NSNSGPREIQ KALLVTLAQF GLGKDKELFA KAYGYIEEHY
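The relationship between the gene sequence, the GC content field, and the protein sequence can be illustrated with a short sketch. It is a minimal example, not the database's own pipeline. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon table is written in the compact TCAG ordering; the amino acid assignments shown are those of the standard code, which translation table 11 shares. Table 11's alternative start codons are not handled, since this gene begins with ATG.

```python
from itertools import product

# Amino acid assignments in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used for 10-base blocks; uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    s = clean(seq)
    if len(s) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(s), 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGTTTAACG AACAAACCGT CACCGAAAAT"))  # MFNEQTVTEN
```

Passing the full gene sequence to `translate` and `gc_content` would allow the Protein sequence and GC content fields to be checked in the same way.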