Gene Sbal223_3473 details

Gene Information

Locus tag: Sbal223_3473
Symbol:
ID: 7086087
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS223
Kingdom: Bacteria
Replicon accession: NC_011663
Strand:
Start bp: 4131098
End bp: 4134211
Gene Length: 3114 bp
Protein Length: 1037 aa
Translation table: 11
GC content: 44%
IMG OID: 643462358
Product: type I site-specific deoxyribonuclease, HsdR family
Protein accession: YP_002359379
Protein GI: 217974628
COG category: [V] Defense mechanisms
COG ID: [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID: [TIGR00348] type I site-specific deoxyribonuclease, HsdR family
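
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4131098
END_BP = 4134211
GENE_LENGTH_BP = 3114
PROTEIN_LENGTH_AA = 1037


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS; the stop codon is not counted."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```

Under these assumptions the fields agree: 4134211 - 4131098 + 1 = 3114 bp, and 3114 / 3 - 1 = 1037 aa.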


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGACTT ATAAAGCGGT GGCAGAATCC AATAACTTTA TTGTGTTAGA TAAATACACT 
AAAGAGTGGC AGGTTAACGA AAGCTTTCAA AGCGAAGGTG ATTTAGAACG TGAGCTGATT
CAGGACCTAA GCAACCAGGG CTATGAGTAT TTATCGGCAC TAACCACGCC CGATGCCATG
CTGGCGAATG TGCGCAAGCA GCTACAAGCC CTTAATAAGG TCGAATTTAC CGATGCCGAG
TGGGCTCGCT TTTGCGAGCA ATACCTGAAC ACACCAAGCG ACAATATTCT TGATAAAACT
CGCAAAGTGC ACAGTGACTA TATTTTCGAT TTTGTCTTTG ATGATGGCCA TATAGAAAAC
ATCTACCTGT TTGATAAGAA GACCATTGCT CGTAACAAAG TGCAGGTCAT TAAGCAGTTT
GAGCAAACGG GAAGCCATGC CAACCGTTAC GATGTGACGA TTTTGATCAA TGGTTTGCCT
TTGGTGCAAA TCGAACTGAA AAAGCGTGGT GTGGCGATCC GTGAGGCGTT TAATCAAATA
CATCGCTACA GCAAAGAAAG CTTTAATAGC GATCACTCCT TGTACAAGTT CTTGCAGCTT
TTTGTGATCT CTAATGGCAC CGACACGCGC TATTTTGCCA ATACCACCAA GCGCGACAAA
AACAGCTTCG ATTTTACGAT GAACTGGGCG AAGTCAGACA ATACGCTGAT CAAGGATTTA
AAAGACTTTA CCGCCACTTT TTTCCAGAAA GAGACCTTGC TTGAAGTGAT CTTGCGTTAC
TCGGTGTTTG ACGTCAGCAA TACCTTACTT GTAATGCGCC CTTACCAAAT TGCGGCTACT
GAGCGCATTT TATGGAAGAT AAAGAGCGCG TTTAACGCTA AGAACTGGAG CAGCACCGAG
AGCGGTGGGT ACATTTGGCA CACTACGGGT TCAGGCAAAA CACTCACCAG CTTTAAAGCG
GCACGTTTGG CCACCGATTT AGACTGTATC GATAAAGTCT TCTTTGTGGT GGACCGAAAA
GACCTCGACT ATCAGACGAT GAAAGAGTAT CAGCGGTTCT CGCCTGATAG TGTGAACGGT
TCAGACAGTA CCGCAGGGCT TAAGAGTAAC CTGGATAAAG ACGATAACAA GATTGTCGTC
ACCACAATCC AAAAGCTCAA CAACTTAATC AAAAGCGAAA GCGATTTAGC TATATACAAC
AAGCAAGTCG TTTTTATTTT TGATGAGTGT CATCGAAGCC AGTTCGGTGA GGCGCAGAAA
AACCTGAAGA AGAAATTCAA AAAGTACTAC CAGTTTGGCT TCACAGGTAC GCCCATTTTC
CCAGAAAATG CACTGGGCGC AGAAACCACG GGTAGCGTGT TTGGCCAGCA GTTGCACACG
TATGTGATCA CCGATGCCAT TCGTGATGAA AAAGTACTCA AGTTCAAAGT GGACTATAAC
GATGTTCGTC CGCAATTTAA GAGCATTGAG ACTGAGCAGG ATGATAAGAA GCTTAGTGCG
GCGGAGAATA AGAAAGCCTT CTTACATCCA ATGCGCATTG GTGAAATCTC GCAGTATATT
CTGAGCCATT TTAACCAGAA AACACATCGA GCCTATTCTG GCGCCAAAGG GTTTAACGCC
ATGTTTGCGG TCAGTAGTGT TGATGCGGCG AAAGCCTACT ATGAAACCTT CAAAACATTG
CAGGCCGAGG CTGAGAATGG ACCTACAAGT AAACCTCTGC GCATTGCAAC TATCTTCTCA
TTTGCTGCCA ATGAAGAGCA GGACGCCATT GGTGATATTT TAGATGAGAG CTTTGAAGTA
AGTAGCATGA ACAGCAGCGC AAAAGAGTTT TTGAGCGCCG CGATTAAAGA TTACAACGCC
ATGTTTAAAT CTAACTATGG CGTGGACAGT AATGGCTTTC AGAACTACTA CCGTGACCTT
GCTCAACGGG TGAAGAATCA AGAAATAGAT CTGCTCATAG TGGTGGGGAT GTTTTTAACG
GGCTTTGATG CGCCTACGCT TAACACCTTG TTTGTTGATA AGAATTTGCG CTATCACGGC
TTGATGCAGG CATTTTCTCG CACTAACCGC ATTTATGATG CGACGAAGAC CTTTGGCAAT
ATTGTTACCT TCCGTGATTT GGAACAGGCA ACCATTGATG CAATCACCTT GTTTGGTGAC
AAAAACACCA AAAACGTGGT GTTGGAGAAG AGCTACAAAG AATACATGGA AGGCTTTAAC
GATATTGCTA CTGGCCAAGC GCGTCGAGGT TTTGTTGATA TCGTCAAAGA GCTGCAAGAA
CGCTTTCCAA ACCCAGAAAA TATTGAAAAA GAACAAGATA AAAAAGAGTT TGCCAAGCTG
TTTGGTGAAT ACCTAAAAGC AGAAAATATT CTACAAAACT ACGATGAGTT TGCTGGATTA
AAAGCACTGC AAAGCCTAGA TACCAGTGAT GAACAAGCGG TTGAAGAGTT TAAGTCAAAG
TACTATCTGG ACGATGCTGA CATCGAGGCA ATGCAAACCA TCGATATGCC GAGCGAACGT
GCTATCCAAG ATTATCGCTC TACCTACAAC GATACCCGCG ACTGGTTGCG ACGCGATAAA
GAAGGTCAGG ATAAAGATAA GTCAACGGTA GACTGGGACG ATGTGGTCTT TGAAATTGAT
TTGCTTAAAT CGCAAGAAAT CAACCTCGAT TACATTCTTG AACTGATTTT TGATCACAAT
AAGAAGAACA AGAGCAAAGA AGGCTTGATT GAAGAAGTTC GTCGTATGAT CCGCGGTAGC
CTTGGCAATA GAGCGAAAGA GAGCTTGGTG GTTGATTTTA TTAACCAGAC AAACCTAGAT
GAGATCAATG ACAAAGCCAG CATTATCGAC GCTTTCTTCA AATTCGCCCA GGCGGAGCAG
AAGCGCGAAG CCGATGCCAT TATTGCCTCT GAAAAGCTCA ATATCGATGC TGCCAAACGT
TATATTGCAT CATCACTCAA ACGTGAGTAC GCCTCTGAAA ATGGCACTGC TCTTAATGAG
GCGCTGCCCA AACTTAGCCC ATTGAACCCA CAGTACCGAA CCCAGAAGCA AACGGTTTTT
CAGAAAATTG CCGCTTTTGT TGAAAAATTC AAAGGTGTGG GTGGGCAGTT GTAG
 
Protein sequence
MTTYKAVAES NNFIVLDKYT KEWQVNESFQ SEGDLERELI QDLSNQGYEY LSALTTPDAM 
LANVRKQLQA LNKVEFTDAE WARFCEQYLN TPSDNILDKT RKVHSDYIFD FVFDDGHIEN
IYLFDKKTIA RNKVQVIKQF EQTGSHANRY DVTILINGLP LVQIELKKRG VAIREAFNQI
HRYSKESFNS DHSLYKFLQL FVISNGTDTR YFANTTKRDK NSFDFTMNWA KSDNTLIKDL
KDFTATFFQK ETLLEVILRY SVFDVSNTLL VMRPYQIAAT ERILWKIKSA FNAKNWSSTE
SGGYIWHTTG SGKTLTSFKA ARLATDLDCI DKVFFVVDRK DLDYQTMKEY QRFSPDSVNG
SDSTAGLKSN LDKDDNKIVV TTIQKLNNLI KSESDLAIYN KQVVFIFDEC HRSQFGEAQK
NLKKKFKKYY QFGFTGTPIF PENALGAETT GSVFGQQLHT YVITDAIRDE KVLKFKVDYN
DVRPQFKSIE TEQDDKKLSA AENKKAFLHP MRIGEISQYI LSHFNQKTHR AYSGAKGFNA
MFAVSSVDAA KAYYETFKTL QAEAENGPTS KPLRIATIFS FAANEEQDAI GDILDESFEV
SSMNSSAKEF LSAAIKDYNA MFKSNYGVDS NGFQNYYRDL AQRVKNQEID LLIVVGMFLT
GFDAPTLNTL FVDKNLRYHG LMQAFSRTNR IYDATKTFGN IVTFRDLEQA TIDAITLFGD
KNTKNVVLEK SYKEYMEGFN DIATGQARRG FVDIVKELQE RFPNPENIEK EQDKKEFAKL
FGEYLKAENI LQNYDEFAGL KALQSLDTSD EQAVEEFKSK YYLDDADIEA MQTIDMPSER
AIQDYRSTYN DTRDWLRRDK EGQDKDKSTV DWDDVVFEID LLKSQEINLD YILELIFDHN
KKNKSKEGLI EEVRRMIRGS LGNRAKESLV VDFINQTNLD EINDKASIID AFFKFAQAEQ
KREADAIIAS EKLNIDAAKR YIASSLKREY ASENGTALNE ALPKLSPLNP QYRTQKQTVF
QKIAAFVEKF KGVGGQL
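
The sequences above are printed in space-separated blocks of 10 characters, 60 per line, so they need to be joined before any analysis. The following is a minimal sketch of that step, with a GC-fraction helper and a rough completeness check. All function names are hypothetical. The completeness check is a simplification: it only accepts ATG as a start codon, although translation table 11 also permits alternative starts.

```python
def normalize(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str, stops=("TAA", "TAG", "TGA")) -> bool:
    """Rough check: length divisible by 3, ATG start, recognized stop codon.

    Simplification: alternative start codons allowed by table 11 are ignored.
    """
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in stops


# First and last lines of the gene sequence above, used only as sample input.
first_line = "ATGACGACTT ATAAAGCGGT GGCAGAATCC AATAACTTTA TTGTGTTAGA TAAATACACT"
last_line = "CAGAAAATTG CCGCTTTTGT TGAAAAATTC AAAGGTGTGG GTGGGCAGTT GTAG"

print(len(normalize(first_line)))         # 60
print(gc_fraction(normalize(first_line)))
```

Pasting the full gene sequence into `normalize` and passing the result to `gc_fraction` gives a value that can be compared with the GC content field of the record.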