Gene Sbal223_4470 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_4470 
Symbol 
ID7094387 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011668 
Strand
Start bp51610 
End bp54864 
Gene Length3255 bp 
Protein Length1084 aa 
Translation table11 
GC content43% 
IMG OID643467333 
Producttype I site-specific deoxyribonuclease, HsdR family 
Protein accessionYP_002364291 
Protein GI217980315 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value0.566129 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones74 
Fosmid unclonability p-value0.610404 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCACTG AAGACCAGTT AGAACAAGAA TGTATTAGAT GGTTCACCGA CCAAGGTTAT 
CTGTATAAAA ATGGCTATGA CATCGCACCT GATGGCGATT CTCCAGAGCG TGATGATTAT
CATCAAGTTG TTTTAAAACA ACGGCTATTA AATCAATTAG CAATCATCAA TCCAGAGCTT
CCGATAGAGG CACTGAATGA TGTGGTTAAC ACGGTATCTT CCCCCGATAC GCCAATTCTT
ATTAAAAACA ACCGTGTATT CCACAAGTTT GTGATTGAAG GGGTGCCTGT CGAATACACG
GCAGTTGAAG ACGGCGAATC GAAAACTAAG CATACTCACG CTCAGTTAAT GGATTTTACG
ACGACTGATA ACAATGAATT CCTTATCGTG AATCAGTTTA CGATTACAGG TACAAAAGGG
AATCGCCGTC CCGATGTTTT GGTGTTTATT AACGGCTTGC CCATCTCTGT TATTGAGTTG
AAAAATCCAG CCGATGAGCA TGCCGACATT TGGAATGCCT TTAATCAACT TCAAACCTAC
AAAGATGAAA TTTCAGATCT GTTTATTTTC AATGAAGCGC TGGTTGTTAG CGATGGTTGG
ACTGCCCGTG TTGGCTCATT GACGGCGAAT AAAGAGCGTT TCCTACCATG GAAAACTGTA
ACTGGCGAAG ACGATAAACC CTTACTGGAG TTTCAACTAG AAACCATGGT GCGTGGTTTC
TTCAAACCAG ATTTGCTACT TGATTATATT CGTTACTTTA TTCTCTTTGA GATCGATAAT
GACAACATCA TTAAGAAGAT AGCTGGCTAT CACCAGTTTC ATGCTGTACG TGCTGCGGTA
GAAGCGACCG TTAAAGCTAA ACAAGCAGAT AGCGATTTAC CTATGGTTGC TGAGAGTATA
GGCAAATATC AGGTTCAAGC GACAAAAGGC TTAGATAAGA TTAAAGCGGG TAGCGGTAAG
GCTGGAGTGG TTTGGCATAC TCAGGGCAGT GGTAAGAGTA TCTCGATGGT GTGCTACGCC
AGTAAGCTGC TGCAACAGCC TGCAATGAAT AACCCCACTA TTGTGGTGGT GACAGACCGT
AATGATCTTG ATGGTCAGCT TTACAATACT TTTGGTATGG CACAAGAGAC ATTAAAGCAG
ATCCCACAAC AGGCAGATGG TCGAGACTTG TTGCGTCAAT TACTGTTAAA TCGTCAATCG
GGTGGGATCA TTTTTACCAC CATTCAAAAG TTTGCATTAT TAGACGATGA AATTGAGCAT
CCTATCCTGT CAACGCGTGC CAACATTGTA GTGGTGTCTG ATGAAGCGCA CCGCAGCCAA
TATGGCAATA AGTCGAAGTT AGTCGAGGTT AAAGACGCCA ATGGCAATGT GACAGGCTCT
AAGTATGTCT TTGGTTACTC AAAATACATG CGCGATGCGC TGCCAAATGC TTCATTCATT
GGCTTCACAG GCACTCCGAT TGCCATGGAT GATAAAGATA CACGTGGTGT GTTTGGCGAG
TATGTCTCTG TCTATGATAT TCAAGATGCG GTAGATGATG GTGCCACAGT ACCAATCTAC
TATGAATCTC GTTTAGCTAA GCTTGATATC AATCAAGCTG AGATTGAGGA GCTTAACGAC
CAAGTCGAAG ATGAAATCGG TGAGGAAGAA GAGACAGCTA GCCGTGAACA AATCAAATCA
CAATGGGCAA CACTAGAAAA GCTCGTTGGT GCTGAGCCTC GTATTCAGCA AGTCGCTCTC
GATTTGGTGA ACCATTTTAG CACTCGCAGC GAGACGTTTC CCGGCAAAGC CATGATTGTA
GCGATGAGCC GTGAGATCTG TGTCGATCTT TACAATGCTA TTGTCGCAAT AAAGCCTGAA
TGGCATCACC CCGATCCCGA TAAAGGCGCA ATCAAAATTG TGATGACGGG CAGCGCCTCT
GACAAAGAGA AGATGCAGCC TCATATTCAT GATAAGAAGA CCAAAAAGCT GTTTGAAAAA
CGCTATAAAG ATACCCAAGA CGAACTTCAA TTGGTGATTG TGCGTGATAT GTGGCTAACA
GGCTTCGATG CGCCATGTTG TCATACCATG TATATCGATA AGCCGATGAA GGGGCACAAT
TTGATGCAGG CTATTGCACG CGTAAATCGT GTGTTTAAAG ATAAGCCGGG CGGTTTAGTG
GTTGATTATA TCGGTATTGC CAATGAGCTA AAAAATGCCC TTAAAACCTA TACAAATAGC
CAAGGTAAAG GGCAACCAAC AGTTGATACA GCTGAAGCGT TTTCAGTATT AATGGAAAAA
ATTGATATTG TGAGAGGTAT GTTTGCCACG CCAGTTGATG GTGCCGTATT TAACTACCGA
CCTGAGTTTG AAACGAATGC ATTGCGTCTG TTGCCTGGGG CTGTAAACCA TTTATCAGGT
TTATCTCATC GCAATAGTAA AGGTGAAGAG GTACGAGATG GTAAACGGCG CTTCTTGGAT
GTAATGGCGG CAATGACTAA AGCGTTTTCT CTATGCAATA CCATGGATGA AACCCAAGGT
TATAAAAACG AAATAGCTTT CTACTCTGCG ATTAAAACCG CTTTTATCAA ACATTCAACA
GTTGATAAAA AACGCAGTGA TGAAGAACGT AACACAGCAT TAAAGCAGAT ATTAGATAAC
GCAATTATTG CTGATGGTGT TGATGACATC TTTAATATGG TGGGGTTAGA TAAACCTAAT
ATTGGTTTAC TCTCTGAGGA GTTCTTGGAA GATGTGAAGA ACATGAAAGA GAAGAACCTT
GCCGTTGAGT TACTAGAGAA GTTGCTGCGT GATGAGGTGA AAGCGCGGAT GAAGAATGAT
GTGGTGCAGG AGAAGAAGTA TTCTGAGCGC ATCATGACAA CACTGCAAAA GTATCATAAT
CGCAGTATTG AAACGGCTCA GGTTATTGAA GAACTAATCA AATGGGCGAA AGAGATGGCT
GCTGATGCTG AGATGGCGGA AGGTCTAAAT CTATCTCCTG ATGAAATTGC TTTTTATCGA
GCATTGGTCA TGAACGAAGC TTCAGTGCGA GAACTGGGTG ATGATAGTTT GCGTGGTTTA
GCTATCGAGT TAACTCAACA GCTGCGTAAG TCAGCTACGG TAGACTGGCA AAAGCGTGAT
AGTGTTCGAG CACGTATGCG TAATTTAGTT CGTCGTCTGC TGCGCCGCTG GAAGTATCCA
CCTGACGCCG CAGAAGCTGC GATTAAGTTA GTGCTTGAGC AAGCAGAAGT ACTGGCTGAT
GACTGGTATA GGTAA
 
Protein sequence
MITEDQLEQE CIRWFTDQGY LYKNGYDIAP DGDSPERDDY HQVVLKQRLL NQLAIINPEL 
PIEALNDVVN TVSSPDTPIL IKNNRVFHKF VIEGVPVEYT AVEDGESKTK HTHAQLMDFT
TTDNNEFLIV NQFTITGTKG NRRPDVLVFI NGLPISVIEL KNPADEHADI WNAFNQLQTY
KDEISDLFIF NEALVVSDGW TARVGSLTAN KERFLPWKTV TGEDDKPLLE FQLETMVRGF
FKPDLLLDYI RYFILFEIDN DNIIKKIAGY HQFHAVRAAV EATVKAKQAD SDLPMVAESI
GKYQVQATKG LDKIKAGSGK AGVVWHTQGS GKSISMVCYA SKLLQQPAMN NPTIVVVTDR
NDLDGQLYNT FGMAQETLKQ IPQQADGRDL LRQLLLNRQS GGIIFTTIQK FALLDDEIEH
PILSTRANIV VVSDEAHRSQ YGNKSKLVEV KDANGNVTGS KYVFGYSKYM RDALPNASFI
GFTGTPIAMD DKDTRGVFGE YVSVYDIQDA VDDGATVPIY YESRLAKLDI NQAEIEELND
QVEDEIGEEE ETASREQIKS QWATLEKLVG AEPRIQQVAL DLVNHFSTRS ETFPGKAMIV
AMSREICVDL YNAIVAIKPE WHHPDPDKGA IKIVMTGSAS DKEKMQPHIH DKKTKKLFEK
RYKDTQDELQ LVIVRDMWLT GFDAPCCHTM YIDKPMKGHN LMQAIARVNR VFKDKPGGLV
VDYIGIANEL KNALKTYTNS QGKGQPTVDT AEAFSVLMEK IDIVRGMFAT PVDGAVFNYR
PEFETNALRL LPGAVNHLSG LSHRNSKGEE VRDGKRRFLD VMAAMTKAFS LCNTMDETQG
YKNEIAFYSA IKTAFIKHST VDKKRSDEER NTALKQILDN AIIADGVDDI FNMVGLDKPN
IGLLSEEFLE DVKNMKEKNL AVELLEKLLR DEVKARMKND VVQEKKYSER IMTTLQKYHN
RSIETAQVIE ELIKWAKEMA ADAEMAEGLN LSPDEIAFYR ALVMNEASVR ELGDDSLRGL
AIELTQQLRK SATVDWQKRD SVRARMRNLV RRLLRRWKYP PDAAEAAIKL VLEQAEVLAD
DWYR