Gene Information |
Locus tag | Sbal223_4470 |
Symbol | |
ID | 7094387 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011668 |
Strand | - |
Start bp | 51610 |
End bp | 54864 |
Gene Length | 3255 bp |
Protein Length | 1084 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 643467333 |
Product | type I site-specific deoxyribonuclease, HsdR family |
Protein accession | YP_002364291 |
Protein GI | 217980315 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
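
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 51610
end_bp = 54864
gene_length_bp = 3255
protein_length_aa = 1084


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus one stop codon (assumes a complete, unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coordinates_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coordinates_consistent, protein_consistent)
```

Under these assumptions both checks pass: 54864 - 51610 + 1 = 3255 bp, and 3255 / 3 - 1 = 1084 aa.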
| |
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 0.566129 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 74 |
Fosmid unclonability p-value | 0.610404 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCACTG AAGACCAGTT AGAACAAGAA TGTATTAGAT GGTTCACCGA CCAAGGTTAT CTGTATAAAA ATGGCTATGA CATCGCACCT GATGGCGATT CTCCAGAGCG TGATGATTAT CATCAAGTTG TTTTAAAACA ACGGCTATTA AATCAATTAG CAATCATCAA TCCAGAGCTT CCGATAGAGG CACTGAATGA TGTGGTTAAC ACGGTATCTT CCCCCGATAC GCCAATTCTT ATTAAAAACA ACCGTGTATT CCACAAGTTT GTGATTGAAG GGGTGCCTGT CGAATACACG GCAGTTGAAG ACGGCGAATC GAAAACTAAG CATACTCACG CTCAGTTAAT GGATTTTACG ACGACTGATA ACAATGAATT CCTTATCGTG AATCAGTTTA CGATTACAGG TACAAAAGGG AATCGCCGTC CCGATGTTTT GGTGTTTATT AACGGCTTGC CCATCTCTGT TATTGAGTTG AAAAATCCAG CCGATGAGCA TGCCGACATT TGGAATGCCT TTAATCAACT TCAAACCTAC AAAGATGAAA TTTCAGATCT GTTTATTTTC AATGAAGCGC TGGTTGTTAG CGATGGTTGG ACTGCCCGTG TTGGCTCATT GACGGCGAAT AAAGAGCGTT TCCTACCATG GAAAACTGTA ACTGGCGAAG ACGATAAACC CTTACTGGAG TTTCAACTAG AAACCATGGT GCGTGGTTTC TTCAAACCAG ATTTGCTACT TGATTATATT CGTTACTTTA TTCTCTTTGA GATCGATAAT GACAACATCA TTAAGAAGAT AGCTGGCTAT CACCAGTTTC ATGCTGTACG TGCTGCGGTA GAAGCGACCG TTAAAGCTAA ACAAGCAGAT AGCGATTTAC CTATGGTTGC TGAGAGTATA GGCAAATATC AGGTTCAAGC GACAAAAGGC TTAGATAAGA TTAAAGCGGG TAGCGGTAAG GCTGGAGTGG TTTGGCATAC TCAGGGCAGT GGTAAGAGTA TCTCGATGGT GTGCTACGCC AGTAAGCTGC TGCAACAGCC TGCAATGAAT AACCCCACTA TTGTGGTGGT GACAGACCGT AATGATCTTG ATGGTCAGCT TTACAATACT TTTGGTATGG CACAAGAGAC ATTAAAGCAG ATCCCACAAC AGGCAGATGG TCGAGACTTG TTGCGTCAAT TACTGTTAAA TCGTCAATCG GGTGGGATCA TTTTTACCAC CATTCAAAAG TTTGCATTAT TAGACGATGA AATTGAGCAT CCTATCCTGT CAACGCGTGC CAACATTGTA GTGGTGTCTG ATGAAGCGCA CCGCAGCCAA TATGGCAATA AGTCGAAGTT AGTCGAGGTT AAAGACGCCA ATGGCAATGT GACAGGCTCT AAGTATGTCT TTGGTTACTC AAAATACATG CGCGATGCGC TGCCAAATGC TTCATTCATT GGCTTCACAG GCACTCCGAT TGCCATGGAT GATAAAGATA CACGTGGTGT GTTTGGCGAG TATGTCTCTG TCTATGATAT TCAAGATGCG GTAGATGATG GTGCCACAGT ACCAATCTAC TATGAATCTC GTTTAGCTAA GCTTGATATC AATCAAGCTG AGATTGAGGA GCTTAACGAC CAAGTCGAAG ATGAAATCGG TGAGGAAGAA GAGACAGCTA GCCGTGAACA AATCAAATCA CAATGGGCAA CACTAGAAAA GCTCGTTGGT GCTGAGCCTC GTATTCAGCA AGTCGCTCTC GATTTGGTGA ACCATTTTAG CACTCGCAGC GAGACGTTTC CCGGCAAAGC CATGATTGTA 
GCGATGAGCC GTGAGATCTG TGTCGATCTT TACAATGCTA TTGTCGCAAT AAAGCCTGAA TGGCATCACC CCGATCCCGA TAAAGGCGCA ATCAAAATTG TGATGACGGG CAGCGCCTCT GACAAAGAGA AGATGCAGCC TCATATTCAT GATAAGAAGA CCAAAAAGCT GTTTGAAAAA CGCTATAAAG ATACCCAAGA CGAACTTCAA TTGGTGATTG TGCGTGATAT GTGGCTAACA GGCTTCGATG CGCCATGTTG TCATACCATG TATATCGATA AGCCGATGAA GGGGCACAAT TTGATGCAGG CTATTGCACG CGTAAATCGT GTGTTTAAAG ATAAGCCGGG CGGTTTAGTG GTTGATTATA TCGGTATTGC CAATGAGCTA AAAAATGCCC TTAAAACCTA TACAAATAGC CAAGGTAAAG GGCAACCAAC AGTTGATACA GCTGAAGCGT TTTCAGTATT AATGGAAAAA ATTGATATTG TGAGAGGTAT GTTTGCCACG CCAGTTGATG GTGCCGTATT TAACTACCGA CCTGAGTTTG AAACGAATGC ATTGCGTCTG TTGCCTGGGG CTGTAAACCA TTTATCAGGT TTATCTCATC GCAATAGTAA AGGTGAAGAG GTACGAGATG GTAAACGGCG CTTCTTGGAT GTAATGGCGG CAATGACTAA AGCGTTTTCT CTATGCAATA CCATGGATGA AACCCAAGGT TATAAAAACG AAATAGCTTT CTACTCTGCG ATTAAAACCG CTTTTATCAA ACATTCAACA GTTGATAAAA AACGCAGTGA TGAAGAACGT AACACAGCAT TAAAGCAGAT ATTAGATAAC GCAATTATTG CTGATGGTGT TGATGACATC TTTAATATGG TGGGGTTAGA TAAACCTAAT ATTGGTTTAC TCTCTGAGGA GTTCTTGGAA GATGTGAAGA ACATGAAAGA GAAGAACCTT GCCGTTGAGT TACTAGAGAA GTTGCTGCGT GATGAGGTGA AAGCGCGGAT GAAGAATGAT GTGGTGCAGG AGAAGAAGTA TTCTGAGCGC ATCATGACAA CACTGCAAAA GTATCATAAT CGCAGTATTG AAACGGCTCA GGTTATTGAA GAACTAATCA AATGGGCGAA AGAGATGGCT GCTGATGCTG AGATGGCGGA AGGTCTAAAT CTATCTCCTG ATGAAATTGC TTTTTATCGA GCATTGGTCA TGAACGAAGC TTCAGTGCGA GAACTGGGTG ATGATAGTTT GCGTGGTTTA GCTATCGAGT TAACTCAACA GCTGCGTAAG TCAGCTACGG TAGACTGGCA AAAGCGTGAT AGTGTTCGAG CACGTATGCG TAATTTAGTT CGTCGTCTGC TGCGCCGCTG GAAGTATCCA CCTGACGCCG CAGAAGCTGC GATTAAGTTA GTGCTTGAGC AAGCAGAAGT ACTGGCTGAT GACTGGTATA GGTAA
|
Protein sequence | MITEDQLEQE CIRWFTDQGY LYKNGYDIAP DGDSPERDDY HQVVLKQRLL NQLAIINPEL PIEALNDVVN TVSSPDTPIL IKNNRVFHKF VIEGVPVEYT AVEDGESKTK HTHAQLMDFT TTDNNEFLIV NQFTITGTKG NRRPDVLVFI NGLPISVIEL KNPADEHADI WNAFNQLQTY KDEISDLFIF NEALVVSDGW TARVGSLTAN KERFLPWKTV TGEDDKPLLE FQLETMVRGF FKPDLLLDYI RYFILFEIDN DNIIKKIAGY HQFHAVRAAV EATVKAKQAD SDLPMVAESI GKYQVQATKG LDKIKAGSGK AGVVWHTQGS GKSISMVCYA SKLLQQPAMN NPTIVVVTDR NDLDGQLYNT FGMAQETLKQ IPQQADGRDL LRQLLLNRQS GGIIFTTIQK FALLDDEIEH PILSTRANIV VVSDEAHRSQ YGNKSKLVEV KDANGNVTGS KYVFGYSKYM RDALPNASFI GFTGTPIAMD DKDTRGVFGE YVSVYDIQDA VDDGATVPIY YESRLAKLDI NQAEIEELND QVEDEIGEEE ETASREQIKS QWATLEKLVG AEPRIQQVAL DLVNHFSTRS ETFPGKAMIV AMSREICVDL YNAIVAIKPE WHHPDPDKGA IKIVMTGSAS DKEKMQPHIH DKKTKKLFEK RYKDTQDELQ LVIVRDMWLT GFDAPCCHTM YIDKPMKGHN LMQAIARVNR VFKDKPGGLV VDYIGIANEL KNALKTYTNS QGKGQPTVDT AEAFSVLMEK IDIVRGMFAT PVDGAVFNYR PEFETNALRL LPGAVNHLSG LSHRNSKGEE VRDGKRRFLD VMAAMTKAFS LCNTMDETQG YKNEIAFYSA IKTAFIKHST VDKKRSDEER NTALKQILDN AIIADGVDDI FNMVGLDKPN IGLLSEEFLE DVKNMKEKNL AVELLEKLLR DEVKARMKND VVQEKKYSER IMTTLQKYHN RSIETAQVIE ELIKWAKEMA ADAEMAEGLN LSPDEIAFYR ALVMNEASVR ELGDDSLRGL AIELTQQLRK SATVDWQKRD SVRARMRNLV RRLLRRWKYP PDAAEAAIKL VLEQAEVLAD DWYR