Gene Shewana3_4244 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewana3_4244 
Symbol 
ID4475968 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. ANA-3 
KingdomBacteria 
Replicon accessionNC_008573 
Strand
Start bp117132 
End bp120386 
Gene Length3255 bp 
Protein Length1084 aa 
Translation table11 
GC content46% 
IMG OID639720601 
ProductHsdR family type I site-specific deoxyribonuclease 
Protein accessionYP_863759 
Protein GI117676183 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00104553 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATCACAG AAGACCAACT AGAGCAGCTC TGCCTTGACT GGTTTACAGA CCAAGGTTAT 
CTATACAAAA ATGGCTACGA CATCGCCCCT GATGGCGATT CACCAGAGCG TGATGATTAC
CACCAAGTCG TGCTGAAGCA GCGCTTGCTT GAGCGTTTAG AGGTATTAAA CCCTGAACTA
CCACAAGAGA GCTTGGATGT GGTGGTGAAT ACGGTAAGTA CGCCAGACAC GCCAGTACTG
ATTAAAAGCA ACCGCGGTTT CCACAAATAC CTGATTGAAG GTGTACCTGT TGAGTACACC
GTGTATGAAG ACGGGGAAAC GAAAACCAAG CACACCCACG CACGCTTAAT GGATTTTACC
AATCCAGATA ACAACGAGTT TCTGGTGGTG AATCAGTTCA CCATTACAGG AAAAAAAGGC
AACCGTCGCC CTGATGTAGT GGTGTTCATT AATGGTTTGC CGATTGCGGT TATCGAGCTG
AAAAACCCAG CCGACGAACA TGCGGACATT TGGCATGCTT ACAACCAACT GCAAACCTAC
AAAGATGAAA TTCCTGATCT GTTTGTTTTC AACGAATCTT TGGTGGTGAG TGATGGTTGG
ACGGCGCGTG TTGGCTCTCT CACCGCAAAC AAAGAACGTT TTTTACCATG GAAAACTGTT
TCTGGTGAAG ACGATAAACC GCTCTGGGAT TATCAGCTTG AAACCATGGT TAAGGGTTTC
TTCAAGCCTG ATTTACTGCT TGATTACATT CGTTACTTTG TCTTGTTTGA AACTGACAAC
GACACTGTCA TCAAGAAAAT TGCTGGTTAC CACCAGTTCC ATGCCGTGCG AGCGGCAGTC
GAAGCCACAG TGCGAGCGAA ACAGGCCGAT GGCGAACTGC CTTTAGTCGC TGAAAGCATG
GGGCAATACC AAGTTCAAGC GACAAAAGGC TTAGATAAAA TCAAGCCCGG AAGTGGTAAA
GCGGGTGTGG TTTGGCATAC GCAAGGCAGT GGTAAAAGTA TCTCTATGGT TTGCTACGCA
AGCAAACTAT TGCAACAGCC TGCGATGAAC AACCCAACGA TTGTGGTGGT GACAGACCGT
AATGATCTTG ATGGTCAGCT TTTTAATACC TTTGGTATGG CGCATGAAAC GCTTAAACAA
GTGCCATCGC AAGCGGACGG GCGTGATAAG CTGCGTGAGC TGCTGCTCTC TCGTCAATCT
GGCGGCATCA TCTTTACCAC CATTCAAAAA TTTGCCCTAC TTGCTGAAGA AACAGAGCAC
CCAGTGTTAT CAACGCGTGC CAACATCGTT GTGGTATCGG ATGAAGCGCA CCGCAGTCAG
TACGGTAACA AATCAAAGTT AGTGGACGTT AAAGACGCTA ATGGCAATGT CATCTCACAA
AAGTTTGTTT ATGGTTACTC CAAATACATG CGTGATGCGC TGCCAAATGC ATCGTTCATT
GGCTTTACGG GAACACCCAT TGCCATGGAC GACAAAGACA CTCGTGGCGT ATTTGGTGAG
TACGTTTCTA TCTATGATAT TCAAGATGCG GTCGATGATG GTGCTACGGT GCCAATCTAC
TATGAGTCTC GCTTAGCCAA GCTTGATATC AACCAAGCTG AAATCGAGCA GCTTAATGAC
CAAATCGAAG ATGAGATCGG TGAGGATGAA GAGACTGCGA GCCGTGAGAA AATCAAATCT
CAATGGGCAA CGCTTGAAAA GCTCGTCGGT GCTGAGCCAC GTATCCAACA AGTAGCTCAG
GACTTAGTGA ATCACTTTAC CACGCGTACC GAAACGTTTC CCGGCAAAGC CATGATTGTG
GCGATGAGTC GTGAAATCTG TGTCGATCTC TACAATGCGA TTGTTGCTAT CAAGCCAGAG
TGGCACAACG ATGACCCTGC TAAAGGGGCT ATCAAGATAG TCATGACAGG AAGCGCATCT
GATAAACCTA AGATGCAGCC GCACATCCAT GATAAGAAAA CCAAGAAGCT CTTTGAGGCG
CGATATAAAG ATACTCAAGA TGAACTTCAA TTGGTGATTG TTCGTGATAT GTGGCTAACA
GGCTTTGATG CGCCGTGTTG CCATACCATG TATGTCGATA AGCCAATGAA AGGTCACAAC
CTGATGCAGG CTATCGCACG CGTAAACCGC GTTTTTAAAG ACAAACCGGG CGGTCTGGTG
GTTGACTATA TCGGCATTGC TAATGAGCTT AAAAACGCAC TCAAAACCTA CACCAATAGC
CAAGGTAAAG GGCAACCAAC CGTCGATACT GCTGAAGCCT TTTCGGTACT GATGGAAAAG
GTTGATATCG TTCGTGGCAT GTTTGCGACT CCAGTTGATG GCAAGGTTTT TAACTATCGT
CCTGACTTTG AAACTGATGC GCTGCGTTTA CTGCCCGGTG CGGTAAACCA TCTATCAGGT
TTGTCACACA CAGATAGCAA AGGCAAAGAA GTACGCGATG GCAAACGTCG ATTCCTCGAT
GTGATGGCTG CGTTAGGTAA AGCGTATTCA TTGTGTAATA CCATGGATGA AACGCGTGGT
TACAAGAACG AAATTGCCTT CTACTCTGCG ATTAAAGCCG CGTTTATGAA GCACTCTACC
GTTGATAAAA AGCGTACGGA TGAAGAGCGC AATACGGCTT TGAAGCAGAT CCTCGACAAT
GCTGTGATCG CTGATGGTGT TGATGATATT TTCTCAATGG TTGGTTTAGA TAAGCCTAAC
ATCGGTTTGC TTTCTGAAGA GTTCTTAGAA GACGTTAAGA ACATGAAAGA AAAGAACCTT
GCTGTTGAGC TGCTTGAGAA GCTGCTGCGT GATGAAGTCA AAGCTCGGAT GAAGAACGAT
GTGGTGCAAG AGAAAAAATA CTCAGACCGC ATTTTGACGA CATTAAACAA GTACCACAAC
CGTAGCATCG AAACAGCTCA GGTCATTGAA GAGCTAATTC AATGGGCAAA AGAGATGCAA
GAAGATGCGG AGATGGTAGA TAAGCTCAAC CTTTCTGTGG ATGAGATGGC GTTCTACCGT
GCTTTGGTTG AAAACGAGTC CTCAGTTCGG GAGCTTGGCA ACGATGATTT ACGCAACTTA
GCGATTGAAC TTACTCAGCA ACTTCGTAAA TCTGCAACGG TTGACTGGCA AAAGCGCGAA
AGCGTTCGCG CTCGTATGCG TAACCTAGTA CGCCGACTGC TGCGTCGATG GAAATATCCA
CCTGATTCCG CAGAGGAAGC CATCAAGCTT GTGTTAGAAC AAGCAGAGGT TCTAGCTGAC
GGCTGGTATA AGTAA
 
Protein sequence
MITEDQLEQL CLDWFTDQGY LYKNGYDIAP DGDSPERDDY HQVVLKQRLL ERLEVLNPEL 
PQESLDVVVN TVSTPDTPVL IKSNRGFHKY LIEGVPVEYT VYEDGETKTK HTHARLMDFT
NPDNNEFLVV NQFTITGKKG NRRPDVVVFI NGLPIAVIEL KNPADEHADI WHAYNQLQTY
KDEIPDLFVF NESLVVSDGW TARVGSLTAN KERFLPWKTV SGEDDKPLWD YQLETMVKGF
FKPDLLLDYI RYFVLFETDN DTVIKKIAGY HQFHAVRAAV EATVRAKQAD GELPLVAESM
GQYQVQATKG LDKIKPGSGK AGVVWHTQGS GKSISMVCYA SKLLQQPAMN NPTIVVVTDR
NDLDGQLFNT FGMAHETLKQ VPSQADGRDK LRELLLSRQS GGIIFTTIQK FALLAEETEH
PVLSTRANIV VVSDEAHRSQ YGNKSKLVDV KDANGNVISQ KFVYGYSKYM RDALPNASFI
GFTGTPIAMD DKDTRGVFGE YVSIYDIQDA VDDGATVPIY YESRLAKLDI NQAEIEQLND
QIEDEIGEDE ETASREKIKS QWATLEKLVG AEPRIQQVAQ DLVNHFTTRT ETFPGKAMIV
AMSREICVDL YNAIVAIKPE WHNDDPAKGA IKIVMTGSAS DKPKMQPHIH DKKTKKLFEA
RYKDTQDELQ LVIVRDMWLT GFDAPCCHTM YVDKPMKGHN LMQAIARVNR VFKDKPGGLV
VDYIGIANEL KNALKTYTNS QGKGQPTVDT AEAFSVLMEK VDIVRGMFAT PVDGKVFNYR
PDFETDALRL LPGAVNHLSG LSHTDSKGKE VRDGKRRFLD VMAALGKAYS LCNTMDETRG
YKNEIAFYSA IKAAFMKHST VDKKRTDEER NTALKQILDN AVIADGVDDI FSMVGLDKPN
IGLLSEEFLE DVKNMKEKNL AVELLEKLLR DEVKARMKND VVQEKKYSDR ILTTLNKYHN
RSIETAQVIE ELIQWAKEMQ EDAEMVDKLN LSVDEMAFYR ALVENESSVR ELGNDDLRNL
AIELTQQLRK SATVDWQKRE SVRARMRNLV RRLLRRWKYP PDSAEEAIKL VLEQAEVLAD
GWYK