Gene Shewana3_3790 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewana3_3790 
Symbol 
ID4480000 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. ANA-3 
KingdomBacteria 
Replicon accessionNC_008577 
Strand
Start bp4553289 
End bp4556450 
Gene Length3162 bp 
Protein Length1053 aa 
Translation table11 
GC content49% 
IMG OID639728403 
ProductHsdR family type I site-specific deoxyribonuclease 
Protein accessionYP_871414 
Protein GI117922222 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAATA ACGAATTCGA CAAAGTAGAG CAACCCGCCA TCGCTCAGCT GCACCAGCTA 
GGTTGGCATT ATATTCCCGG TAAAGCGCTG GCGCCGGAGC ACAGTAATCA AGAACGTGAT
TACTTGCGTG ATGTGGTATT GGTGAAGCGC TTAACGCAGG CGATTGAGCG GATTAATCCA
TGGATTAGTG AGGAGAACCT ACGCAAAGTG GTGCGCATGG TGACGCACCC GCAATGCGCT
GGTTTGATGG AGTATAACCA CGGTTTTTAT CAAACCATGG TGGGCTATCT CTCGGTCGAA
CAAGACCTTG GCAAGGGCAA GAAAGGCCAA ACGGTTAAGC TGATTGATTT TGATGATGTT
AGCAATAACG AATTTATCTG CACCAATCAG TTTAAGGTGG AAGGCACTAA GCAAACCATC
ATTCCTGACA TTGTCTGTTT TGTGAATGGC CTGCCGCTGG CGGTGATTGA GTGTAAATCG
CCGTTTATCT CTACGCCGGT TGTGTGTGGC ATTGACCAAC TGCGGCGCTA CGCTAATCAG
CGCCATCCTG AAGATTTTGA AGGCGCAGAA AAGCTGTTTT GGTATAACCA ACTGATGGTA
TCTACTAGCC GCGATATCGC CCGCGTGGGC ACCATTAGCT CGCCTGCACA GTATTACGCC
GATTGGAAAG ATGCTTATCC GCTGACTGAT GCGCAGATTG CCCAAGTCGA TAGCGATAAC
GTGGTGCCAT TAGCTAAAAA GTTAACGGTA GAGCAGCCTG TCGCCAGCGT TGACGCTGCG
GGTAGTGAGC TGCCACAAGT GGCTGAGCCA CCAGCAGATT ACCAAGTAGC AAAGGTAACC
GCGCAGCAAC GGCTACTGGC GGGGATGTTC AGCCGTGAGA ACTTTCTAGA TATCGTGCGT
AACTTCATTC TGTTTGAACC TGATGATGGC CGCTTAATTA AGAAAGTGGC GCGTTATCAG
CAGTTCCGCG CGGTGAATAA AGTGATTCAT CAATTAAAGA CCGGCAAAAC CCGCAAGGAT
AAGTCTGGCG TAGTATGGCA TACCCAAGGC AGTGGTAAGT CGCTGACGAT GGTGATGCTG
GCGGTAAAAA TGCGCCATGA CCTCGAGCTT AAGCAATACA AGCTGGTCTT TGTGACTGAC
AGAACTCAGT TAGATGAGCA GTTGTCTGCC ACCTTTCAAG GGGCCCAAGG CGAAACCGTG
TATAACGCGG ATTCGGTTGC CGCGCTGCGT GACCTGCTGA AAAAAGATAG CTCAGACTTG
GTGACTGCCA TGGTGCAGAA GTTCCAAGAC GCAGAGAAAG AGGCGCAGCA ACAGGGCGAC
AAGAGCTTTG TTAACATCAA TACCTCCGAC AAGATTATTG TGCTGGCGGA CGAGGCGCAC
CGTACTCAGT TTGGTGGCTT AGCGACGACC ATTAACGCCG CTTTGCCGAA CGCGCCCAAA
ATCGGTTTTA CCGGCACACC ATTGTTGAAA ACCCAAAAGA TGGATACCGC CTTTGGTGGC
TACATTGATA AATATAAAAT CAATGAGGCC GTTGAGGACG GCGCAACGGT AAGGCTGTTG
TATGAAGGTC GTGAGGTAAA AACCGCTGTG GCGGGCGAAT CGCTTGATGC CTTGTTTGAA
GAGTATTTTG GCGATTACAC CAAAGAAGAA CAGCAAGAGA TTAAACGTAA ATACGGCGTT
GAAAAGGCGG TACGAGAAGC GCCAGCGCGC ATTCGTTGGG TCTGTATCGA CCTGTTAAAA
CATTACCGTG AGCGCATTCG GCCTGATGGA TTTAAGGCGA TGATCGTCGT TGGCAGTCGC
CACGCGGCAA CCCTCTTCAA AAAGACCTTA GACGAGTTAA AGGCGCCGCC ATCAGAGGTG
ATTATCTCCG GTGATCATAA TGACCCGCAG TACATTGCCC AATACACCGA TAAGGTTCAC
CAGAAGAAGG TTATCGCTAA CTTTAAGAAG CCGTTTGGTA TAGATTCACC GGGCACAGAT
GAGAAACAGC GCAAGTTTGA TAATACCGCC TTTTTGATCG TGAAAGACAT GCTGCTCACC
GGCTTTGATG CGCCGATTGC GCAGGTGATG TATATCGACC GTAAGCTACA AGACCATACT
TTGATGCAGG CCATTGCGCG GGTAAACCGA AACTACAAAG GCAAAGAGTG CGGTTATGTG
GTGGACTACT ACGGTTTAAC CAGTTATCTC ACCGATGCAT TAGAGCTGTT TAGTAGCGAT
GATGTCGAAG GCACCTACCA AAGCCTTAAA GATGAGTTGC CTAAACTGCA AGCTGCGCAT
ACGCGGCTTA AGGCGTTTTT CAAAGGCGTG ACCAGTGATG ACGTTGATGA TTATGTTGTC
GTCCTGAAAG ACGATGCTTT CCGGGCGCAG TTTGAAGTGG CTTACAAGCG TTTTGCTAAG
CAATTGAACG TGATTCTGCC TGATGCTTCA GCTAAGCCTT ATATCCCGGA TATGACCTTC
TTTGGCAAGG TGCATCATGC GGCGCGCAAT CGTTATCGTG ACGATGGTAT GGATTTGCTG
GATGTGGGCG AGAAGGTGCG CCAGCTGGTG GACGAGCACA TTTTGAGTAC CGGCGTAGAC
CCGAAAATTC CACCGGTAGA TTTACTGGCC GCTAACTTTA AAGAGCATGT GCAAAAGATT
AAATCGCCTG AGTCAAAGGC CTCAGAAATT GAGAGTGCGC TTAAGCATCA CATCAATATC
AATTTAGAGG ATGACCCCGA GTATTACCGT TCGTTGAGCT TGCGCTTACG CGATATTATT
GAGAAGACGG CGGGTAACTG GGAGCGCCAG CTAGAGCTGT TGCTGGAAAT GACCAATGAC
TTGAGCACTG CACATAAGCA AGCGGCGCAA AATGTTGGCC TGAGTGAAAC CGAGTTCGCC
TTTTACAATA TTCTGTTGGC TGAAGTCACT CGTATCCATG ACGGCGATAT GATTGATGAA
GCATTGCATG AGCAAATTAA GTCAACGACA CAGGCGTTAG TACAAGTATT GGATGAAGCG
ACGCGGATCG TCGATTTCTT CAATAAAGAA GACGAAATCA AACGGATGAA GAAAAAGATT
AAGCGAGTGA TATTAGATCA GCCTTTTGGA GATAAAGCGT TGGTAGCAAC TCTGCAAGAC
CGCTTTGTTG AACTGGCTAA GCACAAGTTT GGCAACAAGT GA
 
Protein sequence
MSNNEFDKVE QPAIAQLHQL GWHYIPGKAL APEHSNQERD YLRDVVLVKR LTQAIERINP 
WISEENLRKV VRMVTHPQCA GLMEYNHGFY QTMVGYLSVE QDLGKGKKGQ TVKLIDFDDV
SNNEFICTNQ FKVEGTKQTI IPDIVCFVNG LPLAVIECKS PFISTPVVCG IDQLRRYANQ
RHPEDFEGAE KLFWYNQLMV STSRDIARVG TISSPAQYYA DWKDAYPLTD AQIAQVDSDN
VVPLAKKLTV EQPVASVDAA GSELPQVAEP PADYQVAKVT AQQRLLAGMF SRENFLDIVR
NFILFEPDDG RLIKKVARYQ QFRAVNKVIH QLKTGKTRKD KSGVVWHTQG SGKSLTMVML
AVKMRHDLEL KQYKLVFVTD RTQLDEQLSA TFQGAQGETV YNADSVAALR DLLKKDSSDL
VTAMVQKFQD AEKEAQQQGD KSFVNINTSD KIIVLADEAH RTQFGGLATT INAALPNAPK
IGFTGTPLLK TQKMDTAFGG YIDKYKINEA VEDGATVRLL YEGREVKTAV AGESLDALFE
EYFGDYTKEE QQEIKRKYGV EKAVREAPAR IRWVCIDLLK HYRERIRPDG FKAMIVVGSR
HAATLFKKTL DELKAPPSEV IISGDHNDPQ YIAQYTDKVH QKKVIANFKK PFGIDSPGTD
EKQRKFDNTA FLIVKDMLLT GFDAPIAQVM YIDRKLQDHT LMQAIARVNR NYKGKECGYV
VDYYGLTSYL TDALELFSSD DVEGTYQSLK DELPKLQAAH TRLKAFFKGV TSDDVDDYVV
VLKDDAFRAQ FEVAYKRFAK QLNVILPDAS AKPYIPDMTF FGKVHHAARN RYRDDGMDLL
DVGEKVRQLV DEHILSTGVD PKIPPVDLLA ANFKEHVQKI KSPESKASEI ESALKHHINI
NLEDDPEYYR SLSLRLRDII EKTAGNWERQ LELLLEMTND LSTAHKQAAQ NVGLSETEFA
FYNILLAEVT RIHDGDMIDE ALHEQIKSTT QALVQVLDEA TRIVDFFNKE DEIKRMKKKI
KRVILDQPFG DKALVATLQD RFVELAKHKF GNK