Gene SeD_A4929 details


Gene Information

Locus tag: SeD_A4929
Symbol:
ID: 6875413
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 4762682
End bp: 4765948
Gene Length: 3267 bp
Protein Length: 1088 aa
Translation table: 11
GC content: 54%
IMG OID: 642787803
Product: hypothetical protein
Protein accession: YP_002218396
Protein GI: 198245499
COG category: [V] Defense mechanisms
COG ID: [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID: [TIGR00348] type I site-specific deoxyribonuclease, HsdR family
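
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4762682
END_BP = 4765948
GENE_LENGTH_BP = 3267
PROTEIN_LENGTH_AA = 1088


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] with 1-based coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming exactly one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))   # 3267
    print(coding_codons(GENE_LENGTH_BP))   # 1088
```

Both results agree with the Gene Length and Protein Length fields listed in the record.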


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.511306
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 61
Fosmid unclonability p-value: 0.806169
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATCTCT CATTCAGTGA AGCGAAGTTA GAACAGGCGA TAATTGAACT GTTACAGGAT 
CAGGGCTATC AGCACCTGAT CGGCGATAAT GTGCCGCGTT CCAGCCTCGA TCAGGTGATT
ATCGAAGACG ATCTTCGTCA CTACCTCGCC GCACGCTATC AGGCCGACGG CATCACCGAA
GAGGAGATCC AGCGGCTGAT CAAACAGTTC ACCACCCTGC CCGCCTCCGA TCTCTATGAA
AGCAACAAAA CCTTCTGCGC CTGGCTGGCC AACGGTTTCC TGTTTAAACA CGACGATCGC
CAGCAGAAAG ATCTCTATAT CGAACTGCTC GACACCCGGC ATCTGCCCGC CGCGCTGCGT
GAACTGTTCG ATACCGAAGA TGTGCCGCTG CAACAGGCCG CAGAAACGCC GGCAAGTTAT
CATCTTATCC AGCCGCTGAA CCGGGTAAAA ATCGTCAATC AGCTGACCAT CACCGGCAAA
GAAAACCAGC CGCGTATCCC GGATGCCATT CTGTATATCA ACGGCCTGCC GTTAGTAGTG
TTTGAATTTA AAAGTGCGGT GCGCGAGCAG GAAGCCAATA TTGGCGATGC CTGGAAACAG
CTCTGCACCC GCTATCGCCG CGATATTCCG CAGCTGTTTG TCTATAACGC GCTGTGCATT
ATCAGCGACG GCGTCAATAA CCGCATGGGC AACCTGTTTG CGCCGTACGA GTATTTCTAT
TCCTGGCGCA AAGTCACCGG CAATGAAAAC CGCGAGCAGA ACGGCATCCC TTCCCTGCAC
TCGATGATTC AGGGGCTGTT TCACCCGGTC CGCCTGCTGG ACGTGATCAA AAACTTTATC
TGCTTCCCGG ATAAAGCCAA ACACGAAGTG AAAATCTGCT GCCGCTACCC GCAGTACTAT
GCGGCACGCA AGCTGTATTA CAGCATCGAA CGGGCGCGTA AACCTTTCGG CGACGGCAAG
GGCGGCACCT ACTTTGGCGC AACCGGCTGC GGCAAGAGCT ACACCATGCA GTTCTTAACC
CGCCTGCTGA TGAAAAGCGT CGAATTCGCC AGCCCAACGA TCGTGCTGAT CACCGACCGC
ACCGATCTGG ACGATCAGCT CTCAGCGCAG ATGTGCAATG CCAAAAACTA TATCGGCGAC
GACACCGTGG TGCCCGTCAC CAGCCGCGAC GATCTGCGGG AGAAGCTGGC CGGTCGCAAC
AGCGGCGGCG TCTTCCTGAC CACCATCCAT AAATTTACCG AAGATACCGA ACTGCTCTCC
GAGCGCAGCA ATATCATCTG TATTTCCGAT GAAGCGCACC GCAGCCAGGT CAACCTCGAT
CAGAAAGTGA TCGTCGATAA AGAGAGCGGC AGGGTGCGTA AAACCTACGG TTTCGCCAAA
TATCTGCACG ACTCGCTGCC GCTGGCCACC TACGTTGGTT TTACCGGCAC CCCTATCGAC
GCCACGATGG ACGTCTTTGG CGTTATCGTG GACAGCTACA CCATGACTGA ATCCGTGCAG
GATGAGATCA CCGTGCGCAT CGTCTACGAA GGCCGCGCCG CGAAGGTCAT CCTCGATGCC
GGTAAGCTGG AAGAGGTGGA AAAGTATTAC GAAGAGTGCG CCAATGCGGG CACCAACGAG
TGGCAAATCG ACGAGAGCAA AAAAGCCAGC GCCACCATGA ACGCCATTCT TGGCGATGAA
GACCGCCTGA AAGCGCTGGC GGAAGACTTT GCGAAACACT ATGAAAAACG TGTCGCCGAA
GGCTCAACGG TGAAAGGCAA AGCGATGTTT GTCTGCGCCA GCCGCGAGAT TGCCTGGGAT
TTCTATCGTC AGTTGAAAGA CTTTCGTCCC GCGTGGTTTG AAGTTAAGCA GGCCCCGGAG
GGGGTTGAGC TGACCGGGCA GGAGGAGAAA GAGCTGCCGC CGTCCGAAAT GGTCAAGATG
GTGATGACAC GCGGTAAAGA CGATGACGCG AAACTGTACG ATTTGCTGGG CAGTAAAGAG
TACCGCAAAG AGCTGGATAA ACAGTTCAAA AACGCGAAGT CGAATTTCAA AATCGCCATC
GTGGTGGACA TGTGGCTGAC CGGTTTTGAC GTGCCGGAGC TGGACACCAT TTATATCGAT
AAGCCGCTGC AAAAGCATAA CCTGATCCAG ACCATCTCCC GCGTTAACCG CAAGATGGAA
GGCAAAAGCA AAGGGCTGGT CGTCGACTAT ATCGGCATTA AGCGCCAGAT GAACCAGGCG
CTGGCGATGT ATTCGCGCAT TGACGCCACC AACTTTGAAG ATATTCAGCA GTCAGTCATT
GAGGTGAAAA ATCATCTCGA TCTGCTGGCG CAGGTGTTCC ATGAATTCGA CAGCCGCCCT
TATTTCAGCG GCGAGCCGCA GGCGCAGCTT GCCTGCCTGA ACTTCGCGGC AGAATTTGTC
ATGCGCACCC AGAAGCTGGA ACGCCGCTTT ATGGGGCTGG TGAAGCGGCT GAAAGCGGCC
TATGACGTCT GCTGCGGCAG CGAGGCGCTG TCGCAGGCCG AGCGCGACCA TATCCATTTT
TACATTGCCG TGCGCTCTAT CGTCTTCAAG CTGACCAAAG GCGACGCGCC GGACCTCACA
CAGATGAACG CTCGCGTGCG CGAGATGATC GCCGAAGCGC TGAAAGCCGA CGGCGTGGAA
GAGCTATTTT TCCTCGGTGA CAAGAAAGCG GAGTCGATCG ATATTTTCGA CGATGATTAT
CTGGCGCGCA TTAATAAGAT CAAACTGCCG GCCACCAAAA TTCAGCTGCT GCAAAAGCTG
CTGGAAAAAG CCATCAGCGA CTTTAAACAG ATCAACCAGC TTCAGGGGAT CAATTTCACC
CGTCGTTTCC AGTCCATTAT GGATAAATAT AACGAGCGCC GGGAAGATGA CGTCCTCAAC
GGTGAAGAGT TTGATAACTT CAGCCAGGAG ATGACCGATA TTATCTACGA TATTAAAACC
GAGATGGGCA CCTATGCCGA GATGGGGATT GATATCGAAG AGAAAGCCTT TTACGACATT
CTGGCGCATA TGCGTGATAA GTATCAGTTT ACCTATGAGG ACGACAAGAT GCTGGCGCTG
GCAAAAGAGA TGAAGGCGGT GGTCGATAAC ACCTCGAAGT ATCCGGACTG GAGCAAGCGT
GATGATATTA AGGCGAAGCT GAAGGTGGAG CTAATCCTGC TGCTGCACAA GCACAAGTTC
CCGCCGGTGG CGAATGATGA TGTGTATATG GGCGTGCTGT CGCAGGCGGA GAATTTTAAG
CAGCACCATA GTCATACGCT GCATTAA
 
Protein sequence
MHLSFSEAKL EQAIIELLQD QGYQHLIGDN VPRSSLDQVI IEDDLRHYLA ARYQADGITE 
EEIQRLIKQF TTLPASDLYE SNKTFCAWLA NGFLFKHDDR QQKDLYIELL DTRHLPAALR
ELFDTEDVPL QQAAETPASY HLIQPLNRVK IVNQLTITGK ENQPRIPDAI LYINGLPLVV
FEFKSAVREQ EANIGDAWKQ LCTRYRRDIP QLFVYNALCI ISDGVNNRMG NLFAPYEYFY
SWRKVTGNEN REQNGIPSLH SMIQGLFHPV RLLDVIKNFI CFPDKAKHEV KICCRYPQYY
AARKLYYSIE RARKPFGDGK GGTYFGATGC GKSYTMQFLT RLLMKSVEFA SPTIVLITDR
TDLDDQLSAQ MCNAKNYIGD DTVVPVTSRD DLREKLAGRN SGGVFLTTIH KFTEDTELLS
ERSNIICISD EAHRSQVNLD QKVIVDKESG RVRKTYGFAK YLHDSLPLAT YVGFTGTPID
ATMDVFGVIV DSYTMTESVQ DEITVRIVYE GRAAKVILDA GKLEEVEKYY EECANAGTNE
WQIDESKKAS ATMNAILGDE DRLKALAEDF AKHYEKRVAE GSTVKGKAMF VCASREIAWD
FYRQLKDFRP AWFEVKQAPE GVELTGQEEK ELPPSEMVKM VMTRGKDDDA KLYDLLGSKE
YRKELDKQFK NAKSNFKIAI VVDMWLTGFD VPELDTIYID KPLQKHNLIQ TISRVNRKME
GKSKGLVVDY IGIKRQMNQA LAMYSRIDAT NFEDIQQSVI EVKNHLDLLA QVFHEFDSRP
YFSGEPQAQL ACLNFAAEFV MRTQKLERRF MGLVKRLKAA YDVCCGSEAL SQAERDHIHF
YIAVRSIVFK LTKGDAPDLT QMNARVREMI AEALKADGVE ELFFLGDKKA ESIDIFDDDY
LARINKIKLP ATKIQLLQKL LEKAISDFKQ INQLQGINFT RRFQSIMDKY NERREDDVLN
GEEFDNFSQE MTDIIYDIKT EMGTYAEMGI DIEEKAFYDI LAHMRDKYQF TYEDDKMLAL
AKEMKAVVDN TSKYPDWSKR DDIKAKLKVE LILLLHKHKF PPVANDDVYM GVLSQAENFK
QHHSHTLH
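
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is a hand-rolled illustration, not the method the database used. It assumes the sequence is given on the coding strand, in frame from the first base, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ in which start codons are permitted). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq) if seq else 0.0


if __name__ == "__main__":
    # First display line of the gene sequence in this record.
    first_line = "ATGCATCTCT CATTCAGTGA AGCGAAGTTA GAACAGGCGA TAATTGAACT GTTACAGGAT"
    print(translate(first_line))  # MHLSFSEAKLEQAIIELLQD
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give the value that the GC content field reports as a rounded percentage.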