Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A4929 |
Symbol | |
ID | 6875413 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 4762682 |
End bp | 4765948 |
Gene Length | 3267 bp |
Protein Length | 1088 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642787803 |
Product | hypothetical protein |
Protein accession | YP_002218396 |
Protein GI | 198245499 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.511306 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 0.806169 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATCTCT CATTCAGTGA AGCGAAGTTA GAACAGGCGA TAATTGAACT GTTACAGGAT CAGGGCTATC AGCACCTGAT CGGCGATAAT GTGCCGCGTT CCAGCCTCGA TCAGGTGATT ATCGAAGACG ATCTTCGTCA CTACCTCGCC GCACGCTATC AGGCCGACGG CATCACCGAA GAGGAGATCC AGCGGCTGAT CAAACAGTTC ACCACCCTGC CCGCCTCCGA TCTCTATGAA AGCAACAAAA CCTTCTGCGC CTGGCTGGCC AACGGTTTCC TGTTTAAACA CGACGATCGC CAGCAGAAAG ATCTCTATAT CGAACTGCTC GACACCCGGC ATCTGCCCGC CGCGCTGCGT GAACTGTTCG ATACCGAAGA TGTGCCGCTG CAACAGGCCG CAGAAACGCC GGCAAGTTAT CATCTTATCC AGCCGCTGAA CCGGGTAAAA ATCGTCAATC AGCTGACCAT CACCGGCAAA GAAAACCAGC CGCGTATCCC GGATGCCATT CTGTATATCA ACGGCCTGCC GTTAGTAGTG TTTGAATTTA AAAGTGCGGT GCGCGAGCAG GAAGCCAATA TTGGCGATGC CTGGAAACAG CTCTGCACCC GCTATCGCCG CGATATTCCG CAGCTGTTTG TCTATAACGC GCTGTGCATT ATCAGCGACG GCGTCAATAA CCGCATGGGC AACCTGTTTG CGCCGTACGA GTATTTCTAT TCCTGGCGCA AAGTCACCGG CAATGAAAAC CGCGAGCAGA ACGGCATCCC TTCCCTGCAC TCGATGATTC AGGGGCTGTT TCACCCGGTC CGCCTGCTGG ACGTGATCAA AAACTTTATC TGCTTCCCGG ATAAAGCCAA ACACGAAGTG AAAATCTGCT GCCGCTACCC GCAGTACTAT GCGGCACGCA AGCTGTATTA CAGCATCGAA CGGGCGCGTA AACCTTTCGG CGACGGCAAG GGCGGCACCT ACTTTGGCGC AACCGGCTGC GGCAAGAGCT ACACCATGCA GTTCTTAACC CGCCTGCTGA TGAAAAGCGT CGAATTCGCC AGCCCAACGA TCGTGCTGAT CACCGACCGC ACCGATCTGG ACGATCAGCT CTCAGCGCAG ATGTGCAATG CCAAAAACTA TATCGGCGAC GACACCGTGG TGCCCGTCAC CAGCCGCGAC GATCTGCGGG AGAAGCTGGC CGGTCGCAAC AGCGGCGGCG TCTTCCTGAC CACCATCCAT AAATTTACCG AAGATACCGA ACTGCTCTCC GAGCGCAGCA ATATCATCTG TATTTCCGAT GAAGCGCACC GCAGCCAGGT CAACCTCGAT CAGAAAGTGA TCGTCGATAA AGAGAGCGGC AGGGTGCGTA AAACCTACGG TTTCGCCAAA TATCTGCACG ACTCGCTGCC GCTGGCCACC TACGTTGGTT TTACCGGCAC CCCTATCGAC GCCACGATGG ACGTCTTTGG CGTTATCGTG GACAGCTACA CCATGACTGA ATCCGTGCAG GATGAGATCA CCGTGCGCAT CGTCTACGAA GGCCGCGCCG CGAAGGTCAT CCTCGATGCC GGTAAGCTGG AAGAGGTGGA AAAGTATTAC GAAGAGTGCG CCAATGCGGG CACCAACGAG TGGCAAATCG ACGAGAGCAA AAAAGCCAGC GCCACCATGA ACGCCATTCT TGGCGATGAA GACCGCCTGA AAGCGCTGGC GGAAGACTTT GCGAAACACT ATGAAAAACG TGTCGCCGAA GGCTCAACGG TGAAAGGCAA AGCGATGTTT GTCTGCGCCA GCCGCGAGAT TGCCTGGGAT TTCTATCGTC AGTTGAAAGA CTTTCGTCCC GCGTGGTTTG AAGTTAAGCA GGCCCCGGAG GGGGTTGAGC TGACCGGGCA GGAGGAGAAA GAGCTGCCGC CGTCCGAAAT GGTCAAGATG GTGATGACAC GCGGTAAAGA CGATGACGCG AAACTGTACG ATTTGCTGGG CAGTAAAGAG TACCGCAAAG AGCTGGATAA ACAGTTCAAA AACGCGAAGT CGAATTTCAA AATCGCCATC GTGGTGGACA TGTGGCTGAC CGGTTTTGAC GTGCCGGAGC TGGACACCAT TTATATCGAT AAGCCGCTGC AAAAGCATAA CCTGATCCAG ACCATCTCCC GCGTTAACCG CAAGATGGAA GGCAAAAGCA AAGGGCTGGT CGTCGACTAT ATCGGCATTA AGCGCCAGAT GAACCAGGCG CTGGCGATGT ATTCGCGCAT TGACGCCACC AACTTTGAAG ATATTCAGCA GTCAGTCATT GAGGTGAAAA ATCATCTCGA TCTGCTGGCG CAGGTGTTCC ATGAATTCGA CAGCCGCCCT TATTTCAGCG GCGAGCCGCA GGCGCAGCTT GCCTGCCTGA ACTTCGCGGC AGAATTTGTC ATGCGCACCC AGAAGCTGGA ACGCCGCTTT ATGGGGCTGG TGAAGCGGCT GAAAGCGGCC TATGACGTCT GCTGCGGCAG CGAGGCGCTG TCGCAGGCCG AGCGCGACCA TATCCATTTT TACATTGCCG TGCGCTCTAT CGTCTTCAAG CTGACCAAAG GCGACGCGCC GGACCTCACA CAGATGAACG CTCGCGTGCG CGAGATGATC GCCGAAGCGC TGAAAGCCGA CGGCGTGGAA GAGCTATTTT TCCTCGGTGA CAAGAAAGCG GAGTCGATCG ATATTTTCGA CGATGATTAT CTGGCGCGCA TTAATAAGAT CAAACTGCCG GCCACCAAAA TTCAGCTGCT GCAAAAGCTG CTGGAAAAAG CCATCAGCGA CTTTAAACAG ATCAACCAGC TTCAGGGGAT CAATTTCACC CGTCGTTTCC AGTCCATTAT GGATAAATAT AACGAGCGCC GGGAAGATGA CGTCCTCAAC GGTGAAGAGT TTGATAACTT CAGCCAGGAG ATGACCGATA TTATCTACGA TATTAAAACC GAGATGGGCA CCTATGCCGA GATGGGGATT GATATCGAAG AGAAAGCCTT TTACGACATT CTGGCGCATA TGCGTGATAA GTATCAGTTT ACCTATGAGG ACGACAAGAT GCTGGCGCTG GCAAAAGAGA TGAAGGCGGT GGTCGATAAC ACCTCGAAGT ATCCGGACTG GAGCAAGCGT GATGATATTA AGGCGAAGCT GAAGGTGGAG CTAATCCTGC TGCTGCACAA GCACAAGTTC CCGCCGGTGG CGAATGATGA TGTGTATATG GGCGTGCTGT CGCAGGCGGA GAATTTTAAG CAGCACCATA GTCATACGCT GCATTAA
|
Protein sequence | MHLSFSEAKL EQAIIELLQD QGYQHLIGDN VPRSSLDQVI IEDDLRHYLA ARYQADGITE EEIQRLIKQF TTLPASDLYE SNKTFCAWLA NGFLFKHDDR QQKDLYIELL DTRHLPAALR ELFDTEDVPL QQAAETPASY HLIQPLNRVK IVNQLTITGK ENQPRIPDAI LYINGLPLVV FEFKSAVREQ EANIGDAWKQ LCTRYRRDIP QLFVYNALCI ISDGVNNRMG NLFAPYEYFY SWRKVTGNEN REQNGIPSLH SMIQGLFHPV RLLDVIKNFI CFPDKAKHEV KICCRYPQYY AARKLYYSIE RARKPFGDGK GGTYFGATGC GKSYTMQFLT RLLMKSVEFA SPTIVLITDR TDLDDQLSAQ MCNAKNYIGD DTVVPVTSRD DLREKLAGRN SGGVFLTTIH KFTEDTELLS ERSNIICISD EAHRSQVNLD QKVIVDKESG RVRKTYGFAK YLHDSLPLAT YVGFTGTPID ATMDVFGVIV DSYTMTESVQ DEITVRIVYE GRAAKVILDA GKLEEVEKYY EECANAGTNE WQIDESKKAS ATMNAILGDE DRLKALAEDF AKHYEKRVAE GSTVKGKAMF VCASREIAWD FYRQLKDFRP AWFEVKQAPE GVELTGQEEK ELPPSEMVKM VMTRGKDDDA KLYDLLGSKE YRKELDKQFK NAKSNFKIAI VVDMWLTGFD VPELDTIYID KPLQKHNLIQ TISRVNRKME GKSKGLVVDY IGIKRQMNQA LAMYSRIDAT NFEDIQQSVI EVKNHLDLLA QVFHEFDSRP YFSGEPQAQL ACLNFAAEFV MRTQKLERRF MGLVKRLKAA YDVCCGSEAL SQAERDHIHF YIAVRSIVFK LTKGDAPDLT QMNARVREMI AEALKADGVE ELFFLGDKKA ESIDIFDDDY LARINKIKLP ATKIQLLQKL LEKAISDFKQ INQLQGINFT RRFQSIMDKY NERREDDVLN GEEFDNFSQE MTDIIYDIKT EMGTYAEMGI DIEEKAFYDI LAHMRDKYQF TYEDDKMLAL AKEMKAVVDN TSKYPDWSKR DDIKAKLKVE LILLLHKHKF PPVANDDVYM GVLSQAENFK QHHSHTLH
|
| |