Gene SNSL254_A2155 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A2155 
Symbol 
ID6485090 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp2078583 
End bp2080013 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content54% 
IMG OID642737509 
ProductDNA cytosine methylase 
Protein accessionYP_002041256 
Protein GI194443436 
COG category[L] Replication, recombination and repair 
COG ID[COG0270] Site-specific DNA methylase 
TIGRFAM ID[TIGR00675] DNA-methyltransferase (dcm) 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value0.0409475 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGAAA ATATTTCAGT AACACACGCC CGGAACCTCA TCGCCGACGA CGCCGGAAGC 
GAGATCCAGG CGATGCTGAG TCAATTGCTG GAAATCTACG ATGTTAAAAC GCTGGTGGCG
CACCTTAACG GCCTGGGCGA ACAGCACTGG AGCCCGGCCA TCTTAAAGCG CGTAATGATG
AACGCGGCAT GGCATCGTTT GAGCGACAAT GAACTCACCT GTCTTAAAAC AGGGTTGCCG
ACGCCGCCAG CGCATCATCC ACATTACGCC TTTCGTTTTA TCGATCTCTT CGCGGGCATC
GGCGGCATTC GCCGCGGATT TGAAGCGATA GGCGGACAGT GCGTGTTTAC CAGCGAATGG
AATAAGCACG CGGTACGGAC ATATAAAGCG AACTATTTTT GCGATCCGCT GCAACATCGC
TTTAATGAAG ATATCCGCGA TATCACGTTG AGCCACCGGG AAGGGGTCAG CGATGATGAG
GCGGCGGAAC ACATTCGCCA GCATATTCCG CAACATGATG TCCTGCTGGC GGGCTTTCCC
TGTCAGCCAT TTTCTCTGGC GGGCGTTTCC AAGAAAAATG CGCTGGGCCG CGCCCACGGC
TTTGCCTGCG AGACTCAGGG GACATTATTT TTTGATGTCG TAAGAATTAT CGACGCCCGC
CGCCCCGCGC TGTTTGTGCT GGAAAACGTG AAAAACCTTA AAAGTCACGA CCAGGGCAAC
ACCTTCCGCA TTATTATGCA AACGCTCGAT GAACTGGGAT ATGACGTGGC GGATGCCGCT
GACAATGGCC CGGACGATCC GAAAATTATC GACGGGCAGC ACTTTCTTCC TCAGCATCGG
GAACGTATTG TGTTGGTGGG ATTCCGTCGC GATTTAAACC TGAAAACCGA TTTTACGTTA
CGCAATATCG CCCGTTGTTA TCCACCGCGC CGTCCGACGC TGGCAGAACT GCTGGAGCCC
GTCGTCGAAG CCAAATATAT CCTGACGCCG GTGCTGTGGA AATATTTATA TCGCTACGCG
AAAAAGCACC AGGCGCGGGG AAACGGTTTT GGCTATGGCA TGGTTTATCC TGACAATCCG
GAAAGTGTGG CGCGCACGTT ATCTGCTCGC TACTACAAAG ATGGCGCCGA AATTCTGATC
GATCGTGGTT GGGATATGGC GAAAGGCGAA GTGAATTTCG ACGATGCTGG CAACCAACAA
CATCGTCCCC GCCGACTCAC GCCGAGAGAG TGCGCGCGTT TAATGGGATT TGAGGCGCCG
CAAACGTACC AGTTCAGGAT ACCTGTCTCG GATACGCAGG CCTATCGCCA GTTTGGCAAC
TCCGTGGTGG TGCCGGTATT TGCTGCGGTA GCAAAGCTGC TGGAACCCAA AATTCACCAG
GCGGTGACGC TGCGTCAGAA AGAGACGGTA GATGGCGGAC GTTCACGATA A
 
Protein sequence
MQENISVTHA RNLIADDAGS EIQAMLSQLL EIYDVKTLVA HLNGLGEQHW SPAILKRVMM 
NAAWHRLSDN ELTCLKTGLP TPPAHHPHYA FRFIDLFAGI GGIRRGFEAI GGQCVFTSEW
NKHAVRTYKA NYFCDPLQHR FNEDIRDITL SHREGVSDDE AAEHIRQHIP QHDVLLAGFP
CQPFSLAGVS KKNALGRAHG FACETQGTLF FDVVRIIDAR RPALFVLENV KNLKSHDQGN
TFRIIMQTLD ELGYDVADAA DNGPDDPKII DGQHFLPQHR ERIVLVGFRR DLNLKTDFTL
RNIARCYPPR RPTLAELLEP VVEAKYILTP VLWKYLYRYA KKHQARGNGF GYGMVYPDNP
ESVARTLSAR YYKDGAEILI DRGWDMAKGE VNFDDAGNQQ HRPRRLTPRE CARLMGFEAP
QTYQFRIPVS DTQAYRQFGN SVVVPVFAAV AKLLEPKIHQ AVTLRQKETV DGGRSR