Gene SNSL254_pSN254_0125 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_pSN254_0125 
Symbol 
ID4929562 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_009140 
Strand
Start bp107495 
End bp109120 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content51% 
IMG OID642572424 
ProductC-5 cytosine-specific DNA methylase 
Protein accessionYP_001101999 
Protein GI134047248 
COG category[L] Replication, recombination and repair 
COG ID[COG0270] Site-specific DNA methylase 
TIGRFAM ID[TIGR00675] DNA-methyltransferase (dcm) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones75 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTACTA TCGTAAACAC CAAGTTAGGA GAACATCGAG GCAAGAAGCG CGTCTGGCTG 
GAAGGCCAAA AGCTACTGCG TGAAGGTTAC TATCCTGGCA TGAAGTATGA TCTGGAGCTG
AAAGATTCCC AGGTTGTGCT TCGCGTCAAA GAGGAGGGTA AGTTCACCAT CAGTAAGCGT
GAGCGCAATG GCCGGGTGTC TCCAATCATC GATCTGACCG TGCAGGAGCT TGCTACCGTT
TTTGACGGTG TAGAGATGCT CCGCGTGTTC ATCCGTAACG GCGCAATTGT GATCTCTGCT
CACCATCAAC AAGAGCGAGT GATCGAGCGC GTCAACCGGC TTATCAGTAA GCTGGAGAAT
GGAGAATCGC TTTCGGTATG CAGCCTCTTT CATGGGGGTG GTGTGCTTGA TAAAGCGATT
CACGCCGGTT TTCACAAGGC GGGAATTGCC AGTGCCATAT CTGTGGCCGT GGAGATGGAA
GGGAAATATC TCGATTCGTC TTTGGCAAAT AACCCGGAGC TTTGGAACGA AGATTCTATT
GTTATCGAAT CGCCAATCCA GGCTGTGAAT CTCAGCAAAC GCCCACCACA GGTGGATGTT
TTGATGGGGG GCATACCGTG TACCGGCGCG TCAAAGTCAG GTCGCAGTAA GAACAAGTTG
GAGTTTGCCG AATCGCATGA GGCGGCAGGT GCCATGTTCT TCAACTTCCT GCAATTCGTA
GAAGCGCTAA ACCCAGCCGT TGTGCTGATT GAGAACGTGC CTGAGTACCA GAACACCGCT
TCGATGGAAG TGATTCGTTC GGTGCTCTCT TCGTTGGGTT ACTCCCTACA AGAGCGCATT
CTCGACGGCA ATGAGTTTGG GGTTATTGAG CGCCGCAAGC GTCTTTGTGT TGTTGCGCTT
TCCCACGGGA TCGACGGGTT TGAACTTGAG AAGGTTCAGC CTGTTCGCAC CAAGGAAAGT
CGCATACAGG ACATCCTGGA GCCAGTTCCG CTCGATTCTG AACGTTGGAA GTCATTTGAC
TATCTGGCTG AGAAGGAGTT GAGAGACAAA GCTGCTGGCA AGGGGTTCTC TCGCCAGCTT
CTGACTGGCG ACGATGAGTT TTGCGGCACC ATAGGTAAGG ACTATGCAAA ATGCAGAAGT
ACCGAACCTT TCATTGTTCA TCCAGAACAG CCGGAGTTGT CTCGCATCTT TACACCGACA
GAACATTGTC GAGTGAAAGG GATACCAGAG GAACTCATCC AAGGTCTGTC GGACACTATT
GCCCACCAGA TTCTCGGGCA ATCGGTAGTC TTTCCCGCGT TCGAGGCTTT AGCCCTCGCA
TTAGGGAACA GCCTGTGGAG CTGGGTTGGA ATGATGCCAA TCATGGTCGA AGTCGTGGAT
GAATCACAGC CGGTGATCGG TGGTGAAGAC TTCCATTGGG CAACGGCATT GGTTGACGCA
AAGGGCACTC TCAAGCTGTC ACCGGCAGCG AAAAAACAGG GGATGCCCTT CAACATTATG
GATGGTCAAT TGGCTGTCTA TTCACCTAAC GGAACTAAAA AGAGCTGCGG CCATGAGCCT
TGCGAATATC TCCCGGTAAT GATGTCCGGA GACGCAATCA TGGTCACTTC ATCTTTGGTT
CATTAG
 
Protein sequence
MATIVNTKLG EHRGKKRVWL EGQKLLREGY YPGMKYDLEL KDSQVVLRVK EEGKFTISKR 
ERNGRVSPII DLTVQELATV FDGVEMLRVF IRNGAIVISA HHQQERVIER VNRLISKLEN
GESLSVCSLF HGGGVLDKAI HAGFHKAGIA SAISVAVEME GKYLDSSLAN NPELWNEDSI
VIESPIQAVN LSKRPPQVDV LMGGIPCTGA SKSGRSKNKL EFAESHEAAG AMFFNFLQFV
EALNPAVVLI ENVPEYQNTA SMEVIRSVLS SLGYSLQERI LDGNEFGVIE RRKRLCVVAL
SHGIDGFELE KVQPVRTKES RIQDILEPVP LDSERWKSFD YLAEKELRDK AAGKGFSRQL
LTGDDEFCGT IGKDYAKCRS TEPFIVHPEQ PELSRIFTPT EHCRVKGIPE ELIQGLSDTI
AHQILGQSVV FPAFEALALA LGNSLWSWVG MMPIMVEVVD ESQPVIGGED FHWATALVDA
KGTLKLSPAA KKQGMPFNIM DGQLAVYSPN GTKKSCGHEP CEYLPVMMSG DAIMVTSSLV
H