Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_pSN254_0125 |
Symbol | |
ID | 4929562 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_009140 |
Strand | + |
Start bp | 107495 |
End bp | 109120 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642572424 |
Product | C-5 cytosine-specific DNA methylase |
Protein accession | YP_001101999 |
Protein GI | 134047248 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0270] Site-specific DNA methylase |
TIGRFAM ID | [TIGR00675] DNA-methyltransferase (dcm) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 75 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTACTA TCGTAAACAC CAAGTTAGGA GAACATCGAG GCAAGAAGCG CGTCTGGCTG GAAGGCCAAA AGCTACTGCG TGAAGGTTAC TATCCTGGCA TGAAGTATGA TCTGGAGCTG AAAGATTCCC AGGTTGTGCT TCGCGTCAAA GAGGAGGGTA AGTTCACCAT CAGTAAGCGT GAGCGCAATG GCCGGGTGTC TCCAATCATC GATCTGACCG TGCAGGAGCT TGCTACCGTT TTTGACGGTG TAGAGATGCT CCGCGTGTTC ATCCGTAACG GCGCAATTGT GATCTCTGCT CACCATCAAC AAGAGCGAGT GATCGAGCGC GTCAACCGGC TTATCAGTAA GCTGGAGAAT GGAGAATCGC TTTCGGTATG CAGCCTCTTT CATGGGGGTG GTGTGCTTGA TAAAGCGATT CACGCCGGTT TTCACAAGGC GGGAATTGCC AGTGCCATAT CTGTGGCCGT GGAGATGGAA GGGAAATATC TCGATTCGTC TTTGGCAAAT AACCCGGAGC TTTGGAACGA AGATTCTATT GTTATCGAAT CGCCAATCCA GGCTGTGAAT CTCAGCAAAC GCCCACCACA GGTGGATGTT TTGATGGGGG GCATACCGTG TACCGGCGCG TCAAAGTCAG GTCGCAGTAA GAACAAGTTG GAGTTTGCCG AATCGCATGA GGCGGCAGGT GCCATGTTCT TCAACTTCCT GCAATTCGTA GAAGCGCTAA ACCCAGCCGT TGTGCTGATT GAGAACGTGC CTGAGTACCA GAACACCGCT TCGATGGAAG TGATTCGTTC GGTGCTCTCT TCGTTGGGTT ACTCCCTACA AGAGCGCATT CTCGACGGCA ATGAGTTTGG GGTTATTGAG CGCCGCAAGC GTCTTTGTGT TGTTGCGCTT TCCCACGGGA TCGACGGGTT TGAACTTGAG AAGGTTCAGC CTGTTCGCAC CAAGGAAAGT CGCATACAGG ACATCCTGGA GCCAGTTCCG CTCGATTCTG AACGTTGGAA GTCATTTGAC TATCTGGCTG AGAAGGAGTT GAGAGACAAA GCTGCTGGCA AGGGGTTCTC TCGCCAGCTT CTGACTGGCG ACGATGAGTT TTGCGGCACC ATAGGTAAGG ACTATGCAAA ATGCAGAAGT ACCGAACCTT TCATTGTTCA TCCAGAACAG CCGGAGTTGT CTCGCATCTT TACACCGACA GAACATTGTC GAGTGAAAGG GATACCAGAG GAACTCATCC AAGGTCTGTC GGACACTATT GCCCACCAGA TTCTCGGGCA ATCGGTAGTC TTTCCCGCGT TCGAGGCTTT AGCCCTCGCA TTAGGGAACA GCCTGTGGAG CTGGGTTGGA ATGATGCCAA TCATGGTCGA AGTCGTGGAT GAATCACAGC CGGTGATCGG TGGTGAAGAC TTCCATTGGG CAACGGCATT GGTTGACGCA AAGGGCACTC TCAAGCTGTC ACCGGCAGCG AAAAAACAGG GGATGCCCTT CAACATTATG GATGGTCAAT TGGCTGTCTA TTCACCTAAC GGAACTAAAA AGAGCTGCGG CCATGAGCCT TGCGAATATC TCCCGGTAAT GATGTCCGGA GACGCAATCA TGGTCACTTC ATCTTTGGTT CATTAG
|
Protein sequence | MATIVNTKLG EHRGKKRVWL EGQKLLREGY YPGMKYDLEL KDSQVVLRVK EEGKFTISKR ERNGRVSPII DLTVQELATV FDGVEMLRVF IRNGAIVISA HHQQERVIER VNRLISKLEN GESLSVCSLF HGGGVLDKAI HAGFHKAGIA SAISVAVEME GKYLDSSLAN NPELWNEDSI VIESPIQAVN LSKRPPQVDV LMGGIPCTGA SKSGRSKNKL EFAESHEAAG AMFFNFLQFV EALNPAVVLI ENVPEYQNTA SMEVIRSVLS SLGYSLQERI LDGNEFGVIE RRKRLCVVAL SHGIDGFELE KVQPVRTKES RIQDILEPVP LDSERWKSFD YLAEKELRDK AAGKGFSRQL LTGDDEFCGT IGKDYAKCRS TEPFIVHPEQ PELSRIFTPT EHCRVKGIPE ELIQGLSDTI AHQILGQSVV FPAFEALALA LGNSLWSWVG MMPIMVEVVD ESQPVIGGED FHWATALVDA KGTLKLSPAA KKQGMPFNIM DGQLAVYSPN GTKKSCGHEP CEYLPVMMSG DAIMVTSSLV H
|
| |