Gene SNSL254_A2119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A2119 
Symbol 
ID6485424 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp2050613 
End bp2051818 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content50% 
IMG OID642737474 
Productlysine-N-methylase 
Protein accessionYP_002041221 
Protein GI194444875 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.00000000610642 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAGAAA TCACCGTCAC TGAACCTGCC TTTGTCACCC GCTTTTCCTG TTCTGGCTCG 
GCCTGCCGCG ACCACTGTTG TAAGGGCTGG AAAATCACGC TGGATAAGAC GACGGTTAAA
AAGTATCTCG CCAGTAAAGA CGCGACGATT CGTACCATCG CGCAAGAGCA TATTATCCTG
CTGAAAAAGA ACAACAGCCA CTGGGGCGAA ATTAAGCTGC CTTCGGCGCT GGGGAATTGC
CCTTATCTGG ATGAGGAACG TCTGTGCCGG GTACAAAAAA CGTTAGGCGC AAAGGCATTA
AGTCATACCT GTTCCTCTTT CCCGCGGGCG CACCATACCT ATAAAAATGA GGTACGTAAC
TCCCTGAGTC TTGCCTGCCC GGAGGTAACG TCGCGTATTT TAAACGATCC TGACGCGATG
GCGCTCGGCG AAAAAACAAT CATTCAGCAG ACATTCAATA CTGCGCCGTT ATTTCCAGCG
CAGCAAAAGT TACTCAATCT GTTTTGCCTG AGTTTGATCA ACCATGCCAA CAGCAGTACG
GAAGCCGCGC TCTATGCGTT GATTAAATTC GTCATGTATA CGCAGAAATT TGCCAAAATT
GATGATGCCG CGCTGGGCGA ACTGGAACAG GTGTATGCCG CGCTACTTGA GCAGTTGCAG
ACCGGCGTGC TGGCGCAGGA ATTGATGAAT ATCGCGCCGG ACAGCAAGGT AAAAACCTCG
CTGGTATTGC AGATGCAGGA CTATTTCCGC TCGCTCCCGC TTAGTCGTGG CAGTGTTATC
CTCGATCACT ATATCCAGTG TCTTCTGCGG GTGCTGACGG CGGAAGAGGG CGTTTCAATG
GAGCAGAAGG TTAGCGATAT TGAGTCCTCA TTAGCGCGCT GTTTACAGGC GGATGAGCAG
CAGAAGAACT GGGCCTTCAG AAATTTAATT CTCTATAAAA TTTGGGAAAA TAATTTCCCC
AACCAGCCGA ATGTCGACCC GTTACGCGCG CTGTATATTA TCGTGGCGGA ATATGCCTTT
ATTAAGCTAT TAACGGCAGC CAGCGTGCAT GAGCGCGGGC GGCTTGAGTG GGATGACGTT
ACCAATATTG TGTATAGCTT TCATTCCCGC AGCCAGCATA GCAGCGAGGT GGCGGCGAAT
TTTCATCGCC ATATAGAAAC GGTGCGTACT GGCGACGATC TGTCGATGAT TCATCTTCTG
ACATAG
 
Protein sequence
MKEITVTEPA FVTRFSCSGS ACRDHCCKGW KITLDKTTVK KYLASKDATI RTIAQEHIIL 
LKKNNSHWGE IKLPSALGNC PYLDEERLCR VQKTLGAKAL SHTCSSFPRA HHTYKNEVRN
SLSLACPEVT SRILNDPDAM ALGEKTIIQQ TFNTAPLFPA QQKLLNLFCL SLINHANSST
EAALYALIKF VMYTQKFAKI DDAALGELEQ VYAALLEQLQ TGVLAQELMN IAPDSKVKTS
LVLQMQDYFR SLPLSRGSVI LDHYIQCLLR VLTAEEGVSM EQKVSDIESS LARCLQADEQ
QKNWAFRNLI LYKIWENNFP NQPNVDPLRA LYIIVAEYAF IKLLTAASVH ERGRLEWDDV
TNIVYSFHSR SQHSSEVAAN FHRHIETVRT GDDLSMIHLL T