Gene SNSL254_A2115 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A2115 
Symbol 
ID6486467 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp2047000 
End bp2047986 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content57% 
IMG OID642737471 
ProductD-cysteine desulfhydrase 
Protein accessionYP_002041218 
Protein GI194445885 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2515] 1-aminocyclopropane-1-carboxylate deaminase 
TIGRFAM ID[TIGR01275] pyridoxal phosphate-dependent enzymes, D-cysteine desulfhydrase family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.00000000000146031 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCACTAC ATCACTTAAC GCGCTTTCCT CGCCTGGAGC TTATCGGCGC GCCGACCCCG 
CTCGAATACC TGCCGCGATT GTCTGATTAT CTTGGCCGTG AGATTTACAT TAAGCGCGAT
GACGTTACGC CCATTGCAAT GGGCGGCAAT AAACTGCGCA AGCTGGAGTT TTTGGTGGCC
GACGCGCTGC GAGAAGGGGC GGATACGCTG ATAACCGCAG GAGCGATTCA GTCGAACCAT
GTCCGTCAGA CGGCGGCGGT CGCGGCCAAA TTGGGACTGC ATTGCGTCGC CTTGCTGGAA
AATCCAATCG GTACCACCGC GGAAAATTAC CTGACGAATG GTAATCGCCT GTTACTGGAT
TTATTTAATA CGCAAATTGA GATGTGCGAT GCGCTAACCG ATCCGGATGC GCAGCTGCAA
ACGCTGGCGA CGCGCATTGA AGCGCAAGGG TTCAGGCCCT ATGTGATTCC GGTCGGCGGC
TCCAGCGCGC TGGGGGCAAT GGGATACGTA GAGAGCGCCC TGGAAATCGC CCAGCAGTGT
GAAGAGGTCG TCGGGCTCTC TTCGGTGGTG GTGGCCTCCG GCAGCGCCGG AACGCACGCC
GGGTTAGCTG TCGGGCTGGA ACATCTGATG CCGGATGTCG AACTAATTGG CGTGACCGTT
TCACGTTCTG TCGCCGAGCA GAAACCCAAA GTGATTGCCT TGCAGCAGGC TATTGCCGGT
CAGCTGGCGC TGACGGCGAC GGCGGATATT CATTTATGGG ATGACTATTT TGCCCCGGGT
TACGGCGTGC CAAATGACGC GGGGATGGAG GCGGTGAAAC TGCTGGCGAG CCTGGAGGGG
GTGTTGCTGG ATCCGGTATA TACCGGAAAA GCGATGGCGG GTCTGATAGA CGGCATCAGC
CAGAAACGCT TTAACGATGA CGGGCCGATT CTCTTTATTC ACACCGGTGG GGCGCCTGCG
CTGTTTGCCT ACCATCCTCA TGTATAA
 
Protein sequence
MPLHHLTRFP RLELIGAPTP LEYLPRLSDY LGREIYIKRD DVTPIAMGGN KLRKLEFLVA 
DALREGADTL ITAGAIQSNH VRQTAAVAAK LGLHCVALLE NPIGTTAENY LTNGNRLLLD
LFNTQIEMCD ALTDPDAQLQ TLATRIEAQG FRPYVIPVGG SSALGAMGYV ESALEIAQQC
EEVVGLSSVV VASGSAGTHA GLAVGLEHLM PDVELIGVTV SRSVAEQKPK VIALQQAIAG
QLALTATADI HLWDDYFAPG YGVPNDAGME AVKLLASLEG VLLDPVYTGK AMAGLIDGIS
QKRFNDDGPI LFIHTGGAPA LFAYHPHV