Gene SNSL254_A3210 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A3210 
SymbolcsdA 
ID6485697 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp3120294 
End bp3121499 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content60% 
IMG OID642738512 
Productcysteine sulfinate desulfinase 
Protein accessionYP_002042236 
Protein GI194443337 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily
[TIGR03392] cysteine desulfurase, catalytic subunit CsdA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0520726 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones86 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCTT TTAATCCCAC GCAGTTTCGC GCGCAGTTTC CCGCGCTAGC CGATGCGGGT 
GTTTATCTCG ATAGCGCCGC CACGGCATTA AAGCCACAGG CAGTCATTGA CGCCACGCAC
CAGTTTTATT ATTTGAGCGC CGGTAACGTT CATCGTAGCC AGTTTGCGCA GGCGCAGCGC
CTGACGGCGC AATATGAAGC GGCCAGAGCA AAAGCAGCGC GACTGTTAAA CGCGCCCGAT
GAAAAAAGTA TCGTCTGGAC ACGCGGCACC ACCGAAGCGA TCAACATGGT GGCGCAGTGT
TACGCCCGTC CTCGTCTGCG CCCCGGCGAT GAAATTATCG TTAGCGTCGC CGAGCATCAC
GCCAACCTTG TGCCCTGGCT GATGGTGGCG CAACAAACCG GCGCGCAGGT CATAAAACTG
CCGCTTAATG ACCGGCGTCT TCCTGATGTT GAGCGTCTGC CGGAACTGAT CACGTCGCGC
AGCCGCATTC TGGCGCTGGG GCAAATGTCG AACGTAACGG GCGGCTGCCC GGATCTCGCA
AGCGCTATCA GCGCCGCTCA CGCAGCGGGA ATGGTCGTGA TGGTAGATGG CGCGCAAGGC
GCGGTACACT TCCCGGCGGA TGTTCAGCAG CTTGATATCG ATTTTTATGC TTTTTCCGCT
CACAAACTGT ATGGCCCGAC CGGTATCGGC GTGCTGTACG GTAAGCCGGA GCTTCTTGAG
GCGATGTCGC CCTGGCTCGG CGGCGGCAAG ATGATCCGTG ACGTTAGCTT TGAAGGCTTC
ACCACTCAAA GCGCTCCCTG GAAACTGGAA GCGGGGACGC CGAACGTCGC CGGGGTCATC
GGCCTGAGCG CTGCGCTGGA ATGGCTGTCC GATATCGATA TTGAACAGGC CGAAAACTGG
AGCCGCGGGC TGGCGACGCT GGCGGAAGAC GCACTGGCGA AACGCCCGGG CTTTCGTTCG
TTCCGCTGCC AGGACTCCAG CCTGCTGGCC TTTGATTTTG TCGGCGTGCA CCACGGCGAT
ATGGTGACGC TGCTGGCGGA ATACGGTATT GCGCTCCGGG CCGGGCAACA TTGCGCCCAG
CCATTGCTGG CGGAACTTGG CGTCACAGGG ACTCTGCGCG CCTCTTTTGC GCCGTATAAT
ACCCAACATG ATGTGGATGC GTTGGTTAAC GCCGTTGACC GCGCGCTGGA ACTGCTGGTG
GATTAA
 
Protein sequence
MNAFNPTQFR AQFPALADAG VYLDSAATAL KPQAVIDATH QFYYLSAGNV HRSQFAQAQR 
LTAQYEAARA KAARLLNAPD EKSIVWTRGT TEAINMVAQC YARPRLRPGD EIIVSVAEHH
ANLVPWLMVA QQTGAQVIKL PLNDRRLPDV ERLPELITSR SRILALGQMS NVTGGCPDLA
SAISAAHAAG MVVMVDGAQG AVHFPADVQQ LDIDFYAFSA HKLYGPTGIG VLYGKPELLE
AMSPWLGGGK MIRDVSFEGF TTQSAPWKLE AGTPNVAGVI GLSAALEWLS DIDIEQAENW
SRGLATLAED ALAKRPGFRS FRCQDSSLLA FDFVGVHHGD MVTLLAEYGI ALRAGQHCAQ
PLLAELGVTG TLRASFAPYN TQHDVDALVN AVDRALELLV D