Gene SeHA_C3195 details

Gene Information

Locus tag: SeHA_C3195
Symbol: csdA
ID: 6489375
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 3118595
End bp: 3119800
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 60%
IMG OID: 642743337
Product: cysteine sulfinate desulfinase
Protein accession: YP_002046956
Protein GI: 194451539
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01979] cysteine desulfurases, SufS subfamily
            [TIGR03392] cysteine desulfurase, catalytic subunit CsdA
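
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (neither convention is stated in the record). The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3118595
END_BP = 3119800
GENE_LENGTH_BP = 1206
PROTEIN_LENGTH_AA = 401


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, 3119800 - 3118595 + 1 gives 1206 bp, and 1206 / 3 - 1 gives 401 aa, which agrees with the reported gene and protein lengths.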


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.364366
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 84
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGCTT TTAATCCCAC GCAGTTTCGC GCGCAGTTTC CCGCGCTAGC CGATGCGGGT 
GTTTATCTCG ATAGCGCCGC CACGGCATTA AAGCCACAGG CAGTCATTGA CGCCACGCAC
CAGTTTTATT GTTTGAGCGC CGGTAACGTT CATCGTAGCC AGTTTGCGCA GGCGCAGCGC
CTGACGGCGC AATATGAAGC GGCCAGAGCA AAAGCAGCAC GACTGTTAAA CGCGCCCGAT
GAAAAAAGTA TCGTCTGGAC ACGCGGCACC ACCGAAGCGA TCAACATGGT GGCGCAGTGT
TACGCCCGTC CTCGTCTGCG CCCCGGCGAT GAAATTATCG TTAGCGTCGC CGAGCATCAC
GCCAACCTTG TGCCCTGGCT GATGGTGGCG CAACAAACCG GCGCGCAGGT CATAAAACTG
CCGCTTAATG ACCGGCGTCT TCCTGATGTT GAGCGTCTGC CGGAACTGAT CACGTCGCGC
AGCCGGATTC TGGCGCTGGG GCAAATGTCG AACGTAACGG GCGGCTGTCC GGATCTCGCA
AGCGCTATCA GCGCCGCTCA CGCGGCGGGA ATGGTCGTGA TGGTAGATGG CGCGCAAGGC
GCGGTACACT TCCCGGCGGA TGTTCAGCAG CTTGATATCG ATTTTTATGC TTTTTCCGCT
CACAAACTGT ATGGCCCGAC CGGTATCGGC GTGCTGTACG GTAAGCCGGA GCTTCTTGAG
GCGATGTCGC CCTGGCTCGG CGGCGGCAAG ATGATCCGTG ACGTTAGCTT TGAAGGCTTC
ACCACTCAAA GCGCTCCCTG GAAGCTGGAA GCGGGGACGC CGAACGTCGC CGGGGTCATC
GGCCTGAGCG CTGCGCTGGA ATGGCTGTCC GATATCGATA TTGAACAGGC CGAAAACTGG
AGCCGGGGGC TGGCGACGCT GGCGGAAGAC GCACTGGCGA AACGCCCGGG CTTTCGTTCG
TTCCGCTGCC AGGACTCCAG CCTGCTGGCC TTTGATTTTG TCGGCGTGCA CCACGGCGAT
ATGGTGACGC TGCTGGCGGA ATACGGTATT GCGCTCCGGG CCGGGCAACA TTGCGCCCAG
CCATTGCTGG CGGAACTTGG CGTCACAGGG ACTCTGCGCG CCTCTTTTGC GCCGTATAAT
ACCCAACATG ATGTGGATGC GTTGGTTAAC GCCGTTGACC GCGCGCTGGA ACTGCTGGTG
GATTAA
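
The GC content field can in principle be recomputed from the gene sequence. The sketch below is illustrative only: `gc_percent` is a hypothetical helper, and it assumes the sequence is given in the 10-base blocks used above, with whitespace to be ignored. How the database rounds the reported percentage is not stated in the record.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence, copied from this record.
first_line = "ATGAACGCTT TTAATCCCAC GCAGTTTCGC GCGCAGTTTC CCGCGCTAGC CGATGCGGGT"
print(f"{gc_percent(first_line):.1f}% GC in the first 60 bases")
```

Passing the full 1206-base sequence to the same function would give a value to compare with the reported GC content.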
 
Protein sequence
MNAFNPTQFR AQFPALADAG VYLDSAATAL KPQAVIDATH QFYCLSAGNV HRSQFAQAQR 
LTAQYEAARA KAARLLNAPD EKSIVWTRGT TEAINMVAQC YARPRLRPGD EIIVSVAEHH
ANLVPWLMVA QQTGAQVIKL PLNDRRLPDV ERLPELITSR SRILALGQMS NVTGGCPDLA
SAISAAHAAG MVVMVDGAQG AVHFPADVQQ LDIDFYAFSA HKLYGPTGIG VLYGKPELLE
AMSPWLGGGK MIRDVSFEGF TTQSAPWKLE AGTPNVAGVI GLSAALEWLS DIDIEQAENW
SRGLATLAED ALAKRPGFRS FRCQDSSLLA FDFVGVHHGD MVTLLAEYGI ALRAGQHCAQ
PLLAELGVTG TLRASFAPYN TQHDVDALVN AVDRALELLV D
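
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first line of the gene with translation table 11. This is a minimal sketch: the `translate` function is hypothetical, the codon table is built from the amino-acid string of NCBI genetic code 11 (which assigns the same amino acids as the standard code), and the special handling of alternative start codons is not implemented because this gene begins with ATG.

```python
from itertools import product

# Amino acids of genetic code 11, in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 1 up to the first stop codon; whitespace is ignored."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence and first two blocks of the protein sequence.
gene_start = "ATGAACGCTT TTAATCCCAC GCAGTTTCGC GCGCAGTTTC CCGCGCTAGC CGATGCGGGT"
protein_start = "MNAFNPTQFR AQFPALADAG"

assert translate(gene_start) == protein_start.replace(" ", "")
```

The first 60 bases translate to the first 20 residues listed in the protein sequence. The final codon of the gene, TAA, is a stop codon and contributes no residue.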