Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A3310 |
Symbol | csdA |
ID | 6872581 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 3183703 |
End bp | 3184908 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642786319 |
Product | cysteine sulfinate desulfinase |
Protein accession | YP_002216960 |
Protein GI | 198243622 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily [TIGR03392] cysteine desulfurase, catalytic subunit CsdA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.04923 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 67 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCTT TTAATCCCAC GCAGTTTCGC GCGCAGTTTC CCGCGCTAGC CGATGCGGGT GTTTATCTCG ATAGCGCCGC CACGGCATTA AAGCCACAGG CAGTCATTGA CGCCACGCAC CAGTTTTATT GTTTGAGCGC CGGTAACGTT CATCGTAGCC AGTTTGCGCA GGCGCAGCGC CTGACGGCGC AATATGAAGC GGCCAGAGCA AAAGCAGCGC GGCTGTTAAA CGCGCCCGAT GAAAAAAGTA TCGTCTGGAC ACGCGGCACC ACCGAAGCGA TCAACATGGT GGCGCAGTGT TACGCCCGTC CTCGTCTGCG CCCCGGCGAT GAAATTATCG TTAGCGTCGC CGAGCATCAC GCCAACCTTG TGCCCTGGCT GATGGTGGCG CAACAAACCG GCGCGCAGGT CATAAAACTG CCGCTTAATG ACCGACGTCT TCCTGATGTT GAGCGTCTGC CGGAACTGAT CACGTCGCGC AGCCGGATTC TGGCGCTGGG GCAAATGTCG AACGTAACGG GCGGCTGCCC GGATCTCGCA AGCGCTATCA GCACCGCTCA CGCAGCGGGA ATGGTCGTGA TGGTAGATGG CGCGCAAGGC GCGGTACACT TCCCGGCGGA TGTCCAGCAG CTTGATATCG ATTTTTATGC TTTTTCCGCT CACAAACTGT ATGGCCCGAC CGGTATCGGC GTGCTGTACG GTAAGCCGGA GCTTCTTGAG GCGATGTCGC CCTGGCTCGG CGGCGGCAAG ATGATCCGTG ACGTTAGCTT TGAAGGCTTC ACCACTCAAA GCGCTCCCTG GAAGCTGGAA GCGGGGACGC CGAACGTCGC CGGGGTCATC GGCCTGAGCG CTGCGCTGGA ATGGCTGTCC GATATCGATA TTGAACAGGC CGAAAACTGG AGCCGGGGGT TGGCGACGCT GGCGGAAGAC GCACTGGCGA AACGTCCGGG CTTTCGTTCG TTCCGCTGCC AGGACTCCAG CCTGCTGGCC TTTGATTTTG TCGGCGTGCA CCACGGCGAT ATGGTGACGC TGCTGGCGGA ATACGGTATT GCGCTCCGGG CCGGGCAACA TTGCGCCCAG CCATTGCTGG CGGAACTTGG CGTCACAGGG ACTCTGCGCG CCTCTTTTGC GCCGTATAAT ACCCAACATG ATGTGGATGC GTTGGTTAAC GCCGTTGACC GCGCGCTGGA ACTGCTGGTG GATTAA
|
Protein sequence | MNAFNPTQFR AQFPALADAG VYLDSAATAL KPQAVIDATH QFYCLSAGNV HRSQFAQAQR LTAQYEAARA KAARLLNAPD EKSIVWTRGT TEAINMVAQC YARPRLRPGD EIIVSVAEHH ANLVPWLMVA QQTGAQVIKL PLNDRRLPDV ERLPELITSR SRILALGQMS NVTGGCPDLA SAISTAHAAG MVVMVDGAQG AVHFPADVQQ LDIDFYAFSA HKLYGPTGIG VLYGKPELLE AMSPWLGGGK MIRDVSFEGF TTQSAPWKLE AGTPNVAGVI GLSAALEWLS DIDIEQAENW SRGLATLAED ALAKRPGFRS FRCQDSSLLA FDFVGVHHGD MVTLLAEYGI ALRAGQHCAQ PLLAELGVTG TLRASFAPYN TQHDVDALVN AVDRALELLV D
|
| |