Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2951 |
Symbol | csdA |
ID | 6146569 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3026637 |
End bp | 3027842 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641617820 |
Product | cysteine sulfinate desulfinase |
Protein accession | YP_001744975 |
Protein GI | 170680672 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily [TIGR03392] cysteine desulfurase, catalytic subunit CsdA |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00101745 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGTTT TTAATCCCGC GCAGTTTCGC GCCCAGTTTC CCGCGCTACA GGATGCGGGC GTCTATCTCG ACAGCGCCGC GACCGCGCTT AAACCTGAAG CCGTGGTTGA AGCCACCCGA CAGTTTTATA GCCTGAGCGC CGGAAACGTC CATCGCAGCC AGTTTGCCGA AGCCCAACGC CTGACCGCGC GTTACGAAGC TGCGCGGGAA AAAGTGGCAC AATTACTGAA TGCACCGGAT GATAAAACTA TCGTCTGGAC GCGCGGCACC ACTGAATCCA TCAACATGGT GGCACAATGC TATGCGCGTC CGCGTCTGCA ACCGGGCGAT GAAATTATTG TCAGTGTGGC AGAACACCAC GCCAACCTCG TCCCCTGGCT GATGGTCGCC CAACAAACGG GGGCCAAAGT GGTGAAATTG CCGCTTAATG CGCAGCGACT GCCGGATGTC GATTTGTTGC CAGAACTGAT TACTCCCCGT AGTCGGATTC TGGCGTTGGG CCAGATGTCG AACGTCACTG GCGGTTGCCC GGATCTGGCG AGAGCGATTA CTTTTGCTCA TTCAGCCGGG ATGGTGGTGA TGGTTGATGG TGCTCAGGGG GCGGTGCATT TCCCCGCGGA TGTTCAGCAA CTGGATATTG ATTTCTATGC TTTTTCAGGT CACAAACTGT ATGGCCCGAC AGGCATCGGC GTGCTGTATG GCAAACCAGA ACTGCTGGAA GCGATGTCGC CCTGGCTGGG CGGCGGCAAA ATGGTTCACG AAGTGAGTTT TGACGGCTTC ACGACTCAAT CTGCGCCGTG GAAACTGGAA GCAGGAACGC CAAATGTCGC TGGCGTCATA GGATTAAGCG CGGCGCTGGA ATGGCTGACA GATTACGATA TCAACCAGGC CGAAAGCTGG AGCCGTAGCT TAGCAACGCT TGCAGAAGAA GCGCTGGCGA AACGTCCAGG CTTTCGTTCA TTCCGCTGCC AGGATTCCAG CCTGCTGGCC TTTGATTTTG CTGGAGTTCA TCACAGCGAT ATGGTGACAC TGCTGGCGGA GTACGGTATT GCCCTGCGGG CCGGGCAGCA TTGCGCTCAG CCGCTACTGG CAGAATTAGG CGTAACCGGC ACACTGCGCG CCTCTTTTGC GCCATATAAT ACAAAGAGTG ATGTGGATGC GCTGGTGAAT GCCGTTGACC GCGCGCTGGA ATTATTGGTG GATTAA
|
Protein sequence | MNVFNPAQFR AQFPALQDAG VYLDSAATAL KPEAVVEATR QFYSLSAGNV HRSQFAEAQR LTARYEAARE KVAQLLNAPD DKTIVWTRGT TESINMVAQC YARPRLQPGD EIIVSVAEHH ANLVPWLMVA QQTGAKVVKL PLNAQRLPDV DLLPELITPR SRILALGQMS NVTGGCPDLA RAITFAHSAG MVVMVDGAQG AVHFPADVQQ LDIDFYAFSG HKLYGPTGIG VLYGKPELLE AMSPWLGGGK MVHEVSFDGF TTQSAPWKLE AGTPNVAGVI GLSAALEWLT DYDINQAESW SRSLATLAEE ALAKRPGFRS FRCQDSSLLA FDFAGVHHSD MVTLLAEYGI ALRAGQHCAQ PLLAELGVTG TLRASFAPYN TKSDVDALVN AVDRALELLV D
|
| |