Gene Information |
Locus tag | ECD_02661 |
Symbol | csdA |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Escherichia coli BL21(DE3) |
Kingdom | Bacteria |
Replicon accession | CP001509 |
Strand | + |
Start bp | 2781617 |
End bp | 2782822 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | |
Product | cysteine sulfinate desulfinase |
Protein accession | ACT44477 |
Protein GI | 253978807 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
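The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2781617
end_bp = 2782822

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length)     # 1206, matching "Gene Length"
print(remainder)       # 0, so the CDS is a whole number of codons
print(protein_length)  # 401, matching "Protein Length"
```

Under these assumptions the computed values agree with the Gene Length (1206 bp) and Protein Length (401 aa) fields.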
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGTTT TTAATCCCGC GCAGTTTCGC GCCCAGTTTC CCGCACTACA GGATGCGGGC GTCTATCTCG ACAGCGCCGC GACCGCGCTT AAACCTGAAG CCGTGGTTGA AGCCACCCAA CAGTTTTACA GTCTGAGCGC CGGAAACGTC CATCGCAGCC AGTTTGCCGA AGCCCAACGC CTGACCGCGC GTTATGAAGC TGCACGAGAG AAAGTGGCGC AATTACTGAA TGCACCGGAT GATAAAACTA TCGTCTGGAC GCGCGGCACC ACTGAATCCA TCAACATGGT GGCACAATGC TATGCGCGTC CGCGTCTGCA ACCGGGCGAT GAGATTATTG TCAGCGTGGC AGAACACCAC GCCAACCTCG TCCCCTGGCT GATGGTCGCC CAACAAACTG GAGCCAAAGT GGTGAAATTG CCGCTTAATG CGCAGCGACT GCCGGATGTC GATTTGTTGC CAGAACTGAT TACTCCCCGT AGTCGGATTC TGGCGTTGGG TCAGATGTCG AACGTTACTG GCGGTTGCCC GGATCTGGCG CGAGCGATTA CCTTTGCTCA TTCAGCCGGG ATGGTGGTGA TGGTTGATGG TGCTCAGGGG GCAGTGCATT TCCCCGCGGA TGTTCAGCAA CTGGATATTG ATTTCTATGC TTTTTCAGGT CACAAACTGT ATGGCCCGAC AGGTATCGGC GTGCTGTATG GTAAATCAGA ACTGCTGGAG GCGATGTCGC CCTGGCTGGG CGGCGGCAAA ATGGTTCACG AAGTGAGTTT TGACGGCTTC ACGACTCAAT CTGCGCCGTG GAAACTGGAA GCTGGAACGC CAAATGTCGC TGGTGTCATA GGATTAAGCG CGGCGCTGGA ATGGCTGGCA GATTACGATA TCAACCAGGC CGAAAGCTGG AGCCGTAGCT TAGCAACGCT GGCGGAAGAT GCGCTGGCGA AACGTCCCGG CTTTCGTTCA TTCCGCTGCC AGGATTCCAG CCTGCTGGCC TTTGATTTTG CTGGCGTTCA TCATAGCGAT ATGGTGACGC TGCTGGCGGA GTACGGTATT GCCCTGCGGG CCGGGCAGCA TTGCGCTCAG CCGCTACTGG CAGAATTAGG CGTAACCGGC ACACTGCGCG CCTCTTTTGC GCCATATAAT ACAAAGAGTG ATGTGGATGC GCTGGTGAAT GCCGTTGACC GCGCGCTGGA ATTATTGGTG GATTAA
|
Protein sequence | MNVFNPAQFR AQFPALQDAG VYLDSAATAL KPEAVVEATQ QFYSLSAGNV HRSQFAEAQR LTARYEAARE KVAQLLNAPD DKTIVWTRGT TESINMVAQC YARPRLQPGD EIIVSVAEHH ANLVPWLMVA QQTGAKVVKL PLNAQRLPDV DLLPELITPR SRILALGQMS NVTGGCPDLA RAITFAHSAG MVVMVDGAQG AVHFPADVQQ LDIDFYAFSG HKLYGPTGIG VLYGKSELLE AMSPWLGGGK MVHEVSFDGF TTQSAPWKLE AGTPNVAGVI GLSAALEWLA DYDINQAESW SRSLATLAED ALAKRPGFRS FRCQDSSLLA FDFAGVHHSD MVTLLAEYGI ALRAGQHCAQ PLLAELGVTG TLRASFAPYN TKSDVDALVN AVDRALELLV D
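The relationship between the gene sequence and the protein sequence can be checked with a short script. This is a minimal sketch, not the database's own pipeline: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is built from the standard amino-acid assignments, which NCBI translation table 11 shares with the standard code (the two differ only in which codons may act as start codons). Only the first 30 bases of the gene sequence are used here to keep the example short; the full sequence can be pasted in the same space-grouped form.

```python
from itertools import product

# Amino-acid assignments in NCBI order (bases T, C, A, G; third base
# varies fastest). Translation table 11 uses these same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the grouping spaces used in the record and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
prefix = "ATGAACGTTT TTAATCCCGC GCAGTTTCGC"
print(translate(prefix))  # MNVFNPAQFR, the first 10 residues of the protein
print(gc_fraction(prefix))
```

Applied to the full gene sequence, `translate` should stop at the terminal TAA codon, and `gc_fraction` can be compared against the GC content field.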