Gene Information |
Locus tag | EcSMS35_1263 |
Symbol | dcyD |
ID | 6144001 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1260037 |
End bp | 1261023 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641616141 |
Product | D-cysteine desulfhydrase |
Protein accession | YP_001743324 |
Protein GI | 170683740 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2515] 1-aminocyclopropane-1-carboxylate deaminase |
TIGRFAM ID | [TIGR01275] pyridoxal phosphate-dependent enzymes, D-cysteine desulfhydrase family |
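The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1260037
end_bp = 1261023
gene_length_bp = 987
protein_length_aa = 328

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: the gene is an unspliced CDS whose last codon is a stop
# codon, so the protein has one residue fewer than the codon count.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("protein length is consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the record: 987 bp corresponds to 329 codons, of which 328 encode amino acids.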
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0547515 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.0193238 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCACTGC ATAATTTAAC CCGTTTTCCA CGGTTGGAGT TTATCGGCGC GCCAACACCG CTCGAATATC TGCCGCGCTT TTCTGATTAT CTTGGACGGG AAATTTTCAT CAAACGGGAT GACGTCACCC CCATGGCAAT GGGCGGCAAT AAATTACGTA AGCTGGAATT TCTCGCAGCA GATGCTCTGC GCGAAGGTGC CGATACGCTG ATTACTGCCG GCGCGATCCA GTCTAACCAT GTGCGCCAGA CTGCCGCAGT TGCGGCGAAA CTCGGTCTGC ACTGCGTGGC GCTGCTGGAA AATCCTATTG GCACAACCGC AGAAAACTAT TTAACCAACG GCAATCGTTT GTTGCTGGAT CTGTTCAATA CCCAGATTGA AATGTGTGAC GCACTGACCG ATCCCAATGC CCAACTGGAA GAGCTGGCGA CGCGAGTCGA AGCACAAGGC TTTCGCCCGT ATGTCATTCC GGTTGGCGGT TCTAATGCTC TGGGCGCGCT GGGTTATGTG GAGAGTGCGC TGGAAATCGC GCAACAGTGT GAAGGGGCGG TTAATATTTC GTCGGTGGTA GTCGCATCGG GCAGTGCCGG AACTCACGCC GGACTGGCTG TTGGGCTGGA ACACCTTATG CCTGAAAGCG AACTGATTGG CGTTACCGTG TCGCGTTCCG TTGCCGATCA ATTGCCGAAA GTGGTTAACC TACAACAGGC GATTGCGAAA GAACTGGAGC TGACCGCATC AGCGGAAATT TTACTCTGGG ATGACTATTT TGCACCTGGC TACGGCGTGC CGAACGATGA AGGCATGGAA GCAGTGAAAT TGCTGGCGCG GCTGGAAGGC ATTCTGCTTG ATCCTGTGTA TACCGGAAAA GCGATGGCGG GGCTGATTGA CGGTATCAGT CAGAAACGCT TCAAAGATGA AGGGCCGATT CTGTTTATTC ATACCGGCGG CGCGCCTGCG CTGTTCGCCT ATCATCCCCA CGTTTAG
Protein sequence | MPLHNLTRFP RLEFIGAPTP LEYLPRFSDY LGREIFIKRD DVTPMAMGGN KLRKLEFLAA DALREGADTL ITAGAIQSNH VRQTAAVAAK LGLHCVALLE NPIGTTAENY LTNGNRLLLD LFNTQIEMCD ALTDPNAQLE ELATRVEAQG FRPYVIPVGG SNALGALGYV ESALEIAQQC EGAVNISSVV VASGSAGTHA GLAVGLEHLM PESELIGVTV SRSVADQLPK VVNLQQAIAK ELELTASAEI LLWDDYFAPG YGVPNDEGME AVKLLARLEG ILLDPVYTGK AMAGLIDGIS QKRFKDEGPI LFIHTGGAPA LFAYHPHV