Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A2018 |
Symbol | dcyD |
ID | 5593828 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2018121 |
End bp | 2019107 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640921164 |
Product | D-cysteine desulfhydrase |
Protein accession | YP_001458709 |
Protein GI | 157161391 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2515] 1-aminocyclopropane-1-carboxylate deaminase |
TIGRFAM ID | [TIGR01275] pyridoxal phosphate-dependent enzymes, D-cysteine desulfhydrase family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 0.0820004 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCACTGC ATAATTTAAC CCGTTTTCCA CGGTTGGAGT TTATCGGCGC GCCAACGCCG CTCGAATATC TGCCGCGCTT TTCTGATTAT CTTGGACGGG AAATTTTCAT CAAACGGGAT GACGTCACCC CCATGGCAAT GGGCGGCAAT AAATTACGTA AGCTGGAATT TCTCGCAGCA GATGCTCTGC GCGAAGGTGC CGATACGCTG ATTACTGCCG GCGCGATCCA GTCTAACCAT GTGCGCCAGA CTGCCGCAGT TGCGGCGAAA CTCGGTCTGC ACTGCGTGGC GCTGCTGGAA AATCCTATTG GCACAACCGC AGAAAACTAT TTAACCAACG GCAATCGCTT GTTGCTGGAT CTGTTCAATA CCCAGATTGA AATGTGCGAC GCACTGACCG ATCCCAATGC CCAACTGGAA GAGCTGGCGA CGCGAGTCGA AGCACAAGGC TTTCGCCCGT ATGTCATTCC GGTTGGCGGT TCTAATGCTT TGGGCGCGCT GGGTTATGTG GAGAGTGCGC TGGAAATCGC GCAACAGTGT GAAGGGGCGG TTAATATTTC GTCGGTGGTA GTCGCATCGG GCAGTGCCGG AACTCACGCC GGACTGGCTG TTGGGCTGGA ACACCTGCTG CCTGAAAGCG AACTGATTGG CGTGACCGTG TCGCGTTCCG TTGCCGATCA ATTGCCGAAA GTGGTTAACC TACAACAGGC GATTGCGAAA GAACTGGAGC TGACCGCATC AGCGGAAATT TTACTCTGGG ATGACTATTT TGCACCTGGC TACGGCGTGC CGAACGACGA AGGCATGGAA GCAGTGAAAT TGCTGGCGCG GTTTGAAGGC ATTCTGCTTG ATCCTGTGTA TACCGGAAAA GCGATGGCGG GTCTGATTGA CGGTATCAGT CAGAAACGCT TCAAAGATGA AGGGCCGATT CTGTTTATTC ATACCGGCGG CGCGCCTGCG CTGTTCGCCT ATCATCCCCA CGTTTAG
|
Protein sequence | MPLHNLTRFP RLEFIGAPTP LEYLPRFSDY LGREIFIKRD DVTPMAMGGN KLRKLEFLAA DALREGADTL ITAGAIQSNH VRQTAAVAAK LGLHCVALLE NPIGTTAENY LTNGNRLLLD LFNTQIEMCD ALTDPNAQLE ELATRVEAQG FRPYVIPVGG SNALGALGYV ESALEIAQQC EGAVNISSVV VASGSAGTHA GLAVGLEHLL PESELIGVTV SRSVADQLPK VVNLQQAIAK ELELTASAEI LLWDDYFAPG YGVPNDEGME AVKLLARFEG ILLDPVYTGK AMAGLIDGIS QKRFKDEGPI LFIHTGGAPA LFAYHPHV
|
| |