Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_1720 |
Symbol | |
ID | 6067313 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 1917211 |
End bp | 1918197 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641601132 |
Product | D-cysteine desulfhydrase |
Protein accession | YP_001724697 |
Protein GI | 170019743 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2515] 1-aminocyclopropane-1-carboxylate deaminase |
TIGRFAM ID | [TIGR01275] pyridoxal phosphate-dependent enzymes, D-cysteine desulfhydrase family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.628209 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCACTGC ATAATTTAAC CCGTTTTCCA CGGCTGGAGT TTATCGGCGC GCCAACGCCG CTCGAATATC TGCCGCGCTT TTCTGATTAT CTTGGACGGG AAATTTTCAT CAAACGGGAT GACGTCACAC CCATGGCAAT GGGCGGCAAT AAATTACGTA AGCTGGAATT TCTCGCGGCA GATGCTCTGC GTGAAGGTGC CGATACGCTG ATTACTGCCG GGGCGATCCA GTCTAACCAT GTGCGCCAGA CTGCCGCAGT TGCGGCGAAA CTCGGTCTGC ACTGCGTGGC GCTGCTGGAA AATCCTATTG GCACAACCGC AGAAAACTAT TTAACCAACG GCAATCGTTT GTTGCTGGAT CTGTTCAATA CCCAGATTGA AATGTGCGAC GCACTGACCG ATCCCAATGC CCAACTGGAA GAGCTGGCGA CGCGAGTCGA AGCACAAGGC TTTCGCCCGT ATGTCATTCC GGTTGGCGGT TCTAATGCTC TGGGCGCGCT GGGTTATGTG GAGAGTGCGC TGGAAATTGC GCAACAGTGT GAAGGGGCGG TTAATATTTC GTCGGTGGTG GTCGCATCGG GCAGTGCCGG AACTCACGCC GGACTGGCTG TTGGGCTGGA ACACCTGATG CCTGAAAGCG AACTGATTGG CGTGACCGTG TCGCGTTCCG TTGCCGATCA ATTGCCGAAA GTGGTTAACC TACAACAGGC GATTGCGAAA GAACTGGAGC TGACCGCATC AGCGGAAATT TTACTCTGGG ATGACTATTT TGCACCTGGC TACGGCGTGC CGAACGACGA AGGCATGGAA GCAGTGAAAT TGCTGGCGCG GCTGGAAGGC ATTCTGCTTG ATCCTGTGTA TACCGGAAAA GCGATGGCGG GTCTGATTGA CGGTATCAGT CAGAAACGTT TCAAAGATGA AGGGCCGATT CTGTTTATTC ATACCGGCGG TGCGCCTGCG CTGTTCGCCT ATCATCCCCA CGTTTAG
|
Protein sequence | MPLHNLTRFP RLEFIGAPTP LEYLPRFSDY LGREIFIKRD DVTPMAMGGN KLRKLEFLAA DALREGADTL ITAGAIQSNH VRQTAAVAAK LGLHCVALLE NPIGTTAENY LTNGNRLLLD LFNTQIEMCD ALTDPNAQLE ELATRVEAQG FRPYVIPVGG SNALGALGYV ESALEIAQQC EGAVNISSVV VASGSAGTHA GLAVGLEHLM PESELIGVTV SRSVADQLPK VVNLQQAIAK ELELTASAEI LLWDDYFAPG YGVPNDEGME AVKLLARLEG ILLDPVYTGK AMAGLIDGIS QKRFKDEGPI LFIHTGGAPA LFAYHPHV
|
| |