Gene ECH74115_2693 details

Gene Information

Locus tag: ECH74115_2693
Symbol: dcyD
ID: 6968508
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2528375
End bp: 2529361
Gene Length: 987 bp
Protein Length: 328 aa
Translation table: 11
GC content: 54%
IMG OID: 643386554
Product: D-cysteine desulfhydrase
Protein accession: YP_002271033
Protein GI: 209400168
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2515] 1-aminocyclopropane-1-carboxylate deaminase
TIGRFAM ID: [TIGR01275] pyridoxal phosphate-dependent enzymes, D-cysteine desulfhydrase family
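
The coordinates and lengths listed above agree with one another, which a short check can confirm. The sketch below assumes that the start and end positions are 1-based and inclusive (an assumption, though it reproduces the reported gene length) and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative only.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS of the given length, excluding one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2528375, 2529361)
print(length)                  # 987
print(protein_length(length))  # 328
```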


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.102426
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value: 0.0614251
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCACTGC ATAATTTAAC CCGTTTTCCA CGGCTGGAGT TTATCGGCGC GCCAACGCCG 
CTCGAATATC TGCCGCGCTT TTCTGATTAT CTAGGACGGG AAATTTTCAT CAAACGGGAT
GACGTCACCC CCATGGCAAT GGGCGGCAAT AAATTACGTA AGCTAGAATT TCTCGCGGCA
GATGCTCTGC GTGAAGGTGC CGATACGCTG ATTACTGCCG GGGCGATCCA GTCTAACCAT
GTGCGGCAGA CTGCCGCAGT CGCTGCCAAA CTCGGTCTGC ACTGCGTGGC GCTGCTGGAA
AATCCTATTG GCACAACCGC AGAAAACTAT TTAACCAACG GCAATCGCTT GTTGCTGGAT
CTGTTCAATA CCCAGATTGA AATGTGCGAC GCACTGACCG ATCCCAATGC CCAACTGGAA
GAGCTGGCGA CGCGAGTCGA AGCACAAGGC TTTCGCCCGT ATGTCATTCC GGTTGGCGGT
TCTAATGCTC TTGGCGCGCT AGGTTATGTG GAGAGTGCGC TGGAAATCGC GCAACAGTGT
GAAGGGGCGG TTAATATTTC ATCGGTGGTA GTCGCATCGG GCAGTGCCGG AACTCACGCC
GGACTGGCTG TTGGGCTGGA ACACCTGATG CCTGAAAGCG AACTGATTGG CGTGACCGTG
TCGCGTTCCG TTGCCGATCA ATTGCCGAAA GTGGTTAACC TACAACAGGC GATTGCGAAA
GAACTGGAGC TGACCGCATC AGTGGAAATT TTACTCTGGG ATGACTATTT TGCACCTGGC
TACGGCGTGC CGAACGACGA AGGCATGGAA GCAGTGAAAT TGCTGGCGCG GCTGGAAGGC
ATTCTGCTTG ATCCTGTGTA TACCGGAAAA GCGATGGCGG GTCTGATTGA CGGTATCAGT
CAGAAACGCT TCAAAGATGA AGGGCCGATT CTGTTTATTC ATACCGGCGG CGCGCCTGCG
CTGTTCGCCT ATCATCCCCA CGTTTAG
 
Protein sequence
MPLHNLTRFP RLEFIGAPTP LEYLPRFSDY LGREIFIKRD DVTPMAMGGN KLRKLEFLAA 
DALREGADTL ITAGAIQSNH VRQTAAVAAK LGLHCVALLE NPIGTTAENY LTNGNRLLLD
LFNTQIEMCD ALTDPNAQLE ELATRVEAQG FRPYVIPVGG SNALGALGYV ESALEIAQQC
EGAVNISSVV VASGSAGTHA GLAVGLEHLM PESELIGVTV SRSVADQLPK VVNLQQAIAK
ELELTASVEI LLWDDYFAPG YGVPNDEGME AVKLLARLEG ILLDPVYTGK AMAGLIDGIS
QKRFKDEGPI LFIHTGGAPA LFAYHPHV
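
The gene and protein sequences above can be cross-checked against the reported length, GC content, and translation table. The following is a minimal sketch, not the database's own method: it strips the spaces and line breaks of the 10-letter block layout, computes GC content, and translates in frame up to the first stop codon. It relies on the fact that translation table 11 assigns amino acids to codons exactly as the standard code does (the two differ only in permitted start codons), so a standard codon table is used. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); table 11 uses
# the same codon-to-amino-acid mapping as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used in the block layout."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
fragment = "ATGCCACTGC ATAATTTAAC CCGTTTTCCA"
print(translate(fragment))   # MPLHNLTRFP
print(gc_content(fragment))  # 40.0
```

Pasting the full gene sequence into `translate` should reproduce the protein sequence listed above, and `len(clean(...))` should match the reported gene length of 987 bp.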