Gene EcHS_A2018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2018 
SymboldcyD 
ID5593828 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2018121 
End bp2019107 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content54% 
IMG OID640921164 
ProductD-cysteine desulfhydrase 
Protein accessionYP_001458709 
Protein GI157161391 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2515] 1-aminocyclopropane-1-carboxylate deaminase 
TIGRFAM ID[TIGR01275] pyridoxal phosphate-dependent enzymes, D-cysteine desulfhydrase family 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value0.0820004 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCACTGC ATAATTTAAC CCGTTTTCCA CGGTTGGAGT TTATCGGCGC GCCAACGCCG 
CTCGAATATC TGCCGCGCTT TTCTGATTAT CTTGGACGGG AAATTTTCAT CAAACGGGAT
GACGTCACCC CCATGGCAAT GGGCGGCAAT AAATTACGTA AGCTGGAATT TCTCGCAGCA
GATGCTCTGC GCGAAGGTGC CGATACGCTG ATTACTGCCG GCGCGATCCA GTCTAACCAT
GTGCGCCAGA CTGCCGCAGT TGCGGCGAAA CTCGGTCTGC ACTGCGTGGC GCTGCTGGAA
AATCCTATTG GCACAACCGC AGAAAACTAT TTAACCAACG GCAATCGCTT GTTGCTGGAT
CTGTTCAATA CCCAGATTGA AATGTGCGAC GCACTGACCG ATCCCAATGC CCAACTGGAA
GAGCTGGCGA CGCGAGTCGA AGCACAAGGC TTTCGCCCGT ATGTCATTCC GGTTGGCGGT
TCTAATGCTT TGGGCGCGCT GGGTTATGTG GAGAGTGCGC TGGAAATCGC GCAACAGTGT
GAAGGGGCGG TTAATATTTC GTCGGTGGTA GTCGCATCGG GCAGTGCCGG AACTCACGCC
GGACTGGCTG TTGGGCTGGA ACACCTGCTG CCTGAAAGCG AACTGATTGG CGTGACCGTG
TCGCGTTCCG TTGCCGATCA ATTGCCGAAA GTGGTTAACC TACAACAGGC GATTGCGAAA
GAACTGGAGC TGACCGCATC AGCGGAAATT TTACTCTGGG ATGACTATTT TGCACCTGGC
TACGGCGTGC CGAACGACGA AGGCATGGAA GCAGTGAAAT TGCTGGCGCG GTTTGAAGGC
ATTCTGCTTG ATCCTGTGTA TACCGGAAAA GCGATGGCGG GTCTGATTGA CGGTATCAGT
CAGAAACGCT TCAAAGATGA AGGGCCGATT CTGTTTATTC ATACCGGCGG CGCGCCTGCG
CTGTTCGCCT ATCATCCCCA CGTTTAG
 
Protein sequence
MPLHNLTRFP RLEFIGAPTP LEYLPRFSDY LGREIFIKRD DVTPMAMGGN KLRKLEFLAA 
DALREGADTL ITAGAIQSNH VRQTAAVAAK LGLHCVALLE NPIGTTAENY LTNGNRLLLD
LFNTQIEMCD ALTDPNAQLE ELATRVEAQG FRPYVIPVGG SNALGALGYV ESALEIAQQC
EGAVNISSVV VASGSAGTHA GLAVGLEHLL PESELIGVTV SRSVADQLPK VVNLQQAIAK
ELELTASAEI LLWDDYFAPG YGVPNDEGME AVKLLARFEG ILLDPVYTGK AMAGLIDGIS
QKRFKDEGPI LFIHTGGAPA LFAYHPHV