Gene EcSMS35_1263 details


Gene Information

Locus tag: EcSMS35_1263
Symbol: dcyD
ID: 6144001
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1260037
End bp: 1261023
Gene Length: 987 bp
Protein Length: 328 aa
Translation table: 11
GC content: 54%
IMG OID: 641616141
Product: D-cysteine desulfhydrase
Protein accession: YP_001743324
Protein GI: 170683740
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2515] 1-aminocyclopropane-1-carboxylate deaminase
TIGRFAM ID: [TIGR01275] pyridoxal phosphate-dependent enzymes, D-cysteine desulfhydrase family
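
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1260037
end_bp = 1261023
gene_length_bp = 987
protein_length_aa = 328

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon, so N codons
# encode N - 1 amino acid residues.
codons, remainder = divmod(gene_length_bp, 3)

print(span == gene_length_bp)           # coordinates agree with the gene length
print(remainder == 0)                   # length is a whole number of codons
print(codons - 1 == protein_length_aa)  # codons minus the stop give the protein length
```

Under these assumptions all three checks hold: 1261023 - 1260037 + 1 = 987 bp, and 987 / 3 = 329 codons, one of which is the stop codon.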


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0547515
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value: 0.0193238
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCACTGC ATAATTTAAC CCGTTTTCCA CGGTTGGAGT TTATCGGCGC GCCAACACCG 
CTCGAATATC TGCCGCGCTT TTCTGATTAT CTTGGACGGG AAATTTTCAT CAAACGGGAT
GACGTCACCC CCATGGCAAT GGGCGGCAAT AAATTACGTA AGCTGGAATT TCTCGCAGCA
GATGCTCTGC GCGAAGGTGC CGATACGCTG ATTACTGCCG GCGCGATCCA GTCTAACCAT
GTGCGCCAGA CTGCCGCAGT TGCGGCGAAA CTCGGTCTGC ACTGCGTGGC GCTGCTGGAA
AATCCTATTG GCACAACCGC AGAAAACTAT TTAACCAACG GCAATCGTTT GTTGCTGGAT
CTGTTCAATA CCCAGATTGA AATGTGTGAC GCACTGACCG ATCCCAATGC CCAACTGGAA
GAGCTGGCGA CGCGAGTCGA AGCACAAGGC TTTCGCCCGT ATGTCATTCC GGTTGGCGGT
TCTAATGCTC TGGGCGCGCT GGGTTATGTG GAGAGTGCGC TGGAAATCGC GCAACAGTGT
GAAGGGGCGG TTAATATTTC GTCGGTGGTA GTCGCATCGG GCAGTGCCGG AACTCACGCC
GGACTGGCTG TTGGGCTGGA ACACCTTATG CCTGAAAGCG AACTGATTGG CGTTACCGTG
TCGCGTTCCG TTGCCGATCA ATTGCCGAAA GTGGTTAACC TACAACAGGC GATTGCGAAA
GAACTGGAGC TGACCGCATC AGCGGAAATT TTACTCTGGG ATGACTATTT TGCACCTGGC
TACGGCGTGC CGAACGATGA AGGCATGGAA GCAGTGAAAT TGCTGGCGCG GCTGGAAGGC
ATTCTGCTTG ATCCTGTGTA TACCGGAAAA GCGATGGCGG GGCTGATTGA CGGTATCAGT
CAGAAACGCT TCAAAGATGA AGGGCCGATT CTGTTTATTC ATACCGGCGG CGCGCCTGCG
CTGTTCGCCT ATCATCCCCA CGTTTAG
 
Protein sequence
MPLHNLTRFP RLEFIGAPTP LEYLPRFSDY LGREIFIKRD DVTPMAMGGN KLRKLEFLAA 
DALREGADTL ITAGAIQSNH VRQTAAVAAK LGLHCVALLE NPIGTTAENY LTNGNRLLLD
LFNTQIEMCD ALTDPNAQLE ELATRVEAQG FRPYVIPVGG SNALGALGYV ESALEIAQQC
EGAVNISSVV VASGSAGTHA GLAVGLEHLM PESELIGVTV SRSVADQLPK VVNLQQAIAK
ELELTASAEI LLWDDYFAPG YGVPNDEGME AVKLLARLEG ILLDPVYTGK AMAGLIDGIS
QKRFKDEGPI LFIHTGGAPA LFAYHPHV
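
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the database's own pipeline: it builds the codon table from the standard codon-to-amino-acid assignments, which translation table 11 is understood to share (table 11 differs mainly in the start codons it permits). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. The GC helper is included so the listed GC content can be recomputed from the full sequence.

```python
from itertools import product

# Codon table in NCBI order: first base varies slowest, third fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence above.
first_line = "ATGCCACTGC ATAATTTAAC CCGTTTTCCA CGGTTGGAGT TTATCGGCGC GCCAACACCG"
last_line = "CTGTTCGCCT ATCATCCCCA CGTTTAG"

print(translate(first_line))  # MPLHNLTRFPRLEFIGAPTP
print(translate(last_line))   # LFAYHPHV
```

The two outputs match the first 20 and last 8 residues of the protein sequence. The last line can be translated on its own only because the 16 preceding lines hold 960 bases, a multiple of three, so the reading frame is preserved.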