Gene EcolC_1720 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1720 
Symbol 
ID6067313 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1917211 
End bp1918197 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content54% 
IMG OID641601132 
ProductD-cysteine desulfhydrase 
Protein accessionYP_001724697 
Protein GI170019743 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2515] 1-aminocyclopropane-1-carboxylate deaminase 
TIGRFAM ID[TIGR01275] pyridoxal phosphate-dependent enzymes, D-cysteine desulfhydrase family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.628209 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCACTGC ATAATTTAAC CCGTTTTCCA CGGCTGGAGT TTATCGGCGC GCCAACGCCG 
CTCGAATATC TGCCGCGCTT TTCTGATTAT CTTGGACGGG AAATTTTCAT CAAACGGGAT
GACGTCACAC CCATGGCAAT GGGCGGCAAT AAATTACGTA AGCTGGAATT TCTCGCGGCA
GATGCTCTGC GTGAAGGTGC CGATACGCTG ATTACTGCCG GGGCGATCCA GTCTAACCAT
GTGCGCCAGA CTGCCGCAGT TGCGGCGAAA CTCGGTCTGC ACTGCGTGGC GCTGCTGGAA
AATCCTATTG GCACAACCGC AGAAAACTAT TTAACCAACG GCAATCGTTT GTTGCTGGAT
CTGTTCAATA CCCAGATTGA AATGTGCGAC GCACTGACCG ATCCCAATGC CCAACTGGAA
GAGCTGGCGA CGCGAGTCGA AGCACAAGGC TTTCGCCCGT ATGTCATTCC GGTTGGCGGT
TCTAATGCTC TGGGCGCGCT GGGTTATGTG GAGAGTGCGC TGGAAATTGC GCAACAGTGT
GAAGGGGCGG TTAATATTTC GTCGGTGGTG GTCGCATCGG GCAGTGCCGG AACTCACGCC
GGACTGGCTG TTGGGCTGGA ACACCTGATG CCTGAAAGCG AACTGATTGG CGTGACCGTG
TCGCGTTCCG TTGCCGATCA ATTGCCGAAA GTGGTTAACC TACAACAGGC GATTGCGAAA
GAACTGGAGC TGACCGCATC AGCGGAAATT TTACTCTGGG ATGACTATTT TGCACCTGGC
TACGGCGTGC CGAACGACGA AGGCATGGAA GCAGTGAAAT TGCTGGCGCG GCTGGAAGGC
ATTCTGCTTG ATCCTGTGTA TACCGGAAAA GCGATGGCGG GTCTGATTGA CGGTATCAGT
CAGAAACGTT TCAAAGATGA AGGGCCGATT CTGTTTATTC ATACCGGCGG TGCGCCTGCG
CTGTTCGCCT ATCATCCCCA CGTTTAG
 
Protein sequence
MPLHNLTRFP RLEFIGAPTP LEYLPRFSDY LGREIFIKRD DVTPMAMGGN KLRKLEFLAA 
DALREGADTL ITAGAIQSNH VRQTAAVAAK LGLHCVALLE NPIGTTAENY LTNGNRLLLD
LFNTQIEMCD ALTDPNAQLE ELATRVEAQG FRPYVIPVGG SNALGALGYV ESALEIAQQC
EGAVNISSVV VASGSAGTHA GLAVGLEHLM PESELIGVTV SRSVADQLPK VVNLQQAIAK
ELELTASAEI LLWDDYFAPG YGVPNDEGME AVKLLARLEG ILLDPVYTGK AMAGLIDGIS
QKRFKDEGPI LFIHTGGAPA LFAYHPHV