Gene ECH74115_1049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1049 
SymbolcydD 
ID6969908 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1063629 
End bp1065395 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content53% 
IMG OID643385061 
Productcysteine/glutathione ABC transporter membrane/ATP-binding component 
Protein accessionYP_002269560 
Protein GI209400891 
COG category[C] Energy production and conversion
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4988] ABC-type transport system involved in cytochrome bd biosynthesis, ATPase and permease components 
TIGRFAM ID[TIGR02857] thiol reductant ABC exporter, CydD subunit 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0961899 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones67 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAAT CTCGTCAAAA AGAGTTAACC CGCTGGTTAA AACAGCAAAG CGTCATCTCC 
CAACGTTGGC TGAATATTTC TCGTCTGCTG GGCTTTGTGA GCGGCATATT GATCATTGCC
CAGGCATGGT TCATGGCGCG AATTCTGCAA CATATGATTA TGGAGAATAT TCCCCGTGAA
GCCCTGCTGC TTCCCTTTAC GTTACTGGTT CTGACCTTTG TACTGCGCGC ATGGGTGGTC
TGGTTACGCG AACGGGTGGG TTATCACGCC GGGCAGCATA TCCGCTTTGC CATCCGCCGT
CAGGTTCTCG ACCGTCTGCA ACAAGCAGGG CCAGCGTGGA TTCAGGGTAA ACCTGCGGGG
AGCTGGGCGA CGCTGGTACT CGAGCAAATT GACGATATGC ATGATTACTA TGCACGCTAC
CTGCCGCAAA TGGCGCTGGC AGTGTCGGTG CCGTTGCTGA TTGTGGTGGC GATCTTCCCC
TCTAACTGGG CTGCGGCGCT CATTCTGCTG GGCACTGCAC CGTTAATTCC GTTGTTTATG
GCGCTGGTTG GGATGGGGGC TGCCGATGCT AACCGACGTA ACTTTCTCGC TCTTGCTCGC
TTAAGTGGGC ATTTCCTCGA TCGCCTGCGC GGCATGGAAA CATTGCGTAT TTTTGGTCGT
GGTGAAGCTG AAATTGAAAG TATTCGTTCT GCTTCGGAAG ATTTCCGCCA ACGGACAATG
GAAGTGCTAC GGCTGGCGTT TTTATCCTCC GGCATTCTCG AATTTTTTAC CTCGCTGTCA
ATTGCTCTGG TGGCGGTCTA CTTTGGTTTT TCCTATCTCG GCGAGCTGGA TTTTGGTCAC
TACGATACTG GCGTGACGCT GGCTGCGGGT TTTCTGGCCC TGATCCTTGC GCCAGAGTTT
TTCCAGCCAT TACGCGATCT CGGTACGTTT TATCATGCTA AAGCCCAGGC TGTTGGTGCA
GCTGACAGTC TGAAAACGTT TATGGAAACC CCGCTCGCCC ATCCGCAGCG CGGTGAGACG
GAATTAGCAT CAACCGATCC GCTGACCATT GAGGCCGAGG ATCTGTTTAT CACGTCGCCG
GAAGGTAAAA CGCTGGCCGG ACCGCTGAAC TTTACTTTGC CAGCAGGCCA ACGTGCAGTG
TTGGTTGGTC GCAGCGGTTC AGGTAAAAGC TCACTGCTGA ACGCGCTTTC TGGTTTTCTC
TCATATCAGG GATCATTACG AATCAACGGG ATAGAATTAC GCGATTTATC ACCAGAATCA
TGGCGTAAAC ATCTCTCCTG GGTTGGTCAA AACCCACAAT TACCGGCAGC AACATTACGG
GATAACGTAC TACTGGCGCG ACCTGATGCC AGCGAACAAG AATTACAAGC AGCGCTGGAT
AACGCCTGGG TCAGCGAGTT TCTACCGCTC CTGCCACAAG GCGTTGATAC GCCTGTTGGC
GACCAGGCTG CCCGCCTTTC CGTGGGGCAG GCGCAGCGCG TGGCGGTGGC CCGTGCGTTA
CTAAATCCCT GTTCGCTATT ACTGTTGGAT GAACCCGCTG CCAGCCTTGA TGCTCACAGT
GAACAGCGCG TAATGGAGGC GCTGAATGCC GCCTCTCTGC GCCAGACAAC GTTAATGGTC
ACCCACCAGT TAGAAGATCT TGCTGACTGG GATGTCATTT GGGTTATGCA GGATGGCCGG
ATTATTGAGC AAGGACGTTA CGCGGAATTA AGTGTGGCTG GTGGCCCATT CGCCACATTA
CTGGCCCATC GTCAGGAGGA GATTTAA
 
Protein sequence
MNKSRQKELT RWLKQQSVIS QRWLNISRLL GFVSGILIIA QAWFMARILQ HMIMENIPRE 
ALLLPFTLLV LTFVLRAWVV WLRERVGYHA GQHIRFAIRR QVLDRLQQAG PAWIQGKPAG
SWATLVLEQI DDMHDYYARY LPQMALAVSV PLLIVVAIFP SNWAAALILL GTAPLIPLFM
ALVGMGAADA NRRNFLALAR LSGHFLDRLR GMETLRIFGR GEAEIESIRS ASEDFRQRTM
EVLRLAFLSS GILEFFTSLS IALVAVYFGF SYLGELDFGH YDTGVTLAAG FLALILAPEF
FQPLRDLGTF YHAKAQAVGA ADSLKTFMET PLAHPQRGET ELASTDPLTI EAEDLFITSP
EGKTLAGPLN FTLPAGQRAV LVGRSGSGKS SLLNALSGFL SYQGSLRING IELRDLSPES
WRKHLSWVGQ NPQLPAATLR DNVLLARPDA SEQELQAALD NAWVSEFLPL LPQGVDTPVG
DQAARLSVGQ AQRVAVARAL LNPCSLLLLD EPAASLDAHS EQRVMEALNA ASLRQTTLMV
THQLEDLADW DVIWVMQDGR IIEQGRYAEL SVAGGPFATL LAHRQEEI