Gene EcHS_A0785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0785 
SymbolcydA 
ID5593430 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp798311 
End bp799879 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content53% 
IMG OID640919959 
Productcytochrome d ubiquinol oxidase, subunit I 
Protein accessionYP_001457533 
Protein GI157160215 
COG category[C] Energy production and conversion 
COG ID[COG1271] Cytochrome bd-type quinol oxidase, subunit 1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value3.21454e-05 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAGATA TAGTCGAACT GTCGCGCTTA CAGTTTGCCT TGACCGCGAT GTACCACTTC 
CTTTTTGTGC CACTGACGCT CGGTATGGCG TTCCTGCTGG CCATTATGGA AACGGTCTAC
GTCCTCTCCG GCAAACAGAT TTATAAAGAT ATGACCAAGT TCTGGGGCAA GTTGTTTGGT
ATCAACTTCG CTCTGGGTGT GGCTACCGGT CTGACCATGG AGTTCCAGTT CGGGACTAAC
TGGTCTTACT ATTCCCACTA TGTAGGGGAT ATCTTCGGTG CGCCGCTGGC AATCGAAGGT
CTGATGGCCT TCTTCCTCGA ATCCACCTTT GTAGGTCTGT TCTTCTTCGG TTGGGATCGT
CTGGGTAAAG TTCAGCATAT GTGTGTCACC TGGCTGGTGG CGCTCGGTTC AAACCTGTCC
GCACTGTGGA TTCTGGTTGC GAACGGCTGG ATGCAAAACC CAATCGCGTC CGATTTCAAC
TTTGAAACTA TGCGTATGGA GATGGTGAGC TTCTCCGAGC TGGTGCTTAA CCCGGTTGCT
CAGGTGAAAT TCGTTCACAC TGTAGCGTCT GGTTATGTGA CTGGCGCGAT GTTCATCCTC
GGTATCAGCG CATGGTATAT GCTGAAAGGT CGTGACTTCG CCTTCGCTAA ACGCTCCTTT
GCTATCGCTG CCAGCTTCGG TATGGCTGCT GTTCTGTCTG TTATTGTTCT GGGTGATGAA
TCCGGCTACG AAATGGGCGA CGTGCAGAAA ACCAAACTGG CTGCTATTGA AGCCGAGTGG
GAAACGCAAC CTGCGCCTGC TGCCTTTACT CTGTTCGGCA TTCCTGATCA GGAAGAGGAG
ACGAACAAAT TTGCGATTCA GATCCCTTAC GCACTGGGCA TCATTGCAAC GCGTTCCGTG
GATACCCCGG TTATCGGCCT GAAAGAGCTG ATGGTGCAGC ATGAAGAACG CATTCGTAAC
GGGATGAAGG CGTACTCTCT GCTCGAACAA CTGCGTTCTG GTTCTACCGA CCAGGCGGTT
CGTGACCAGT TCAATAGCAT GAAGAAAGAC CTCGGTTACG GTCTGCTGCT GAAACGCTAT
ACGCCAAACG TGGCTGATGC GACTGAAGCG CAGATTCAAC AGGCAACCAA AGACTCCATC
CCGCGTGTAG CGCCGCTGTA CTTTGCGTTC CGTATCATGG TGGCGTGTGG CTTCCTGCTT
CTGGCAATCA TCGCGCTCTC TTTCTGGAGT GTCATCCGCA ACCGCATTGG CGAGAAAAAA
TGGCTTCTGC GCGCCGCGCT GTACGGTATT CCGCTGCCGT GGATTGCTGT AGAAGCGGGC
TGGTTCGTGG CTGAATATGG CCGCCAACCG TGGGCTATCG GTGAAGTGCT GCCGACAGCT
GTGGCGAACT CGTCACTGAC CGCAGGCGAT CTCATCTTCT CAATGGTGCT GATTTGCGGC
CTGTATACCC TGTTCCTGGT GGCAGAATTG TTCTTAATGT TCAAGTTTGC ACGCCTCGGC
CCAAGCAGCC TGAAAACCGG TCGCTATCAC TTTGAGCAGT CTTCCACGAC TACTCAGCCG
GCACGCTAA
 
Protein sequence
MLDIVELSRL QFALTAMYHF LFVPLTLGMA FLLAIMETVY VLSGKQIYKD MTKFWGKLFG 
INFALGVATG LTMEFQFGTN WSYYSHYVGD IFGAPLAIEG LMAFFLESTF VGLFFFGWDR
LGKVQHMCVT WLVALGSNLS ALWILVANGW MQNPIASDFN FETMRMEMVS FSELVLNPVA
QVKFVHTVAS GYVTGAMFIL GISAWYMLKG RDFAFAKRSF AIAASFGMAA VLSVIVLGDE
SGYEMGDVQK TKLAAIEAEW ETQPAPAAFT LFGIPDQEEE TNKFAIQIPY ALGIIATRSV
DTPVIGLKEL MVQHEERIRN GMKAYSLLEQ LRSGSTDQAV RDQFNSMKKD LGYGLLLKRY
TPNVADATEA QIQQATKDSI PRVAPLYFAF RIMVACGFLL LAIIALSFWS VIRNRIGEKK
WLLRAALYGI PLPWIAVEAG WFVAEYGRQP WAIGEVLPTA VANSSLTAGD LIFSMVLICG
LYTLFLVAEL FLMFKFARLG PSSLKTGRYH FEQSSTTTQP AR