Gene EcSMS35_0755 details


Gene Information

Locus tag: EcSMS35_0755
Symbol: cydA
ID: 6146606
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 764041
End bp: 765609
Gene Length: 1569 bp
Protein Length: 522 aa
Translation table: 11
GC content: 53%
IMG OID: 641615644
Product: cytochrome d ubiquinol oxidase, subunit I
Protein accession: YP_001742843
Protein GI: 170681426
COG category: [C] Energy production and conversion
COG ID: [COG1271] Cytochrome bd-type quinol oxidase, subunit 1
TIGRFAM ID:
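
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record; it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 764041
end_bp = 765609
gene_length_bp = 1569
protein_length_aa = 522

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated.
codons = span_bp // 3
expected_protein_length = codons - 1

print(span_bp == gene_length_bp)                      # coordinates agree with the gene length
print(span_bp % 3 == 0)                               # length is a whole number of codons
print(expected_protein_length == protein_length_aa)   # codons minus stop agree with the protein length
```

Under these assumptions, all three checks agree with the values listed in the record.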


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0332587
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTAGATA TAGTCGAACT GTCGCGCTTA CAGTTTGCCT TGACCGCGAT GTACCACTTC 
CTTTTTGTGC CACTGACGCT CGGTATGGCG TTCCTGCTGG CCATTATGGA AACGGTCTAC
GTCCTTTCCG GCAAACAGAT TTATAAAGAT ATGACCAAGT TCTGGGGCAA GTTGTTTGGT
ATCAACTTCG CTCTGGGTGT GGCTACCGGT CTGACCATGG AGTTCCAGTT CGGGACTAAC
TGGTCTTACT ATTCCCACTA TGTAGGGGAT ATCTTCGGTG CGCCGCTGGC AATCGAAGGT
CTAATGGCCT TCTTCCTCGA ATCCACCTTT GTAGGTCTGT TCTTCTTCGG TTGGGATCGT
TTGGGTAAAG TTCAGCATAT GTGTGTCACC TGGCTGGTGG CGCTCGGTTC AAACCTGTCC
GCACTGTGGA TTCTGGTTGC GAACGGCTGG ATGCAAAACC CAATCGCGTC CGATTTCAAC
TTTGAAACTA TGCGTATGGA GATGGTGAGC TTCTCCGAGC TGGTGCTTAA CCCGGTTGCT
CAGGTGAAAT TCGTTCACAC TGTAGCGTCT GGTTATGTGA CTGGCGCGAT GTTCATCCTC
GGTATCAGCG CATGGTATAT GCTGAAAGGT CGTGACTTCG CCTTCGCTAA ACGCTCCTTT
GCTATCGCTG CCAGCTTCGG TATGGCTGCC GTTCTGTCTG TTATTGTTCT GGGTGATGAA
TCTGGTTACG AAATGGGCGA CGTGCAGAAA ACCAAACTGG CTGCTATTGA AGCCGAGTGG
GAAACGCAAC CTGCGCCTGC TGCCTTTACT CTGTTCGGCA TTCCTGATCA GGAAGAGGAG
ACGAACAAAT TTGCGATCCA GATCCCTTAC GCGCTGGGCA TCATTGCAAC GCGTTCCGTG
GATACTCCAG TTATCGGCCT GAAAGAACTG ATGGTGCAGC ATGAAGAACG CATTCGTAAC
GGGATGAAGG CGTACTCTCT GCTCGAGCAA CTGCGTTCTG GTTCTACCGA CCAGGCGGTT
CGTGACCAGT TCAATAGCAT GAAGAAAGAC CTCGGTTACG GTCTGCTGCT GAAACGCTAT
ACGCCAAACG TGGCTGATGC GACTGAAGCG CAGATTCAAC AGGCAACCAA AGACTCCATT
CCGCGTGTAG CGCCGCTGTA CTTCGCGTTC CGTATCATGG TGGCGTGTGG CTTCCTGCTG
CTGGCAATCA TCGCGCTCTC TTTCTGGAGT GTCATCCGCA ACCGCATTGG CGAGAAAAAA
TGGCTGCTGC GCGCCGCGCT GTACGGTATT CCGCTGCCGT GGATTGCTGT AGAAGCGGGC
TGGTTTGTGG CTGAATATGG CCGCCAACCG TGGGCTATCG GTGAAGTGCT ACCGACAGCT
GTGGCGAACT CGTCACTGAC CGCAGGCGAT CTCATCTTCT CAATGGTGCT GATTTGCGGC
CTGTATACCC TGTTCCTGGT GGCAGAATTG TTCTTAATGT TCAAGTTTGC ACGCCTCGGC
CCAAGCAGCC TGAAAACCGG TCGCTATCAC TTTGAGCAGT CTTCCACGAC TACTCAGCCG
GCACGCTAA
 
Protein sequence
MLDIVELSRL QFALTAMYHF LFVPLTLGMA FLLAIMETVY VLSGKQIYKD MTKFWGKLFG 
INFALGVATG LTMEFQFGTN WSYYSHYVGD IFGAPLAIEG LMAFFLESTF VGLFFFGWDR
LGKVQHMCVT WLVALGSNLS ALWILVANGW MQNPIASDFN FETMRMEMVS FSELVLNPVA
QVKFVHTVAS GYVTGAMFIL GISAWYMLKG RDFAFAKRSF AIAASFGMAA VLSVIVLGDE
SGYEMGDVQK TKLAAIEAEW ETQPAPAAFT LFGIPDQEEE TNKFAIQIPY ALGIIATRSV
DTPVIGLKEL MVQHEERIRN GMKAYSLLEQ LRSGSTDQAV RDQFNSMKKD LGYGLLLKRY
TPNVADATEA QIQQATKDSI PRVAPLYFAF RIMVACGFLL LAIIALSFWS VIRNRIGEKK
WLLRAALYGI PLPWIAVEAG WFVAEYGRQP WAIGEVLPTA VANSSLTAGD LIFSMVLICG
LYTLFLVAEL FLMFKFARLG PSSLKTGRYH FEQSSTTTQP AR
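
The relationship between the gene sequence and the protein sequence can be illustrated by translating a few codons. The sketch below is an illustration rather than the tool used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which does not matter here because the gene begins with ATG). The helper name `translate` and the constants are hypothetical, and only the first and last lines of the gene sequence are used.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order; "*" marks a stop codon.
# Table 11 shares these assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from this record.
first_line = "ATGTTAGATA TAGTCGAACT GTCGCGCTTA CAGTTTGCCT TGACCGCGAT GTACCACTTC"
last_line = "GCACGCTAA"

print(translate(first_line))  # MLDIVELSRLQFALTAMYHF
print(translate(last_line))   # AR
```

The two outputs match the first 20 residues and the final two residues of the protein sequence listed above; the trailing TAA is the stop codon.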