Gene EcSMS35_0471 details

Gene Information

Locus tag: EcSMS35_0471
Symbol: cyoB
ID: 6144798
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 476952
End bp: 478943
Gene Length: 1992 bp
Protein Length: 663 aa
Translation table: 11
GC content: 53%
IMG OID: 641615365
Product: cytochrome o ubiquinol oxidase, subunit I
Protein accession: YP_001742572
Protein GI: 170683544
COG category: [C] Energy production and conversion
COG ID: [COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1
TIGRFAM ID: [TIGR02843] cytochrome o ubiquinol oxidase, subunit I
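
The start/end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 476952
end_bp = 478943
gene_length_bp = 1992
protein_length_aa = 663

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS holds one codon per residue plus one stop codon.
expected_cds = 3 * (protein_length_aa + 1)

print(span == gene_length_bp)          # coordinate span vs. reported gene length
print(expected_cds == gene_length_bp)  # codon count vs. reported gene length
```

Both comparisons hold for this record: 478943 - 476952 + 1 = 1992, and 3 × (663 + 1) = 1992.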


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000517017
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCGGAA AATTATCACT TGATGCAGTC CCGTTCCATG AACCTATCGT CATGGTTACG 
ATCGCTGGCA TTATTTTGGG AGGTCTGGCG CTCGTTGGCC TGATCACTTA CTTCGGTAAG
TGGACCTACC TGTGGAAAGA GTGGCTGACC TCCGTCGACC ATAAACGCCT AGGTATCATG
TATATCATCG TGGCGATTGT GATGTTGCTG CGTGGTTTTG CTGACGCTAT TATGATGCGT
AGCCAGCAGG CCCTTGCCTC GGCGGGCGAA GCGGGCTTCC TGCCTCCTCA CCACTACGAT
CAGATCTTCA CCGCGCACGG CGTGATTATG ATCTTCTTCG TGGCGATGCC TTTCGTTATC
GGTCTGATGA ACCTGGTGGT TCCGCTGCAG ATCGGCGCGC GTGACGTTGC GTTCCCGTTC
CTCAACAACT TAAGCTTCTG GTTTACCGTT GTTGGTGTGA TTCTGGTCAA CGTTTCTCTC
GGCGTGGGCG AATTTGCGCA GACCGGCTGG CTGGCCTATC CACCGCTATC GGGAATAGAG
TACAGTCCGG GAGTCGGTGT CGATTACTGG ATATGGAGTC TCCAGCTATC CGGTATAGGT
ACGACGCTTA CCGGTATCAA CTTCTTCGTT ACCATTCTGA AGATGCGCGC ACCGGGCATG
ACCATGTTCA AGATGCCAGT ATTTACCTGG GCATCACTGT GCGCGAACGT ACTGATTATT
GCTTCCTTCC CAATTCTGAC GGTTACCGTT GCGTTGTTGA CCCTGGATCG CTATCTGGGC
ACCCATTTCT TTACCAACGA TATGGGTGGC AACATGATGA TGTACATCAA CCTGATTTGG
GCCTGGGGCC ACCCGGAAGT TTACATCCTG ATCCTGCCTG TTTTCGGTGT GTTCTCCGAA
ATTGCGGCAA CCTTCTCGCG TAAACGTCTG TTTGGTTATA CCTCGCTGGT ATGGGCAACC
GTCTGTATCA CCGTGCTGTC GTTCATCGTC TGGCTGCACC ACTTCTTTAC GATGGGTGCG
GGCGCGAACG TAAACGCCTT CTTTGGTATC ACCACCATGA TTATCGCCAT CCCGACCGGG
GTGAAGATCT TCAACTGGCT GTTCACCATG TATCAGGGCC GCATCGTGTT CCATTCTGCG
ATGCTGTGGA CCATCGGTTT TATCGTCACC TTCTCGGTGG GCGGGATGAC TGGCGTGCTG
CTGGCAGTAC CGGGCGCGGA CTTCGTTCTG CATAACAGCC TGTTCCTGAT TGCGCACTTC
CATAACGTGA TCATCGGCGG CGTGGTCTTC GGCTGCTTCG CAGGGATGAC CTACTGGTGG
CCTAAAGCGT TCGGTTTCAA ACTGAACGAA ACCTGGGGTA AACGCGCGTT CTGGTTCTGG
ATCATCGGCT TCTTCGTTGC CTTTATGCCA CTGTATGCGC TGGGCTTCAT GGGCATGACC
CGTCGTTTGA GCCAGCAGAT TGACCCGCAG TTCCACACCA TGCTGATGAT TGCAGCCAGC
GGTGCGGTAC TGATTGCGCT GGGTATTCTC TGCCTCGTTA TTCAGATGTA CGTTTCTATT
CGTGACCGCG ACCAGAACCG TGACCTGACT GGCGACCCGT GGGGTGGCCG TACGCTGGAG
TGGGCAACCT CTTCCCCGCC TCCGTTCTAT AACTTTGCCG TTGTGCCGCA CGTTCACGAA
CGTGATGCAT TCTGGGAAAT GAAAGAGAAA GGCGAAGCGT ACAAAAAGCC TGACCACTAT
GAAGAAATTC ATATGCCGAA AAACAGCGGT GCCGGTATCG TCATTGCGGC TTTCTCCACC
ATCTTCGGTT TCGCCATGAT CTGGCATATC TGGTGGCTGG CGATTGTTGG CTTCGCAGGC
ATGATCATCA CCTGGATCGT GAAAAGCTTC GACGAGGACG TGGATTACTA CGTGCCGGTG
GCAGAAATCG AAAAACTGGA AAACCAGCAT TTCGATGAGA TTACTAAGGC AGGGCTGAAA
AATGGCAACT GA
 
Protein sequence
MFGKLSLDAV PFHEPIVMVT IAGIILGGLA LVGLITYFGK WTYLWKEWLT SVDHKRLGIM 
YIIVAIVMLL RGFADAIMMR SQQALASAGE AGFLPPHHYD QIFTAHGVIM IFFVAMPFVI
GLMNLVVPLQ IGARDVAFPF LNNLSFWFTV VGVILVNVSL GVGEFAQTGW LAYPPLSGIE
YSPGVGVDYW IWSLQLSGIG TTLTGINFFV TILKMRAPGM TMFKMPVFTW ASLCANVLII
ASFPILTVTV ALLTLDRYLG THFFTNDMGG NMMMYINLIW AWGHPEVYIL ILPVFGVFSE
IAATFSRKRL FGYTSLVWAT VCITVLSFIV WLHHFFTMGA GANVNAFFGI TTMIIAIPTG
VKIFNWLFTM YQGRIVFHSA MLWTIGFIVT FSVGGMTGVL LAVPGADFVL HNSLFLIAHF
HNVIIGGVVF GCFAGMTYWW PKAFGFKLNE TWGKRAFWFW IIGFFVAFMP LYALGFMGMT
RRLSQQIDPQ FHTMLMIAAS GAVLIALGIL CLVIQMYVSI RDRDQNRDLT GDPWGGRTLE
WATSSPPPFY NFAVVPHVHE RDAFWEMKEK GEAYKKPDHY EEIHMPKNSG AGIVIAAFST
IFGFAMIWHI WWLAIVGFAG MIITWIVKSF DEDVDYYVPV AEIEKLENQH FDEITKAGLK
NGN