Gene EcSMS35_0472 details


Gene Information

Locus tag: EcSMS35_0472
Symbol: cyoA
ID: 6142834
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 478965
End bp: 479912
Gene Length: 948 bp
Protein Length: 315 aa
Translation table: 11
GC content: 51%
IMG OID: 641615366
Product: cytochrome o ubiquinol oxidase subunit II
Protein accession: YP_001742573
Protein GI: 170683800
COG category: [C] Energy production and conversion
COG ID: [COG1622] Heme/copper-type cytochrome/quinol oxidases, subunit 2
TIGRFAM ID: [TIGR01433] cytochrome o ubiquinol oxidase subunit II
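
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: the variable names are not part of the record, and it assumes the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon.

```python
# Values copied from the record; variable names are illustrative.
start_bp = 478965
end_bp = 479912

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the protein length.
stop_codon_nt = 3
protein_length_aa = (gene_length_bp - stop_codon_nt) // 3

print(gene_length_bp, protein_length_aa)  # 948 315
```

Under those assumptions the result agrees with the Gene Length (948 bp) and Protein Length (315 aa) fields.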


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000304701
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGACTCA GGAAATACAA TAAAAGTTTG GGATGGTTGT CATTATTTGC AGGCACTGTA 
TTGCTCAGTG GCTGTAATTC TGCGCTGTTA GATCCCAAAG GACAGATTGG TCTGGAGCAA
CGTTCACTGA TACTGACGGC ATTTGGCCTG ATGTTGATTG TCGTTATTCC CGCAATCTTG
ATGGCTGTTG GTTTCGCCTG GAAGTACCGT GCGAGCAATA AAGATGCTAA GTACAGCCCG
AACTGGTCAC ACTCCAATAA AGTGGAAGCT GTGGTCTGGA CGGTACCTAT CTTAATCATC
ATCTTCCTTG CGGTACTGAC CTGGAAAACC ACTCACGCTC TTGAGCCTAG CAAGCCGCTG
GCACACGACG AGAAGCCCAT TACCATCGAA GTGGTTTCCA TGGACTGGAA ATGGTTCTTC
ATCTACCCGG AACAGGGCAT TGCTACCGTG AATGAAATCG CTTTCCCGGC GAACACTCCG
GTGTACTTCA AAGTGACCTC CAACTCCGTG ATGAACTCCT TCTTCATTCC GCGTCTGGGT
AGCCAGATTT ATGCCATGGC CGGTATGCAG ACTCGCCTGC ATCTGATCGC CAACGAACCC
GGTACTTATG ACGGTATCTC CGCCAGCTAC AGCGGCCCGG GCTTCTCAGG CATGAAGTTC
AAAGCTATTG CAACACCGGA TCGCGCCGCA TTCGACCAAT GGGTCGCAAA AGCGAAACAG
TCGCCGAACA CCATGTCTGA CATGGCAGCG TTCGAAAAAC TGGCCGCGCC TAGCGAATAC
AACCAGGTGG AATATTTCTC CAACGTGAAA CCAGACTTGT TTGCTGATGT GATTAACAAG
TTTATGGCTC ACGGTAAGAG CATGGACATG ACCCAGCCAG AAGGTGAGCA CAGTGCACAC
GAAGGTATGG AAGGCATGGA CATGAGCCAC GCGGAATCCG CCCATTAA
 
Protein sequence
MRLRKYNKSL GWLSLFAGTV LLSGCNSALL DPKGQIGLEQ RSLILTAFGL MLIVVIPAIL 
MAVGFAWKYR ASNKDAKYSP NWSHSNKVEA VVWTVPILII IFLAVLTWKT THALEPSKPL
AHDEKPITIE VVSMDWKWFF IYPEQGIATV NEIAFPANTP VYFKVTSNSV MNSFFIPRLG
SQIYAMAGMQ TRLHLIANEP GTYDGISASY SGPGFSGMKF KAIATPDRAA FDQWVAKAKQ
SPNTMSDMAA FEKLAAPSEY NQVEYFSNVK PDLFADVINK FMAHGKSMDM TQPEGEHSAH
EGMEGMDMSH AESAH
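
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that relationship, not the pipeline used to produce this record. The function name `translate` and the compact table construction are illustrative. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in their permitted start codons), and it is applied only to the first and last lines of the gene sequence above.

```python
# Build the codon table from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base spacing used in the record
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAGACTCA GGAAATACAA TAAAAGTTTG GGATGGTTGT CATTATTTGC AGGCACTGTA"
last_line = "GAAGGTATGG AAGGCATGGA CATGAGCCAC GCGGAATCCG CCCATTAA"

print(translate(first_line))  # MRLRKYNKSLGWLSLFAGTV
print(translate(last_line))   # EGMEGMDMSHAESAH
```

Both fragments match the start and end of the protein sequence listed above. The last line of the gene sequence begins on a codon boundary because the 15 preceding lines hold 900 bases.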