Gene EcolC_3201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3201 
Symbol 
ID6066653 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3508037 
End bp3508984 
Gene Length948 bp 
Protein Length315 aa 
Translation table11 
GC content51% 
IMG OID641602616 
Productcytochrome o ubiquinol oxidase subunit II 
Protein accessionYP_001726150 
Protein GI170021196 
COG category[C] Energy production and conversion 
COG ID[COG1622] Heme/copper-type cytochrome/quinol oxidases, subunit 2 
TIGRFAM ID[TIGR01433] cytochrome o ubiquinol oxidase subunit II 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000154009 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000824271 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGACTCA GGAAATACAA TAAAAGTTTG GGATGGTTGT CATTATTTGC AGGCACTGTA 
TTGCTCAGTG GCTGTAATTC TGCGCTGTTA GATCCCAAAG GACAGATTGG TCTGGAGCAA
CGTTCACTGA TACTGACGGC ATTTGGCCTG ATGTTGATTG TCGTTATTCC CGCAATCTTG
ATGGCTGTTG GTTTCGCCTG GAAGTATCGT GCGAGCAATA AAGATGCTAA GTACAGCCCG
AACTGGTCAC ACTCCAATAA AGTGGAAGCT GTGGTCTGGA CGGTACCTAT CTTAATCATC
ATCTTCCTTG CGGTACTGAC CTGGAAAACC ACTCACGCTC TTGAGCCTAG CAAGCCGCTG
GCACACGACG AGAAGCCCAT TACCATCGAA GTGGTTTCCA TGGACTGGAA ATGGTTCTTC
ATCTACCCGG AACAGGGCAT TGCTACCGTG AATGAAATCG CTTTCCCGGC GAACACTCCG
GTGTACTTCA AAGTGACCTC CAACTCCGTG ATGAACTCCT TCTTCATTCC GCGTCTGGGT
AGCCAGATTT ATGCCATGGC CGGTATGCAG ACTCGCCTGC ATCTGATCGC CAACGAACCC
GGTACTTATG ACGGTATCTC CGCCAGCTAC AGCGGGCCGG GCTTCTCAGG CATGAAGTTC
AAAGCTATTG CAACACCGGA TCGCGCCGAA TTCGACCAAT GGGTCGCAAA AGCGAAACAG
TCGCCGAACT CCATGTCTGA CATGGCAGCG TTCGAAAAAC TGGCCGCGCC TAGCGAATAC
AACCAGGTGG AATATTTCTC CAACGTGAAA CCAGACTTGT TTGCCGATGT GATTAACAAG
TTTATGGCTC ACGGTAAGAG CATGGACATG ACCCAGCCAG AAGGTGAGCA CAGCGCACAC
GAAGGTATGG AAGGCATGGA CATGAGCCAC GCGGAATCCG CCCATTAA
 
Protein sequence
MRLRKYNKSL GWLSLFAGTV LLSGCNSALL DPKGQIGLEQ RSLILTAFGL MLIVVIPAIL 
MAVGFAWKYR ASNKDAKYSP NWSHSNKVEA VVWTVPILII IFLAVLTWKT THALEPSKPL
AHDEKPITIE VVSMDWKWFF IYPEQGIATV NEIAFPANTP VYFKVTSNSV MNSFFIPRLG
SQIYAMAGMQ TRLHLIANEP GTYDGISASY SGPGFSGMKF KAIATPDRAE FDQWVAKAKQ
SPNSMSDMAA FEKLAAPSEY NQVEYFSNVK PDLFADVINK FMAHGKSMDM TQPEGEHSAH
EGMEGMDMSH AESAH