Gene EcolC_3202 details


Gene Information

Locus tag: EcolC_3202
Symbol:
ID: 6066657
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3509006
End bp: 3510997
Gene Length: 1992 bp
Protein Length: 663 aa
Translation table: 11
GC content: 53%
IMG OID: 641602617
Product: cytochrome o ubiquinol oxidase, subunit I
Protein accession: YP_001726151
Protein GI: 170021197
COG category: [C] Energy production and conversion
COG ID: [COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1
TIGRFAM ID: [TIGR02843] cytochrome o ubiquinol oxidase, subunit I
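
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon while the protein length does not; the record does not state either convention. The function names are hypothetical and only serve the illustration.

```python
# Minimal consistency check for the coordinate and length fields.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive; the gene length counts the stop codon, the protein length
# does not.

def span_length(start_bp: int, end_bp: int) -> int:
    """Number of bases covered by an inclusive coordinate pair."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Values copied from the record for EcolC_3202.
    print(span_length(3509006, 3510997))   # compare with "Gene Length"
    print(expected_protein_length(1992))   # compare with "Protein Length"
```

Under these assumptions the span works out to 1992 bp and the expected protein length to 663 aa, matching the values listed in the record.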


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.154912
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000119789
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTTCGGAA AATTATCACT TGATGCAGTC CCGTTCCATG AACCTATCGT CATGGTTACG 
ATCGCTGGCA TTATTTTGGG AGGTCTGGCG CTCGTTGGCC TGATCACTTA CTTCGGTAAG
TGGACCTACC TGTGGAAAGA GTGGCTGACC TCCGTCGACC ATAAACGCCT CGGTATCATG
TATATCATCG TGGCGATTGT GATGTTGCTG CGTGGTTTTG CTGACGCCAT TATGATGCGT
AGCCAGCAGG CTCTTGCCTC GGCGGGTGAA GCGGGCTTCC TGCCACCTCA CCACTACGAT
CAGATCTTCA CCGCGCACGG CGTGATTATG ATCTTCTTCG TGGCGATGCC TTTCGTTATC
GGTCTGATGA ACCTGGTGGT TCCGCTGCAG ATCGGCGCGC GTGACGTTGC GTTCCCGTTC
CTCAACAACT TAAGCTTCTG GTTTACTGTT GTTGGTGTGA TTCTGGTTAA CGTTTCTCTC
GGCGTGGGCG AATTTGCGCA GACCGGCTGG CTGGCCTATC CACCGCTATC GGGAATAGAG
TACAGTCCGG GAGTCGGTGT TGATTACTGG ATATGGAGTC TCCAGCTATC CGGTATAGGT
ACGACGCTTA CCGGTATCAA CTTCTTCGTT ACCATTCTGA AGATGCGCGC ACCGGGCATG
ACCATGTTCA AGATGCCAGT ATTTACCTGG GCATCACTGT GCGCGAACGT ACTGATTATT
GCTTCCTTCC CAATTCTGAC GGTTACCGTC GCGTTGTTGA CCCTGGATCG CTATCTGGGC
ACCCATTTCT TTACCAACGA TATGGGTGGC AACATGATGA TGTACATCAA CCTGATTTGG
GCCTGGGGCC ACCCGGAAGT TTACATCCTG ATCCTGCCTG TTTTCGGTGT GTTCTCCGAA
ATTGCGGCAA CCTTCTCGCG TAAACGTCTG TTTGGTTATA CCTCGCTGGT ATGGGCAACC
GTCTGCATCA CCGTGCTGTC GTTCATCGTT TGGCTGCACC ACTTCTTTAC GATGGGTGCG
GGCGCGAACG TAAACGCCTT CTTTGGTATC ACCACAATGA TTATCGCCAT CCCGACCGGG
GTGAAGATCT TCAACTGGCT GTTCACCATG TATCAGGGCC GCATCGTGTT CCATTCTGCG
ATGCTGTGGA CCATCGGTTT TATCGTCACT TTCTCGGTGG GCGGGATGAC TGGCGTGCTG
CTGGCAGTAC CGGGCGCGGA CTTCGTTCTG CATAACAGCC TGTTCCTGAT TGCGCACTTC
CATAACGTGA TCATCGGCGG CGTGGTCTTC GGCTGCTTCG CAGGGATGAC CTACTGGTGG
CCTAAAGCGT TCGGTTTCAA ACTGAACGAA ACCTGGGGTA AACGCGCGTT CTGGTTCTGG
ATCATCGGCT TCTTCGTTGC CTTTATGCCG CTGTATGCGC TGGGCTTCAT GGGCATGACC
CGTCGTTTGA GCCAGCAGAT TGACCCGCAG TTCCACACCA TGCTGATGAT TGCAGCCAGC
GGTGCAGTAC TGATTGCGCT GGGTATTCTC TGCCTCGTTA TTCAGATGTA CGTTTCTATT
CGCGACCGCG ACCAGAACCG TGACCTGACT GGCGACCCGT GGGGTGGCCG TACGCTGGAG
TGGGCAACCT CTTCCCCGCC TCCGTTCTAT AACTTTGCCG TTGTGCCGCA CGTTCACGAG
CGTGATGCAT TCTGGGAAAT GAAAGAGAAA GGCGAAGCGT ATAAAAAGCC TGACCACTAT
GAAGAAATTC ATATGCCGAA AAACAGCGGT GCAGGTATCG TCATTGCAGC TTTCTCCACC
ATCTTCGGTT TCGCCATGAT CTGGCATATC TGGTGGCTGG CGATTGTTGG CTTCGCAGGC
ATGATCATCA CCTGGATCGT GAAAAGCTTC GACGAGGACG TGGATTACTA CGTGCCGGTG
GCAGAAATCG AAAAACTGGA AAACCAGCAT TTCGATGAGA TTACTAAGGC AGGGCTGAAA
AATGGCAACT GA
 
Protein sequence
MFGKLSLDAV PFHEPIVMVT IAGIILGGLA LVGLITYFGK WTYLWKEWLT SVDHKRLGIM 
YIIVAIVMLL RGFADAIMMR SQQALASAGE AGFLPPHHYD QIFTAHGVIM IFFVAMPFVI
GLMNLVVPLQ IGARDVAFPF LNNLSFWFTV VGVILVNVSL GVGEFAQTGW LAYPPLSGIE
YSPGVGVDYW IWSLQLSGIG TTLTGINFFV TILKMRAPGM TMFKMPVFTW ASLCANVLII
ASFPILTVTV ALLTLDRYLG THFFTNDMGG NMMMYINLIW AWGHPEVYIL ILPVFGVFSE
IAATFSRKRL FGYTSLVWAT VCITVLSFIV WLHHFFTMGA GANVNAFFGI TTMIIAIPTG
VKIFNWLFTM YQGRIVFHSA MLWTIGFIVT FSVGGMTGVL LAVPGADFVL HNSLFLIAHF
HNVIIGGVVF GCFAGMTYWW PKAFGFKLNE TWGKRAFWFW IIGFFVAFMP LYALGFMGMT
RRLSQQIDPQ FHTMLMIAAS GAVLIALGIL CLVIQMYVSI RDRDQNRDLT GDPWGGRTLE
WATSSPPPFY NFAVVPHVHE RDAFWEMKEK GEAYKKPDHY EEIHMPKNSG AGIVIAAFST
IFGFAMIWHI WWLAIVGFAG MIITWIVKSF DEDVDYYVPV AEIEKLENQH FDEITKAGLK
NGN