Gene EcolC_4173 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_4173 
SymbolubiB 
ID6067256 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4609793 
End bp4611433 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content50% 
IMG OID641603601 
Productputative ubiquinone biosynthesis protein UbiB 
Protein accessionYP_001727097 
Protein GI170022143 
COG category[R] General function prediction only 
COG ID[COG0661] Predicted unusual protein kinase 
TIGRFAM ID[TIGR01982] 2-polyprenylphenol 6-hydroxylase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00514072 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGCCAG GTGAAGTACG GCGCCTATAT TTCATCATTC GCACTTTTTT AAGCTACGGA 
CTTGATGAAC TGATCCCCAA AATGCGTATC ACCCTGCCGC TACGGCTATG GCGATACTCA
TTATTCTGGA TGCCAAATCG GCATAAAGAC AAACTTTTAG GTGAGCGACT ACGACTGGCC
CTGCAAGAAC TGGGGCCGGT TTGGATCAAG TTCGGGCAAA TGTTATCAAC CCGCCGCGAT
CTTTTTCCAC CGCATATTGC CGATCAGCTG GCGTTATTGC AGGACAAAGT TGCTCCGTTT
GATGGCAAGC TGGCGAAGCA GCAGATTGAA GCTGCAATGG GCGGCTTGCC GGTAGAAGCG
TGGTTTGACG ATTTTGAAAT CAAGCCGCTG GCTTCTGCTT CTATCGCCCA GGTTCATACC
GCGCGATTGA AATCGAATGG TAAAGAGGTG GTGATTAAAG TCATCCGCCC GGATATTTTG
CCGGTCATTA AAGCGGATCT GAAACTTATC TACCGTCTGG CTCGCTGGGT GCCGCGTTTG
CTGCCGGATG GTCGCCGTCT GCGCCCAACC GAAGTGGTGC GCGAGTACGA AAAGACCTTG
ATTGATGAAC TGAATTTGCT GCGGGAATCT GCCAATGCCA TTCAGCTTCG GCGCAATTTT
GAAGACAGCC CGATGCTCTA CATCCCGGAA GTTTACCCTG ACTATTGTAG TGAAGGGATG
ATGGTGATGG AGCGCATTTA CGGCATTCCG GTGTCTGATG TTGCGGCGCT GGAGAAAAAC
GGCACTAACA TGAAATTGCT GGCGGAACGC GGCGTGCAGG TGTTCTTCAC TCAGGTTTTC
CGTGACAGCT TTTTCCATGC CGATATGCAC CCTGGCAACA TCTTCGTAAG CTATGAACAC
CCGGAAAACC CGAAATATAT CGGCATTGAT TGCGGGATTG TTGGCTCGCT AAACAAAGAA
GATAAACGCT ATCTGGCAGA AAACTTTATC GCCTTCTTTA ATCGCGACTA TCGCAAAGTG
GCAGAGCTAC ACGTCGATTC AGGCTGGGTG CCACCAGATA CCAACGTTGA AGAGTTCGAA
TTTGCCATTC GTACGGTCTG TGAACCTATC TTTGAGAAAC CGCTGGCCGA AATTTCGTTT
GGACATGTAC TGTTAAATCT GTTTAATACG GCGCGTCGCT TCAATATGGA AGTGCAGCCG
CAACTGGTGT TACTCCAGAA AACCCTGCTC TATGTCGAAG GGGTAGGACG CCAGCTTTAT
CCGCAGCTCG ATTTATGGAA AACAGCGAAG CCTTTCCTGG AGTCGTGGAT TAAAGATCAG
GTCGGTATTC CTGCGCTGGT GAGAGCATTT AAAGAAAAAG CGCCGTTCTG GGTCGAAAAA
ATGCCAGAAC TGCCTGAATT GGTTTACGAC AGTTTGCGCC AGGGCAAGTA TTTACAGCAC
AGTGTTGATA AGATTGCCCG CGAGCTTCAG TCAAATCATG TACGTCAGGG ACAATCGCGT
TATTTTCTCG GAATTGGCGC TACGTTAGTA TTAAGTGGCA CATTCTTGTT GGTCAGCCGA
CCTGAATGGG GGCTGATGCC CGGCTGGTTA ATGGCAGGTG GTCTGATCGC CTGGTTTGTC
GGTTGGCGCA AAACACGCTG A
 
Protein sequence
MTPGEVRRLY FIIRTFLSYG LDELIPKMRI TLPLRLWRYS LFWMPNRHKD KLLGERLRLA 
LQELGPVWIK FGQMLSTRRD LFPPHIADQL ALLQDKVAPF DGKLAKQQIE AAMGGLPVEA
WFDDFEIKPL ASASIAQVHT ARLKSNGKEV VIKVIRPDIL PVIKADLKLI YRLARWVPRL
LPDGRRLRPT EVVREYEKTL IDELNLLRES ANAIQLRRNF EDSPMLYIPE VYPDYCSEGM
MVMERIYGIP VSDVAALEKN GTNMKLLAER GVQVFFTQVF RDSFFHADMH PGNIFVSYEH
PENPKYIGID CGIVGSLNKE DKRYLAENFI AFFNRDYRKV AELHVDSGWV PPDTNVEEFE
FAIRTVCEPI FEKPLAEISF GHVLLNLFNT ARRFNMEVQP QLVLLQKTLL YVEGVGRQLY
PQLDLWKTAK PFLESWIKDQ VGIPALVRAF KEKAPFWVEK MPELPELVYD SLRQGKYLQH
SVDKIARELQ SNHVRQGQSR YFLGIGATLV LSGTFLLVSR PEWGLMPGWL MAGGLIAWFV
GWRKTR