Gene ECH74115_5276 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5276 
SymbolubiB 
ID6972027 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4919650 
End bp4921290 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content50% 
IMG OID643388940 
Productputative ubiquinone biosynthesis protein UbiB 
Protein accessionYP_002273354 
Protein GI209399288 
COG category[R] General function prediction only 
COG ID[COG0661] Predicted unusual protein kinase 
TIGRFAM ID[TIGR01982] 2-polyprenylphenol 6-hydroxylase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.709941 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.225701 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCCAG GTGAAGTACG GCGCCTATAT TTCATCATTC GCACTTTTTT AAGCTACGGA 
CTTGATGAAC TGATCCCCAA AATGCGTATC ACCCTGCCGC TACGGCTATG GCGATACTCA
TTATTCTGGA TGCCAAATCG GCATAAAGAC AAACTTTTAG GTGAGCGACT ACGACTGGCC
CTGCAAGAAC TGGGGCCGGT ATGGATCAAG TTCGGGCAAA TGTTATCAAC CCGCCGCGAT
CTTTTTCCGC CGCATATTGC CGATCAGCTG GCGTTATTGC AGGACAAAGT TGCTCCGTTT
GATGGCAAGC TGGCGAAGCA GCAGATTGAA GCTGCAATGG GCGGCTTGCC GGTAGAAGCG
TGGTTTGACG ATTTTGAAAT CAAGCCACTG GCTTCTGCTT CTATCGCCCA GGTTCATACC
GCGCGATTGA AATCGAATGG TAAAGAGGTG GTGATTAAAG TCATCCGCCC GGATATTTTG
CCGGTCATTA AAGCAGACCT GAAACTTATC TACCGTCTGG CTCGCTGGGT GCCGCGTTTG
CTGCCGGATG GTCGCCGTCT ACGCCCAACC GAAGTGGTGC GCGAGTACGA AAAGACCTTG
ATTGATGAAC TGAATTTGCT GCGGGAATCT GCCAACGCCA TTCAGCTTCG GCGAAATTTT
GAAGACAGCC CGATGCTCTA CATCCCGGAG GTTTACCCTG ACTATTGTAG TGAAGGGATG
ATGGTGATGG AGCGTATTTA CGGCATTCCG GTGTCTGATG TTGCGGCGCT GGAGAAAAAC
GGCACCAACA TGAAATTGCT GGCGGAACGC GGCGTGCAGG TGTTCTTCAC TCAGGTCTTT
CGCGACAGCT TTTTCCATGC CGATATGCAC CCTGGCAACA TCTTCGTAAG CTATGAACAC
CCGGAAAACC CGAAATATAT CGGCATTGAT TGCGGGATTG TTGGCTCGCT AAACAAAGAA
GATAAACGCT ATCTGGCAGA AAACTTTATT GCCTTCTTTA ATCGCGACTA TCGCAAAGTG
GCAGAGCTAC ACGTCGATTC TGGCTGGGTG CCACCAGATA CCAACGTTGA AGAGTTCGAA
TTTGCTATTC GTACGGTCTG TGAACCTATC TTTGAGAAAC CGCTGGCCGA AATTTCGTTT
GGACATGTAC TGTTAAATCT GTTTAATACG GCGCGTCGCT TCAATATGGA AGTGCAGCCG
CAACTGGTGT TACTCCAGAA AACCCTGCTC TACGTCGAAG GGGTAGGACG CCAGCTTTAT
CCGCAACTCG ATTTATGGAA AACGGCGAAG CCTTTCCTGG AGTCGTGGAT TAAAGATCAG
GTCGGTATTC CTGCGCTGGT GAGAGCATTT AAAGAAAAAG CGCCGTTCTG GGTCGAAAAA
ATGCCAGAAC TGCCTGAATT GGTTTACGAC AGTTTGCGCC AGGGCAAGTA TTTACAGCAC
AGTGTTGATA AGATTGCCCG CGAGCTTCAG TCAAATCATG TACGTCAGGG ACAATCGCGT
TATTTTCTCG GAATTGGCGC TACGTTAGTA TTAAGTGGCA CATTCTTGTT GGTCAGCCGA
CCTGAATGGG GGCTGATGCC CGGCTGGTTA ATGGCAGGTG GTCTGATCGC CTGGTTTGTC
GGTTGGCGCA AAACACGCTG A
 
Protein sequence
MTPGEVRRLY FIIRTFLSYG LDELIPKMRI TLPLRLWRYS LFWMPNRHKD KLLGERLRLA 
LQELGPVWIK FGQMLSTRRD LFPPHIADQL ALLQDKVAPF DGKLAKQQIE AAMGGLPVEA
WFDDFEIKPL ASASIAQVHT ARLKSNGKEV VIKVIRPDIL PVIKADLKLI YRLARWVPRL
LPDGRRLRPT EVVREYEKTL IDELNLLRES ANAIQLRRNF EDSPMLYIPE VYPDYCSEGM
MVMERIYGIP VSDVAALEKN GTNMKLLAER GVQVFFTQVF RDSFFHADMH PGNIFVSYEH
PENPKYIGID CGIVGSLNKE DKRYLAENFI AFFNRDYRKV AELHVDSGWV PPDTNVEEFE
FAIRTVCEPI FEKPLAEISF GHVLLNLFNT ARRFNMEVQP QLVLLQKTLL YVEGVGRQLY
PQLDLWKTAK PFLESWIKDQ VGIPALVRAF KEKAPFWVEK MPELPELVYD SLRQGKYLQH
SVDKIARELQ SNHVRQGQSR YFLGIGATLV LSGTFLLVSR PEWGLMPGWL MAGGLIAWFV
GWRKTR