Gene EcSMS35_4218 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4218 
SymbolubiB 
ID6143929 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4315462 
End bp4317102 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content50% 
IMG OID641619041 
Productputative ubiquinone biosynthesis protein UbiB 
Protein accessionYP_001746169 
Protein GI170681731 
COG category[R] General function prediction only 
COG ID[COG0661] Predicted unusual protein kinase 
TIGRFAM ID[TIGR01982] 2-polyprenylphenol 6-hydroxylase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.00118244 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGACGCCAG GTGAAGTACG GCGCCTATAT TTCATCATTC GCACTTTTTT AAGCTACGGA 
CTTGATGAAC TGATCCCCAA AATGCGTATC ACCCTGCCGC TACGGCTATG GCGATACTCA
TTATTCTGGA TGCCAAATCG GCATAAAGAC AAACCTTTAG GTGAGCGACT ACGACTGGCC
CTGCAAGAAC TGGGACCGGT ATGGATCAAG TTCGGGCAGA TGTTATCAAC CCGCCGCGAT
CTTTTTCCGC CGCATATTGC CGATCAGCTG GCGTTATTGC AGGACAAAGT CGCTCCGTTT
GATGGCAAGC TGGCGAAGCA GCAGATTGAA GCTGCAATGG GCGGCTTGCC GGTAGAAGCG
TGGTTTGACG ATTTTGAAAT CAAGCCGCTG GCTTCTGCTT CTATCGCCCA GGTTCATACC
GCGCGATTGA AATCGAATGG TAAAGAGGTG GTGATTAAAG TCATCCGCCC GGATATTTTG
CCGGTTATTA AAGCGGATCT GAAACTTATC TACCGGCTGG CTCGCTGGGT GCCGCGTTTG
CTGCCGGATG GTCGCCGTCT GCGCCCAACC GAAGTGGTGC GCGAGTACGA AAAGACCTTG
ATTGATGAAC TGAATTTGCT GCGGGAATCT GCCAACGCCA TTCAGCTTCG ACGCAATTTT
GAAGACAGCC CGATGCTCTA CATCCCGGAA GTTTACCCTG ACTATTGTAG TGAAGGGATG
ATGGTAATGG AGCGTATTTA CGGCATTCCG GTGTCTGATG TTGCGACGCT GGAGAAAAAC
GGCACAAACA TGAAATTGCT GGCGGAACGC GGCGTGCAGG TGTTCTTCAC TCAGGTCTTT
CGCGACAGCT TTTTCCATGC CGATATGCAC CCTGGCAACA TCTTCGTAAG CTATGAACAC
CCGGAAAACC CGAAATATAT CGGCATTGAT TGCGGGATTG TTGGCTCGCT AAACAAAGAA
GATAAACGCT ATCTGGCAGA AAACTTTATC GCCTTCTTTA ATCGCGACTA TCGCAAAGTG
GCAGAGCTAC ACGTCGATTC TGGCTGGGTG CCACCAGATA CCAACGTTGA AGAGTTCGAA
TTTGCCATTC GTACGGTCTG TGAACCTATC TTTGAGAAAC CGCTGGCCGA AATTTCGTTT
GGACATGTAC TGTTAAATCT GTTTAATACG GCGCGTCGCT TCAATATGGA AGTGCAGCCG
CAACTGGTGT TACTCCAGAA AACCCTGCTC TACGTCGAAG GGGTAGGACG CCAGCTTTAT
CCGCAACTCG ATTTATGGAA AACGGCGAAG CCTTTCCTGG AGTCGTGGAT TAAAGATCAG
GTCGGTATTC CTGCGCTGGT GAGAGCATTT AAAGAAAAAG CGCCGTTCTG GGTCGAAAAA
ATGCCAGAAC TGCCTGAACT GGTTTACGAC AGTTTGCGCC AGGGCAAGTA TTTACAGCAC
AGTGTTGATA AGATTGCCCG CGAGCTTCAG TCAAATCATG TACGTCAGGG ACAATCGCGT
TATTTTCTCG GAATTGGCGC TACGTTAGTA TTAAGTGGCA CATTCTTGTT GGTCAGCCGA
CCTGAATGGG GGCTGATGCC CGTCTGGTTA ATGGCAGGTG GTCTGATCGC CTGGTTTGTC
GGTTGGCGCA AAACACGCTG A
 
Protein sequence
MTPGEVRRLY FIIRTFLSYG LDELIPKMRI TLPLRLWRYS LFWMPNRHKD KPLGERLRLA 
LQELGPVWIK FGQMLSTRRD LFPPHIADQL ALLQDKVAPF DGKLAKQQIE AAMGGLPVEA
WFDDFEIKPL ASASIAQVHT ARLKSNGKEV VIKVIRPDIL PVIKADLKLI YRLARWVPRL
LPDGRRLRPT EVVREYEKTL IDELNLLRES ANAIQLRRNF EDSPMLYIPE VYPDYCSEGM
MVMERIYGIP VSDVATLEKN GTNMKLLAER GVQVFFTQVF RDSFFHADMH PGNIFVSYEH
PENPKYIGID CGIVGSLNKE DKRYLAENFI AFFNRDYRKV AELHVDSGWV PPDTNVEEFE
FAIRTVCEPI FEKPLAEISF GHVLLNLFNT ARRFNMEVQP QLVLLQKTLL YVEGVGRQLY
PQLDLWKTAK PFLESWIKDQ VGIPALVRAF KEKAPFWVEK MPELPELVYD SLRQGKYLQH
SVDKIARELQ SNHVRQGQSR YFLGIGATLV LSGTFLLVSR PEWGLMPVWL MAGGLIAWFV
GWRKTR