Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4224 |
Symbol | ubiD |
ID | 6146624 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4320224 |
End bp | 4321717 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641619047 |
Product | 3-octaprenyl-4-hydroxybenzoate decarboxylase |
Protein accession | YP_001746175 |
Protein GI | 170682759 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases |
TIGRFAM ID | [TIGR00148] UbiD family decarboxylases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.514143 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.00200609 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGACGCCA TGAAATATAA CGATTTACGC GACTTCTTGA CGCTGCTTGA ACAGCAGGGT GAGCTAAAAC GTATCACGCT CCCGGTGGAT CCGCATCTGG AAATCACTGA AATTGCTGAC CGCACTTTGC GTGCCGGTGG GCCTGCGCTG TTGTTCGAAA ACCCTAAAGG CTACTCAATG CCGGTGCTGT GCAACCTGTT CGGTACGCCA AAGCGCGTGG CGATGGGCAT GGGGCAGGAA GATGTTTCGG CGCTGCGTGA AGTTGGTAAA TTATTGGCGT TTCTGAAAGA GCCGGAGCCG CCAAAAGGTT TCCGCGACCT GTTTGATAAA CTGCCGCAGT TTAAGCAAGT ATTGAACATG CCGACAAAGC GGCTGCGTGG TGCGCCTTGC CAACAAAAAA TCGTCTCTGG CGATGACGTC GATCTCAATC GCATTCCCAT TATGACCTGC TGGCCGGAAG ATGCCGCGCC GCTGATTACC TGGGGGCTGA CCGTGACGCG CGGCCCGCAT AAAGAGCGGC AGAATCTGGG CATTTATCGC CAGCAGCTGA TTGGTAAAAA CAAACTGATT ATGCGCTGGC TGTCGCATCG CGGCGGCGCG CTGGATTATC AGGAGTGGTG TGTGGCGCAT CCGGGCGAAC GTTTCCCGGT TTCTGTGGCG CTGGGTGCCG ATCCCGCCAC GATTCTCGGT GCAGTCACCC CCGTTCCGGA TACGCTTTCA GAGTATGCGT TTGCCGGATT GCTACGCGGC ACCAAAACCG AAGTAGTAAA GTGTATTTCC AATGATCTCG AAGTGCCCGC CAGTGCGGAG ATTGTGCTGG AAGGGTATAT CGAACAAGGC GAAACTGCGC CGGAAGGGCC GTATGGCGAC CACACCGGTT ACTATAACGA AGTCGATAGT TTCCCGGTAT TTACCGTGAC GCATATTACC CAGCGTGAAG ATGCGATTTA TCATTCCACC TATACCGGGC GTCCGCCGGA TGAACCTGCG GTGCTGGGTG TCGCACTGAA CGAAGTGTTT GTGCCGATTC TGCAAAAACA GTTCCCGGAA ATTGTCGATT TTTACCTGCC GCCGGAAGGC TGCTCTTATC GCCTGGCGGT AGTGACGATC AAAAAACAGT ACGCCGGACA CGCGAAGCGC GTCATGATGG GCGTCTGGTC GTTCTTACGC CAGTTTATGT ACACTAAATT TGTGATCGTT TGCGATGATG ACGTCAACGC ACGCGACTGG AACGATGTGA TTTGGGCGAT TACCACCCGT ATGGACCCGG CGCGGGACAC GGTGTTGGTC GAAAATACGC CTATTGATTA TCTGGATTTT GCCTCGCCTG TGTCCGGGCT GGGTTCAAAA ATGGGGCTGG ATGCCACGAA TAAATGGCCA GGGGAAACCC AGCGTGAATG GGGACGTCCC ATCAAAAAAG ATCCGGATGT TGTCGCGCAT ATTGACGCCA TCTGGGATGA ACTGGCTATT TTTAACAACG GTAAAAGCGC CTGA
|
Protein sequence | MDAMKYNDLR DFLTLLEQQG ELKRITLPVD PHLEITEIAD RTLRAGGPAL LFENPKGYSM PVLCNLFGTP KRVAMGMGQE DVSALREVGK LLAFLKEPEP PKGFRDLFDK LPQFKQVLNM PTKRLRGAPC QQKIVSGDDV DLNRIPIMTC WPEDAAPLIT WGLTVTRGPH KERQNLGIYR QQLIGKNKLI MRWLSHRGGA LDYQEWCVAH PGERFPVSVA LGADPATILG AVTPVPDTLS EYAFAGLLRG TKTEVVKCIS NDLEVPASAE IVLEGYIEQG ETAPEGPYGD HTGYYNEVDS FPVFTVTHIT QREDAIYHST YTGRPPDEPA VLGVALNEVF VPILQKQFPE IVDFYLPPEG CSYRLAVVTI KKQYAGHAKR VMMGVWSFLR QFMYTKFVIV CDDDVNARDW NDVIWAITTR MDPARDTVLV ENTPIDYLDF ASPVSGLGSK MGLDATNKWP GETQREWGRP IKKDPDVVAH IDAIWDELAI FNNGKSA
|
| |