Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3040 |
Symbol | |
ID | 6146532 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3128893 |
End bp | 3130095 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641617909 |
Product | hypothetical protein |
Protein accession | YP_001745060 |
Protein GI | 170683096 |
COG category | [C] Energy production and conversion [H] Coenzyme transport and metabolism |
COG ID | [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases |
TIGRFAM ID | [TIGR01988] Ubiquinone biosynthesis hydroxylase, UbiH/UbiF/VisC/COQ6 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.184016 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAGTG TTGATGTAGC CATTGTTGGT GGCGGCATGG TGGGGCTGGC GGTTGCCTGC GGCTTACAGG GGAGCGGCTT ACGCGTTGCC GTACTGGAGC AGCGCGTACC GGAACCTCTG GCGGCGGATG CACCACCACA ACTGCGCGTT TCGGCTATCA ATGCCGCCAG CGAAAAATTA CTCACCCGTC TTGGCGTCTG GCAGGACATT CTCTCTCGTA GGGCCAGCTG TTATCACGGC ATGGAAGTGT GGGATAAAGA CAGCTTTGGT CACATTTCGT TTGACGATCA AAGCATGGGC TATAGCCATC TTGGGCATAT CGTTGAAAAT TCAGTGATTC ACTACGCGCT GTGGAACAAA GCGCAGCAGT CGTCAGATAT CACTCTGTTG GCCCCCGCAG AATTACAGCA GGTCGCCTGG GGAGAAAATG AAACCTTCCT GACGCTGAAA GATGGCAGCA TGTTAACGGC GCGTCTGGTG ATTGGCGCGG ACGGCGCTAA TTCCTGGTTG CGCAACAAAG CCGATATTCC GCTGACTTTC TGGGATTATC AGCATCACGC GCTGGTAGCG ACCATTCGCA CGGAAGAACC GCATGATGCG GTGGCGCGGC AGGTTTTCCA TGGCGAAGGC ATTCTGGCCT TTTTACCGCT TAGCGATCCG CATCTTTGCT CGATTGTCTG GTCACTGTCG CCAGAGGAAG CGCAGCGGAT GCAGCAGGCA AGTGAAGACG AATTTAATCG CGCGTTAAAT ATCGCTTTTG ATAATCGCCT GGGCTTATGC AAGGTTGAGA GCGCGCGTCA GGTGTTCCCA CTGACGGGGC GTTATGCGCG CCAGTTTGCC GCGCACCGTC TGGCGTTGGT GGGCGACGCC GCACATACCA TTCACCCGCT CGCGGGGCAG GGCGTTAATC TCGGCTTTAT GGATGCTGCA GAGCTGGTTG CTGAACTGAA ACGGTTGCAT CGTCAGGGGA AAGACATCGG GCAGTACATT TATCTGCGTC GCTATGAGCG TAGCCGCAAG CACAGTGCGG CGCTGATGCT GGCTGGTATG CAGGGATTCC GCGATCTGTT TTCCGGTACC AATCCGGCGA AAAAACTGCT GCGTGATATT GGTTTGAAAC TGGCCGACAC GCTTCCTGGC GTTAAACCAC AACTTATCCG CCAGGCAATG GGATTAAACG ATTTGCCTGA ATGGCTGCGT TAA
|
Protein sequence | MQSVDVAIVG GGMVGLAVAC GLQGSGLRVA VLEQRVPEPL AADAPPQLRV SAINAASEKL LTRLGVWQDI LSRRASCYHG MEVWDKDSFG HISFDDQSMG YSHLGHIVEN SVIHYALWNK AQQSSDITLL APAELQQVAW GENETFLTLK DGSMLTARLV IGADGANSWL RNKADIPLTF WDYQHHALVA TIRTEEPHDA VARQVFHGEG ILAFLPLSDP HLCSIVWSLS PEEAQRMQQA SEDEFNRALN IAFDNRLGLC KVESARQVFP LTGRYARQFA AHRLALVGDA AHTIHPLAGQ GVNLGFMDAA ELVAELKRLH RQGKDIGQYI YLRRYERSRK HSAALMLAGM QGFRDLFSGT NPAKKLLRDI GLKLADTLPG VKPQLIRQAM GLNDLPEWLR
|
| |