Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_1535 |
Symbol | |
ID | 4075833 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 1640675 |
End bp | 1642567 |
Gene Length | 1893 bp |
Protein Length | 630 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638006848 |
Product | glucosyltransferase MdoH |
Protein accession | YP_613530 |
Protein GI | 99081376 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2943] Membrane glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.217961 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.15626 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGACA TTGGTTTTAC AGATACATCC GCGCTGTTGT TGCCACCAGA GGCTCCGCTG GCGATGCCGG CGCAGGACTT TGGACGCCAA TTTCACGACG CCAGCGCGCC GGCTGCCTCC GCCCGGGATG GAACCGGCGC GGCGCTGTGG CGGGTGCTGG CGTTTTCGCC TGCCATGGTT GCAACTCTTG CGCTTGCGTG GGTGATGCAG GGCTGGTTTG CCGCAGATGG CACCACCTCG CTGGAGTGGG TGCTGTTGGT TCTGATTGCT TTCAACTTCT TCTGGATCAC TTTCACGGTC TCGACCGTCC TGCTTGGACT GTTCAGCCTC TCGCGCACCC GTCCGCGCCC CGAACGCGGC CTGCGCAAAC CAATGCGGGT GGCCCTCTTG GTGCCAATCT ACAACGAAGT GCCCTGGTAT GTGCTTGGCA ACGCGCGTTC CATGCTCGAG GAGCTGCGGG CCATCGGCGG GCCCCATCGC TACGAGATGT TCATTCTCTC GGACACCCGC GACCCAGAGA TTGCCGCACA AGAGCTGCAG AGCATCAAGG CTTTACGCGC GGATTTGCCC GAGGGGATCA CGCTCTATTA TCGTCGGCGC GCCGAAAACA CCGCCCGCAA GGTGGGCAAT ATCCATGATT GGGTGACCCG CTGGGGCGGC AGCTATGAGG CCATGTTGGT GCTGGATGCG GACAGCCTGA TGACAGGCCG CGCCATCCAG CGCCTCACGG ATGCACTGGC GCGGGACCCG GCTGCTGGTC TCATTCAGAG CTTCCCGCAA CTTATTGGTG CACAATCCGT TTTTGGCCGC ATGCAACAGT TTGCCAACGG CGTATACGGC CTCGCGCTGG CCGAGGGTCT GGCGCGTTGG ACTGGTCATG AGGGCAATTA CTGGGGCCAC AACGCCATCA TTCGCACCCG CGCCTTTGCC GCGTCGGCCG GGTTGCCGGA GCTGCGCGGG TTCACCGGAG GCAGCAGCCT CATCATGAGC CATGATTTTG TGGAGGCAGG CTTGTTGCGA CGGGCTGGCT GGCGCGTGAG GTTCCTGCCC CGCATCCGCG GCTCCTACGA GGAAACACCG GCCACTCTGG TGGATCATAT CCAGCGCGAC CGGCGCTGGT GTCAGGGCAA CCTGCAACAC CTGCGTCTTT TGTCGGCAAC CGGGTTTCAC GCGATGTCGC GGTTCCACCT CGCGCATGGT GCCATCGGCT ATCTGATGGC GCCGGTCTGG TTTGCGCTCT TGGTGATCTG GGCCGTGATC GGCCAGGACG AGGGCGGATC AGTGATCACC TACTTCTCCG AGGCAAACCC GCTGCGGCCC AACTGGCCGG ATATGAGCGA GCCACGCCAT GTGGCGGTGA TCGTCCTGAT TTATGCCATG TTGCTCGCGC CCAAGGTCCT CTCTGTCGCG GCCCTGCCTC TGACCGGGCG GCGGATCGCG GATTATGGCG GTCTGGGGCG GTTCCTTCTG TCCATGCTCA CCGAAATTTT GCTTGCGATC CTCTATGCGC CGATCCTGAT GGTTCAGCAG ATGATTGCCG TGTTGCGCAG CGTCTTTGGC CTGCAAAAAG GCTGGTCCCC GCAGGCGCGC GCAGGTGGTG AGTATAGTCT TGCAACCCTG TGCAAGTGCC ACCTTCTAGA GACCGTGAGT GGCATCGCGC TCTGCATCGG GATTGCTGCG GGCCTGGTGT CACTCTGGCT GTTGCCAATC GCACTGTCAC TTGTGCTGGC CGTGCCGCTC TCTGCGATGT CGGGCCTGCG CCTGCCGCGG GGCTGGATGG GCACGGCCGA GACCTTGAAC GAGCCGCAGA TCAACCGCGC GGCCCATCAC TACCGCAATC TGTTGCGGCA ACACGCGCAA GGGACCGACG TGCCCGTGCA GGCCGCGGAG TGA
|
Protein sequence | MKDIGFTDTS ALLLPPEAPL AMPAQDFGRQ FHDASAPAAS ARDGTGAALW RVLAFSPAMV ATLALAWVMQ GWFAADGTTS LEWVLLVLIA FNFFWITFTV STVLLGLFSL SRTRPRPERG LRKPMRVALL VPIYNEVPWY VLGNARSMLE ELRAIGGPHR YEMFILSDTR DPEIAAQELQ SIKALRADLP EGITLYYRRR AENTARKVGN IHDWVTRWGG SYEAMLVLDA DSLMTGRAIQ RLTDALARDP AAGLIQSFPQ LIGAQSVFGR MQQFANGVYG LALAEGLARW TGHEGNYWGH NAIIRTRAFA ASAGLPELRG FTGGSSLIMS HDFVEAGLLR RAGWRVRFLP RIRGSYEETP ATLVDHIQRD RRWCQGNLQH LRLLSATGFH AMSRFHLAHG AIGYLMAPVW FALLVIWAVI GQDEGGSVIT YFSEANPLRP NWPDMSEPRH VAVIVLIYAM LLAPKVLSVA ALPLTGRRIA DYGGLGRFLL SMLTEILLAI LYAPILMVQQ MIAVLRSVFG LQKGWSPQAR AGGEYSLATL CKCHLLETVS GIALCIGIAA GLVSLWLLPI ALSLVLAVPL SAMSGLRLPR GWMGTAETLN EPQINRAAHH YRNLLRQHAQ GTDVPVQAAE
|
| |