Gene TM1040_1535 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1535 
Symbol 
ID4075833 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1640675 
End bp1642567 
Gene Length1893 bp 
Protein Length630 aa 
Translation table11 
GC content63% 
IMG OID638006848 
Productglucosyltransferase MdoH 
Protein accessionYP_613530 
Protein GI99081376 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2943] Membrane glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.217961 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.15626 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGACA TTGGTTTTAC AGATACATCC GCGCTGTTGT TGCCACCAGA GGCTCCGCTG 
GCGATGCCGG CGCAGGACTT TGGACGCCAA TTTCACGACG CCAGCGCGCC GGCTGCCTCC
GCCCGGGATG GAACCGGCGC GGCGCTGTGG CGGGTGCTGG CGTTTTCGCC TGCCATGGTT
GCAACTCTTG CGCTTGCGTG GGTGATGCAG GGCTGGTTTG CCGCAGATGG CACCACCTCG
CTGGAGTGGG TGCTGTTGGT TCTGATTGCT TTCAACTTCT TCTGGATCAC TTTCACGGTC
TCGACCGTCC TGCTTGGACT GTTCAGCCTC TCGCGCACCC GTCCGCGCCC CGAACGCGGC
CTGCGCAAAC CAATGCGGGT GGCCCTCTTG GTGCCAATCT ACAACGAAGT GCCCTGGTAT
GTGCTTGGCA ACGCGCGTTC CATGCTCGAG GAGCTGCGGG CCATCGGCGG GCCCCATCGC
TACGAGATGT TCATTCTCTC GGACACCCGC GACCCAGAGA TTGCCGCACA AGAGCTGCAG
AGCATCAAGG CTTTACGCGC GGATTTGCCC GAGGGGATCA CGCTCTATTA TCGTCGGCGC
GCCGAAAACA CCGCCCGCAA GGTGGGCAAT ATCCATGATT GGGTGACCCG CTGGGGCGGC
AGCTATGAGG CCATGTTGGT GCTGGATGCG GACAGCCTGA TGACAGGCCG CGCCATCCAG
CGCCTCACGG ATGCACTGGC GCGGGACCCG GCTGCTGGTC TCATTCAGAG CTTCCCGCAA
CTTATTGGTG CACAATCCGT TTTTGGCCGC ATGCAACAGT TTGCCAACGG CGTATACGGC
CTCGCGCTGG CCGAGGGTCT GGCGCGTTGG ACTGGTCATG AGGGCAATTA CTGGGGCCAC
AACGCCATCA TTCGCACCCG CGCCTTTGCC GCGTCGGCCG GGTTGCCGGA GCTGCGCGGG
TTCACCGGAG GCAGCAGCCT CATCATGAGC CATGATTTTG TGGAGGCAGG CTTGTTGCGA
CGGGCTGGCT GGCGCGTGAG GTTCCTGCCC CGCATCCGCG GCTCCTACGA GGAAACACCG
GCCACTCTGG TGGATCATAT CCAGCGCGAC CGGCGCTGGT GTCAGGGCAA CCTGCAACAC
CTGCGTCTTT TGTCGGCAAC CGGGTTTCAC GCGATGTCGC GGTTCCACCT CGCGCATGGT
GCCATCGGCT ATCTGATGGC GCCGGTCTGG TTTGCGCTCT TGGTGATCTG GGCCGTGATC
GGCCAGGACG AGGGCGGATC AGTGATCACC TACTTCTCCG AGGCAAACCC GCTGCGGCCC
AACTGGCCGG ATATGAGCGA GCCACGCCAT GTGGCGGTGA TCGTCCTGAT TTATGCCATG
TTGCTCGCGC CCAAGGTCCT CTCTGTCGCG GCCCTGCCTC TGACCGGGCG GCGGATCGCG
GATTATGGCG GTCTGGGGCG GTTCCTTCTG TCCATGCTCA CCGAAATTTT GCTTGCGATC
CTCTATGCGC CGATCCTGAT GGTTCAGCAG ATGATTGCCG TGTTGCGCAG CGTCTTTGGC
CTGCAAAAAG GCTGGTCCCC GCAGGCGCGC GCAGGTGGTG AGTATAGTCT TGCAACCCTG
TGCAAGTGCC ACCTTCTAGA GACCGTGAGT GGCATCGCGC TCTGCATCGG GATTGCTGCG
GGCCTGGTGT CACTCTGGCT GTTGCCAATC GCACTGTCAC TTGTGCTGGC CGTGCCGCTC
TCTGCGATGT CGGGCCTGCG CCTGCCGCGG GGCTGGATGG GCACGGCCGA GACCTTGAAC
GAGCCGCAGA TCAACCGCGC GGCCCATCAC TACCGCAATC TGTTGCGGCA ACACGCGCAA
GGGACCGACG TGCCCGTGCA GGCCGCGGAG TGA
 
Protein sequence
MKDIGFTDTS ALLLPPEAPL AMPAQDFGRQ FHDASAPAAS ARDGTGAALW RVLAFSPAMV 
ATLALAWVMQ GWFAADGTTS LEWVLLVLIA FNFFWITFTV STVLLGLFSL SRTRPRPERG
LRKPMRVALL VPIYNEVPWY VLGNARSMLE ELRAIGGPHR YEMFILSDTR DPEIAAQELQ
SIKALRADLP EGITLYYRRR AENTARKVGN IHDWVTRWGG SYEAMLVLDA DSLMTGRAIQ
RLTDALARDP AAGLIQSFPQ LIGAQSVFGR MQQFANGVYG LALAEGLARW TGHEGNYWGH
NAIIRTRAFA ASAGLPELRG FTGGSSLIMS HDFVEAGLLR RAGWRVRFLP RIRGSYEETP
ATLVDHIQRD RRWCQGNLQH LRLLSATGFH AMSRFHLAHG AIGYLMAPVW FALLVIWAVI
GQDEGGSVIT YFSEANPLRP NWPDMSEPRH VAVIVLIYAM LLAPKVLSVA ALPLTGRRIA
DYGGLGRFLL SMLTEILLAI LYAPILMVQQ MIAVLRSVFG LQKGWSPQAR AGGEYSLATL
CKCHLLETVS GIALCIGIAA GLVSLWLLPI ALSLVLAVPL SAMSGLRLPR GWMGTAETLN
EPQINRAAHH YRNLLRQHAQ GTDVPVQAAE