Gene TM1040_1534 details

Gene Information

Locus tag: TM1040_1534
Symbol: mdoG
ID: 4075832
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1639148
End bp: 1640674
Gene Length: 1527 bp
Protein Length: 508 aa
Translation table: 11
GC content: 61%
IMG OID: 638006847
Product: glucan biosynthesis protein G
Protein accession: YP_613529
Protein GI: 99081375
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3131] Periplasmic glucans biosynthesis protein
TIGRFAM ID:
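
The coordinate and length fields above agree with each other, which a short sketch can confirm. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1639148
end_bp = 1640674

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1527 508
```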


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.134451
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACAT TCAGCCGTCG TTCCTTCCTT CAATTCGCCT CTGCCCTGTC ACTTGCGGCA 
CTGGCCCCCG CCGCGCGCGC CAGCGCCTCC GAGGAAGCGT TTTCGCGCGC CGATGTGATT
GCGCGCGCCA AGGCTCTCGC TGAACGGCCG TTCAAGGCCC GCAGCGCGGT TCCCGATGAC
TGGCTCGCTC TGAGTTATGA CCAGTATCGC TCGATCCAGT TTGACCTCGA CAAGGCGCTC
TGGGCCGGCT CTGATCGCAG CTACAACGTG GATTTCTTTC TACCCGGCCT CTATTTCGAA
CGCCCGGTCC TCGTCCATGA AGTTGAAGAC GGGATGGCGC ATCGCATCCC CTTCGATTTG
GGTCATTTCA AGAAACATCC CGAGATCTCA CCCGATCTGT CGACCGAAGG GCCGCTGGGG
TATTCGGGTT TCCGCCTGCG CACGGATCTT GGCGACCGGG ATCATCCGGG AAAGAAAACC
GAATTCTGCG TCTTTCAGGG CGCGAGCTAC TTCCGCGCGA TTGCCAATGG CAACAACTAT
GGTCTGTCTG CACGCGGGCT GGCGCTCAAG ACGGCCGATC CCGAGGGCGA GGAATTCCCC
GAATTTGTCA GCTTCTGGCT CGAAGCGCCG GGCCCCTTGC AGGAAAACAT GGTGGTGCAC
GCGCTGATGG ACAGCCCCTC GGTGACGGGC GCCTACCGGT TTGACATCAC CCCCGGCACC
CCTTGTGTGA TGGATGTGGA AGCGACGCTC TTTGCACGCA AGGACCTGAC GCATGCGGGT
CTTGCACCGC TTACCTCAAT GTTCCTCTTT GATGGCACCA ACCATCAGCG GTTCGATGAT
TTCCGCCCGG CGGTTCACGA TAGCGATGGG CTCTTGATGA AAAACGGCAA TGGCGAGGTC
ATCTGGCGAC CCCTTGCAAA CCCGACCCGT CTTCAGGTGT CGAGCTTCGT GGACAAGAAC
CCCGCCGGTT TTGGCCTGAT GCAGCGCGCG CGCAAGCTGT CGGACTTTGC CGATCTCGAA
GCCCACTACC ATCGCCGCCC CTCGCTCTGG GTCGAGCCCC GCGAGGATTG GGGCGCGGGA
AGCGTCACCC TGGTTGAAAT CCCGGCGGAC AAGGAAATCT ACGACAATAT CGTCGCCTAT
TGGCGTCCGG AAAAACCCTA TCCCGCTGGC GCGCGCGTGG ATCTCAACTA TCGTCTGAGC
TGGGGCGAAG AGCCGGTGCT GGATCTGCCG CGCGTGATCA ACACCGCCTC AGGCGCCAAG
ATCTTTGGCG ACCCCGGGCG TCTCATGGTG ATCGATTTTG AGGCGCACCC GGTGTTTGAA
GCCGATCCCG AGGCCTTCAG CTTGCATATC TCCTCGCCAC ATCTTGAGAC ATCTGATGGT
GTGTTGCAGC GCAACCCGGA GACCGGCGGC ATGCGACTGG CGTTCTCCTT TGATCCGGGC
GAGCAAACCC ATGTGGAACT GCGCGCGCAA CTGCGCAAGG ATGGGCAGGA TGCTTCTGAG
GTCTGGCTGT ATCGGTGGAC CGTCTGA
 
Protein sequence
MSTFSRRSFL QFASALSLAA LAPAARASAS EEAFSRADVI ARAKALAERP FKARSAVPDD 
WLALSYDQYR SIQFDLDKAL WAGSDRSYNV DFFLPGLYFE RPVLVHEVED GMAHRIPFDL
GHFKKHPEIS PDLSTEGPLG YSGFRLRTDL GDRDHPGKKT EFCVFQGASY FRAIANGNNY
GLSARGLALK TADPEGEEFP EFVSFWLEAP GPLQENMVVH ALMDSPSVTG AYRFDITPGT
PCVMDVEATL FARKDLTHAG LAPLTSMFLF DGTNHQRFDD FRPAVHDSDG LLMKNGNGEV
IWRPLANPTR LQVSSFVDKN PAGFGLMQRA RKLSDFADLE AHYHRRPSLW VEPREDWGAG
SVTLVEIPAD KEIYDNIVAY WRPEKPYPAG ARVDLNYRLS WGEEPVLDLP RVINTASGAK
IFGDPGRLMV IDFEAHPVFE ADPEAFSLHI SSPHLETSDG VLQRNPETGG MRLAFSFDPG
EQTHVELRAQ LRKDGQDASE VWLYRWTV
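
The relationship between the two sequences can be sketched in a few lines of Python. This is a minimal illustration, not the method used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which start codons are permitted). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Codon table built from the conventional TCAG ordering of bases.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two 10-base blocks of the gene sequence above.
print(translate("ATGAGCACAT TCAGCCGTCG"))  # MSTFSR
```

Passing the full gene sequence to `translate` and `gc_content` should allow a comparison against the protein sequence and the GC content field listed in this record.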