Gene TM1040_3845 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3845 
Symbol 
ID4074908 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008042 
Strand
Start bp93592 
End bp96000 
Gene Length2409 bp 
Protein Length802 aa 
Translation table11 
GC content52% 
IMG OID638004502 
Productglycosyl transferase, group 1 
Protein accessionYP_611237 
Protein GI99077978 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value0.880485 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.505849 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAGTTG GTGTGCTGCG AACACAGGTT CCCTTTGTGA GTGGGGGAGC TGAGCGTCAT 
AGTATGGGTC TGGTTAATGC TCTGGAGGCC AGAGGGTACG AGGCTACGGA AATTACACTG
CCCTTCAAAT GGTATCCCGG AAATGTCTTG GCCGACCATA TTGCTGCTGC CAAGTTTCTC
GATATTTCCG AGGTTGAAGG TGTCAAGATT GATTTGGCGG TGGGCCTAAA GTTCCCTGCT
TGGCTTGCTC AACATCCGAA TATGGTATTG TGGGTGATCC ATCAGCACCG ACAAGCCTAC
GACATGTGGG AGGCTGGCAC CTCTGATTTG CTTGATGACC CACAGGGGGA GGCTTTACGT
GCGCTAATTC ACGAAGAAGA TCGTGCTGCG TTTCTGGCAT CCCCACATCC GATATATGCA
AATTCATGCA ATGTTGCCGA TCGCTTGAAG CGACATCTCG GCACAGCGGC TACCACGCTT
TATCACCCAC CACCAAACGC AGAGCTTTTG CGGCAAGGAG ATTATGGTGA TTATCTTTTT
GCGCCTGGGC GAATTAACGC GTCAAAACGT TTGGAATTAC CGCTGCGCGC CTTGGTTCAT
GCCCCCGCAA GTCGCTTGAT CATTGCCGGT GTCGCTGAGA ATCCGGCGTA TCAAAAGCGT
TTGTATTCCT TGGCACATGA GCTTGGGGTT TCCGGGAGAG TTGAGTGGCT CGGACGTGTG
GATGATGAAA CGCTTGTTCG GTATTATGCT AATGCTCGCG GCGTCGTGTT CACCCCTCAA
GACGAAGATT ATGGGTATAT CACCCTTGAG GCGATGGTCA GCGGGAAGCC CGTTGTTACA
ACCAAGGACT CTGGTGGTCC ATTGGAGTTT ATTTCAGACG GAATAGAGGG ATTGGTTGTT
GATCCTGATG CCAAAGCGTT GGGGGATGCT TTTACGTTTC TGTCAGAAGA CACCGCGACA
GCAGAGCGTA TGGGGCAGGC TGGGTATATG TGTTATGCCC AGTGTAATAT TTCTTGGGAT
CATGTGGTTT CCACACTCAC GGGACAAGCC CATCTCCCTG TGACTTTTGA CGCTGTGCAG
AAGATTGACG CTCAAGCTCC AGAGCATGGG CACGAGGTCT GCACGGAAGA TGCAGTGAGA
CAACTACGCG CCGCTGTAGC TCCTCCTAGC CGGCCTGACA ATGTGCCGTT CGCCAACATC
AATGAACTGC TTGAGGCTTA TGCGTTTGAC GAGTTGCCTG CAGCTCTTGG GCGCGATGAG
CCTCCGATTG ATGCAGGGCT CTCAGGTTAT CTAGGTACAC ATTGGACACG GTTTATGTCG
ACACTGCGAC AGCTGGAGGG GTTGGAAATT TCCTCTGCCC TTGATGTCGG TGTTTTTCCT
CCGTTGGTTT TTCAGGCGCT TCTTGCCAAT CAATTTCCAA ACATCGATCT GCACGGGTTG
TGGGAAGGGC CAAATCCTTA CGCGCAGTCC GTGCGTCCAC GCCCTGGCTA CGATGTCGAT
GGATTTGAGA TTACGCTAAA GGTTGCAAAT GGCGAACGCG ATCCTTGGCC GTATCCAGAT
GAGACCTTTG ATCTTGTCAC TGGGATGGAA ATTTTAGAGC ATTTGGCTCT AGATCCATAT
TTCTTCTTCT GCGAGGCTGC GCGCGTTCTG AAGCCCGGTG GCAATATTTT GATCACAACA
CCCAATGTGA ACAGTCACCG CGGTGTCTGG AAGACACTCA ATAACGTTGC TCCGTATTCG
TTTGGTATTT TTGTGCCTTC TGGCGGTGTT TATGGGCGCC ACAATCGTGA ATATGCGCCA
ACAGAACTAG ATATGCTGGG CCGATCAGCC GGGTTTGAGA CTGTTCGCCT ACAAACGTTT
GATGTCTATG ATCGACACAT AGAACCGGAA CTGGCCAACT TGTTAGTGTC CCGTAGAGAT
AATCTTTCTT TGCGTGGAGA GAACATTCAG TATTTGGCCA AGAAAGTCGG TTCCCCGAAA
GGGGTGCCGC AGAGCCTATA CCATGGTGAT CCTGCGCGTA TGCATGGTGA GCTACAAGTC
GTTACACACG AAGAAGAAAC GGGGCTGCTT ACTATTTCTG TGCGCAATAC GTCGTATAGC
ATTTGGCCTC TTGAGGGCGA TCGCGCCACC TGTGTGCTGG CTGAATGGGT CGATGAGGCC
GGAGTGCTTC GACATCAGCA TCTGATGCAG CCGCTTACAG AGGTGTTGTC GCCCGGGGAT
GAAGGTCGAA TTCGATTTAC GCTTGACGCT CAGAAACAAA ATTTGGGTGT TTTGCGTCTG
CATCTGTATC AGTCCGGTGT TGGTGTCTTT TCCGGAAGAG GGCGAGCGGT TCCCTTATCG
ATCCCGTGTT CTCACAGTGC ATTCCTGACG CTAGTTGATA AAACGCCTCT CCCGAGAGCG
AAGGTCTGA
 
Protein sequence
MRVGVLRTQV PFVSGGAERH SMGLVNALEA RGYEATEITL PFKWYPGNVL ADHIAAAKFL 
DISEVEGVKI DLAVGLKFPA WLAQHPNMVL WVIHQHRQAY DMWEAGTSDL LDDPQGEALR
ALIHEEDRAA FLASPHPIYA NSCNVADRLK RHLGTAATTL YHPPPNAELL RQGDYGDYLF
APGRINASKR LELPLRALVH APASRLIIAG VAENPAYQKR LYSLAHELGV SGRVEWLGRV
DDETLVRYYA NARGVVFTPQ DEDYGYITLE AMVSGKPVVT TKDSGGPLEF ISDGIEGLVV
DPDAKALGDA FTFLSEDTAT AERMGQAGYM CYAQCNISWD HVVSTLTGQA HLPVTFDAVQ
KIDAQAPEHG HEVCTEDAVR QLRAAVAPPS RPDNVPFANI NELLEAYAFD ELPAALGRDE
PPIDAGLSGY LGTHWTRFMS TLRQLEGLEI SSALDVGVFP PLVFQALLAN QFPNIDLHGL
WEGPNPYAQS VRPRPGYDVD GFEITLKVAN GERDPWPYPD ETFDLVTGME ILEHLALDPY
FFFCEAARVL KPGGNILITT PNVNSHRGVW KTLNNVAPYS FGIFVPSGGV YGRHNREYAP
TELDMLGRSA GFETVRLQTF DVYDRHIEPE LANLLVSRRD NLSLRGENIQ YLAKKVGSPK
GVPQSLYHGD PARMHGELQV VTHEEETGLL TISVRNTSYS IWPLEGDRAT CVLAEWVDEA
GVLRHQHLMQ PLTEVLSPGD EGRIRFTLDA QKQNLGVLRL HLYQSGVGVF SGRGRAVPLS
IPCSHSAFLT LVDKTPLPRA KV