Gene TM1040_3689 details

Gene Information

Locus tag: TM1040_3689
Symbol:
ID: 4075658
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 750163
End bp: 751407
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 64%
IMG OID: 638005209
Product: glycosyl transferase, group 1
Protein accession: YP_611918
Protein GI: 99078660
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
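
The coordinates and lengths in the table above are mutually consistent, and the check can be sketched in a few lines of Python. This is an illustrative sketch only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 750163
END_BP = 751407
GENE_LENGTH_BP = 1245
PROTEIN_LENGTH_AA = 414


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 751407 - 750163 + 1 gives 1245 bp, and 1245 / 3 - 1 gives 414 aa, matching the reported values.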


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.466266
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.187253
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGGAA AACGCCCCGC GCTTGCTGTG TTGGTCAAAG GCTGGCCGCG CCTTTCCGAG 
ACCTTTATCG CGCAGGAACT TGTCGCTCTT GAGGCGGCGG GGCAGCCTTT TGAGATTTGG
TCGCTGCGTC ACCCCACCGA CACCAAAACT CACCCGCTTC ATGATCGCCT CCAGGCACCG
GTGCATTACC TGCCGGAGTA TCTCTATGAT GCGCCCGCTC GGGTGGCCGA GGCTCGCACT
CGGGCGCAGA CGCTGCCGGG CTACGCGGCG GCCTACGAAG TCTGGCGCGC TGATCTGCGC
CGCGACCCGA CGCACAACCG CATTCGCCGC TTTGGTCAGG CCTGTGTCCT GGCGGCGGAA
CTGCCGCCCG AGGTGCGCGG CCTTTATGCC CATTTCCTGC ATACGCCCGC TTCGGTGGCG
CGCTATGCCG CAATCATGCG GGGCCTGCCG TGGAGCTTTT CGGCCCATGC AAAAGACATC
TGGACCTCGC CCGAATGGGA GTTGCGCGAA AAGCTCTCGG CGGCCAGCCA TGGCGCGGCC
TTTGGGGCCA CCTGCACAGG GTTTGGCGCG AAGCATCTAC AAGAGCTCTC TGACGGCACG
CCTGTGGATC TGATCTATCA CGGGCTTGAT CTGTCGCGGT TCCCCGCCCC CCCTGCGCGT
GTACTGCGCA GCCCGAATGC GCCGTTTCAC ATGATGTCGG TGGGGCGGCT GGTGGAGAAG
AAAGGCTTTG ACCGCTTGAT CGCCGCGCTT GCGCTCCTGC CTCGGGATCT TGACTGGCAC
TGGACCCATA TCGGTGGTGG CGGACTTGGG GATCTGTTGC AGGGCATGGC CGAAGACGCA
GGCATTTCTG CTCGTATCAC ATGGCGCGGC GCCTGCGATC AGCCCGAGGT GATTGATGCG
ATGCGTGCGG CGGATCTCTT TGTGCTGCCT TCCCGTGTGG CTTCGGATGG CGACCGCGAC
GGCTTGCCCA ATGTGCTGAT GGAGGCGGCT TCGCAAGGCC TGCCGATCCT CTCGACCCCG
GTGTCGGCTA TTCCCGAGTT CATCGAAAGT GGCACCCATG GCCTCCTCAG CAGCGACGCG
CCCGAGGCTT TGGCGGACGC GATGCTGCGT TTGGCCCATG CGCCCGAAGA GGCGCAGCGC
ATGGCCAAAG CCGCGCTTCT GCGTCTGCGC GCTGAGTTTG GCATGGATCC GGGTATTGCG
CAGTTGAACA CGCGCCTCAA TGCGATGCTG AAGGACGCTG GATGA
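
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be cleaned before any computation. The following is a minimal sketch, not the database's own method, of how a GC percentage such as the reported 64% could be recomputed from the pasted text. The function names are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence, used here only as sample input.
    first_line = (
        "ATGACTGGAA AACGCCCCGC GCTTGCTGTG "
        "TTGGTCAAAG GCTGGCCGCG CCTTTCCGAG"
    )
    print(len(clean_sequence(first_line)))
    print(round(gc_percent(first_line), 1))
```

Passing the full multi-line gene sequence as a single string works the same way, since all whitespace is stripped before counting.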
 
Protein sequence
MTGKRPALAV LVKGWPRLSE TFIAQELVAL EAAGQPFEIW SLRHPTDTKT HPLHDRLQAP 
VHYLPEYLYD APARVAEART RAQTLPGYAA AYEVWRADLR RDPTHNRIRR FGQACVLAAE
LPPEVRGLYA HFLHTPASVA RYAAIMRGLP WSFSAHAKDI WTSPEWELRE KLSAASHGAA
FGATCTGFGA KHLQELSDGT PVDLIYHGLD LSRFPAPPAR VLRSPNAPFH MMSVGRLVEK
KGFDRLIAAL ALLPRDLDWH WTHIGGGGLG DLLQGMAEDA GISARITWRG ACDQPEVIDA
MRAADLFVLP SRVASDGDRD GLPNVLMEAA SQGLPILSTP VSAIPEFIES GTHGLLSSDA
PEALADAMLR LAHAPEEAQR MAKAALLRLR AEFGMDPGIA QLNTRLNAML KDAG