Gene TM1040_0228 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0228 
Symbol 
ID4076261 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp242545 
End bp243768 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content63% 
IMG OID638005522 
Productglycosyl transferase, group 1 
Protein accessionYP_612223 
Protein GI99080069 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.219003 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTGC TCGTCGTATC GACGAATGCT GCCCTCACCA TGGGGGGCGA GGCGATGAAG 
GCGCTGCAGT ATATGCAGCA GCTCTTGGCG GATGGACGCG ATGCCACTCT CATCACCCAT
GAACGCTGTC GCGAGGCTCT TGCGGGGCAA TTGCCAGAAG ATCGGGTGAT CTATGTGCAT
GACAGCCGTG CAATGAAGGC CTGTTGGCGC ACGCCGGGGC TTGGGCGGTT GGTGAACAGT
TTTTTTCACC TCGAGGTCGC CCGGATCTGT CGCGGCTTTA ACCCGAGCGA GGTGGTGATC
CACTATCTTT GTCCGATCTC CCCCGTCGAG CAGCGCTTCC CGCCGAAGGG GTATCGCTAT
GTCATCGGCC CGCTTTCAGG CAATATCTTC TACCCAGAGG GGTTTCGACA TCTTGCGGGG
CGGGGGCTGC GCCTGCAGCA TCAGGCGTAT CGGCCTTTGC AGATGGCGCT TGGCCTCTTG
TCTAGGCAAT TCACGCGCGC CTCGACCGTG TTGGTCTCTG GCTATGACCG TACCCGAGAG
GCCCTCGGCT GGGCGGGTTG CCCGGAGGCC CGCATGCAGG ACGTCTGGGA TGCGGGCCTG
TCTCCAGATT TCTTTGCGCG TTCCCGGATC CGGCCGGGCA AGAACCCGGC GCATTTTGTG
TGGATTGGAC GTATGGTGCC CTACAAGGGG GCGGATCTTG CGTTGCGCGC GCTGGCGCTT
GCCCCGGCAG AGGCACGGCT CACGCTCTAT GGAGATGGGC CGGATCGTGC CGAACTGGAG
GCGCTCGCCC GCGATCTTGG CCTGATGTCG CGGGTCACCT TTGCGGGCTG GCTTGCGCAT
GGGGATCTCT CCGAGGCGTT GGGCCAGTAC CGAGCACTTT TGTTCCCGAG CCTCAAAGAA
GCCAACGGCA TCATCGTGCA GGAATGTATG GCGATCGGCT TGCCGGTCGT GGCCTTGCGC
TGGGGCGGGC CTGTGGGGCT CGCGGATGAC ACTGAGGCGC TGTTTGTCGA GGCGCAGAAT
GCCGTACAGG TCGAGCAGGA CTTGGCTGCG GCCATGGCGC GTCTGACAGA AGACCCAGCC
CTTGCGGAGG CGCTCTCTGA TGCGGCGCGA CGCAAGGCTG AAAACGAGTT CCCCTGGCCG
CAGGTGGCCC AAAGCTGGTG CAGCGCAGCG CTTCGCGCGC AGGATGCGGC TGCAGCAGAG
CCAAAGCACC GCGGCGGGGG TTGA
 
Protein sequence
MKLLVVSTNA ALTMGGEAMK ALQYMQQLLA DGRDATLITH ERCREALAGQ LPEDRVIYVH 
DSRAMKACWR TPGLGRLVNS FFHLEVARIC RGFNPSEVVI HYLCPISPVE QRFPPKGYRY
VIGPLSGNIF YPEGFRHLAG RGLRLQHQAY RPLQMALGLL SRQFTRASTV LVSGYDRTRE
ALGWAGCPEA RMQDVWDAGL SPDFFARSRI RPGKNPAHFV WIGRMVPYKG ADLALRALAL
APAEARLTLY GDGPDRAELE ALARDLGLMS RVTFAGWLAH GDLSEALGQY RALLFPSLKE
ANGIIVQECM AIGLPVVALR WGGPVGLADD TEALFVEAQN AVQVEQDLAA AMARLTEDPA
LAEALSDAAR RKAENEFPWP QVAQSWCSAA LRAQDAAAAE PKHRGGG