Gene TM1040_2295 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2295 
Symbol 
ID4078479 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2414270 
End bp2415799 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content61% 
IMG OID638007617 
Productglycosyl transferase family protein 
Protein accessionYP_614289 
Protein GI99082135 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTGGCC GAACCGCACT GGTGCTGATG GCCGATCCCG CCAGCCTTGC CGCCCTAAGG 
CAGGATCTTC GGGGTCAATT TGATGAGGTG ATCCCAATTC CCACCCCCGC GGCAGAGATT
GAGGCCATCT TGCTGAGGGC GCTTCAGGCA CAGATGACAC GGGAGGCTGC GCGCAGCGTT
GCCCCCGCCC TCAGTTGCCG CAGCTTTGAT TACCGCGCTG CCCGCGCCCC AGCCTTTGCA
ACGGGGACGC TTTTGTGTCT GTTTTCCATT CTCGCGCCGC ATCTTGTGAC TGCGGTGTTG
GCTGTAGCGT CGCTTGTGAC ATTGCTGATG TTCACGGCCC TCAGGATCTC CGGCCTCTTG
GCGGCGGCAC GCCCGGACCA GCCGAAATCC GAAACACCCA AGGATCTGCC TCAGATGTCG
ATGCTGGTGC CGCTCTATCG TGAGGCGGAA ATCGGCAAGC ATCTCTTGCG CCGCCTGTGC
CGCCTCACCT ATCCGCGCGA CCGCCTTGAG GTATTGCTTG TCCTTGAGGA AAACGACGAT
GTGACCCGAA ATGCCGTAAA ATGCGCAGAC CTGCCCGATT GGTTCCGGGT GGTCGAGGTG
CCGGGCGACG GCACGCTCAC CACAAAGCCG CGCGCAATGA ACTATGCGCT GAATTTCTGC
CGGGGAGAGA TCATCGGGAT CTGGGATGCT GAAGATGCCC CAGCGCCAGA CCAACTCGAG
AGTGCGGCCA GCGCCTTTGC TCATGCCCCC CCCGACGTGG TGTGCTTTCA GGGAATTCTC
GATTTCTACA ATCCCAGCCG CAATCTGATT TCGCGCTGCT TTACGCTCGA ATATGCGGGA
TGGTTTCGCG TCCTGCTCCA AGGCATCGCG CGGCTGGGGC TGGTGATCCC GCTTGGAGGC
ACCACGCTGT TTATCCGCCG CGACGCGCTC GAACAGCTGG GCGCGTGGGA TGCACATAAC
GTCACCGAGG ATGCCGACCT TGGCGTGCGG ATTGCGCGCG CGTGCTATCG CACCGAAATG
CTGCCCACAA CCACCTATGA AGAAGCCAAC AGCCGCATCA CGCCCTGGAT CAAACAGCGG
TCTCGCTGGC TGAAGGGGTT CATGATGACC TATCTGGTCC ACATGCGCGC CCCAAAGGCA
CTGTTACGGG ATGTCGGGTG GCGGCGTTTC TGGGGGCTAC AGGCGTTCTT TCTGGGCACC
CTCGGGCAAT TCCTGCTGGC ACCAGTCCTC TGGAGCTTCT GGCTGGTGGC GCTCGGAGTA
TCGCATCCGC TCGAAGCGTC ACTGCCCCGG GATATGCTGT CTGTCGCTGT CGGGGCGCTT
GTGTTCTTTG AGGTGCTCAA CCTGTGCATC TGGTATTGCG GCGCACGGGC TTCGGGGCGG
CCAGTCCTCG CGTTCTGCGC GCCCCTGATG CCTCTCTATT TCATACTTGG CTGTTTTGCC
GCCTACAAAG CCCTCTGGGA GGTGTTCGCA GCGCCGTTTT TCTGGGACAA GACCGCGCAT
GGGGATCATG GCGGCACCAC AGAGCATTGA
 
Protein sequence
MLGRTALVLM ADPASLAALR QDLRGQFDEV IPIPTPAAEI EAILLRALQA QMTREAARSV 
APALSCRSFD YRAARAPAFA TGTLLCLFSI LAPHLVTAVL AVASLVTLLM FTALRISGLL
AAARPDQPKS ETPKDLPQMS MLVPLYREAE IGKHLLRRLC RLTYPRDRLE VLLVLEENDD
VTRNAVKCAD LPDWFRVVEV PGDGTLTTKP RAMNYALNFC RGEIIGIWDA EDAPAPDQLE
SAASAFAHAP PDVVCFQGIL DFYNPSRNLI SRCFTLEYAG WFRVLLQGIA RLGLVIPLGG
TTLFIRRDAL EQLGAWDAHN VTEDADLGVR IARACYRTEM LPTTTYEEAN SRITPWIKQR
SRWLKGFMMT YLVHMRAPKA LLRDVGWRRF WGLQAFFLGT LGQFLLAPVL WSFWLVALGV
SHPLEASLPR DMLSVAVGAL VFFEVLNLCI WYCGARASGR PVLAFCAPLM PLYFILGCFA
AYKALWEVFA APFFWDKTAH GDHGGTTEH