Gene TM1040_3475 details


Gene Information

Locus tag: TM1040_3475
Symbol:
ID: 4075109
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 499399
End bp: 500814
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 64%
IMG OID: 638004984
Product: polysaccharide deacetylase
Protein accession: YP_611709
Protein GI: 99078451
COG category: [G] Carbohydrate transport and metabolism
              [S] Function unknown
COG ID: [COG0726] Predicted xylanase/chitin deacetylase
        [COG3195] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR03164] OHCU decarboxylase
            [TIGR03212] putative urate catabolism protein
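
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked against each other. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends with a stop codon; the record does not state either convention explicitly. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length_bp(499399, 500814)
print(length, protein_length_aa(length))  # prints "1416 471"
```

Under these assumptions the computed values agree with the 1416 bp and 471 aa reported in the record.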


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.40619
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACACGCT ATCCCCGCGA CATGCGCGGC TACGGCGCAA CGCCCCCCCA CCCTGCCTGG 
CCAAATGGCG CAAAGATCGC CGTGCAATTT GTCCTGAACT ACGAGGAAGG GGGCGAGAAC
TGCACCCTGC ACGGGGATGC GGCCTCCGAG GCGTTTCTCT CCGACATCCC CGGCGCTGCG
CAATGGCCGG GCCAGCGCCA CTGGAACATG GAGTCGATCT ATGAATATGG CGCGCGCGCA
GGCTTTTGGC GTCTGCACCG CCTGTTCACC GGCGCGGGCA TCCCGCTGAC CATCTACGGC
GTCGCCAGTG CGCTTGCCCG CAGCCCCGAG CAGCTGAAGG CGATGAAGGA CGCCGACTGG
GAAATCGCCT CTCATGGTCT CAAATGGGTC GAACACAAGG ACATGGCCGA GGACGACGAG
CGCGCCTCCA TCAAAGAGGC GATCCGCCTA CATACCGAAG TGGTCGGCAC CCGCCCCCGC
GGCTGGTACA CCGGGCGCTG CAGCGCCAAT ACGGTGCGGC TCGTCGCCGA GGAAGGCGGA
TTTGACTATA TCTCCGACAC CTATGATGAC GACCTGCCCT ATTGGCTCGA GGTGGGCGCG
CGCGATCAGC TCATCATTCC CTACACGCTT GAAGCCAACG ACATGCGCTT TGCCACCGCG
CCGGGCTGGG TCACGGGATC TGATTTTGGG GACTATCTGA CCGACGCCTT TGATACGCTC
TACTCCGAAG GCGCGGCGGG GGCGCCCAAG ATGATGACCA TCGGTTTGCA CTGCCGCCTG
ATCGGGCGTC CGGGCAAGAT CGCCGCGCTC AAACGCTTTA TCGACCATAT CCAGAGCCAT
CCGGGCGTCT GGTGCCCGCG CCGCATCGAT ATCGCCGAAC ATTGGGCCAC AGAGCATCCG
CATCAGCGCC GCCAGCGCCC GAGCCAGATG GACCGAGACA CATTTGTGGG CGCTTATGGG
TCAATCTTTG AGCACTCCCC CTGGATTGCT GATCGCGCCT TTGATCTCGA ACTTGGACCC
GCGCATGATT GCGCGGCGGG CGTGCATAAT GCGCTCTGCC GGATCTTCCG CAGCGCATCC
GAGGACGAAC GCCTCGGCGT TTTGACCGCG CACCCGGATC TTGCGGGCAA ACTCGCCTCT
GCCGGACGCC TCACCGCCGA GAGCACCTCG GAACAGGCCA GTGCCGGGCT CAACCTTCTG
ACCGACGCGG AGCGCGAGAC CTTTACCGCG CTCAACACCG CCTACGTGGA AAAGCACGGC
TTTCCCTTCA TCATCGCGGT GCGCGATCAC GACAAGGCGT CGATCATGGC GGCCTTCAAG
CGCCGCATCG ACAATGACCG CGCCGCGGAA TTTGACGAGG CCTGCAGACA GGTCGAGCGC
ATCGCAGAGT TTCGCCTGAT GGACCTCCTG CCATGA
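
The GC content field can in principle be recomputed from the gene sequence above. The following is a minimal sketch: it strips the spaces and line breaks used in the 10-base block layout and ignores any character other than A, C, G or T, which is an assumption about how ambiguous bases should be handled. The function name `gc_content` is illustrative.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C among the unambiguous bases of a DNA string."""
    # Keep only A/C/G/T; spaces, newlines and ambiguity codes are skipped.
    bases = [base for base in sequence.upper() if base in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G or T characters")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two 10-base blocks of the gene sequence shown in this record.
print(f"{gc_content('GTGACACGCT ATCCCCGCGA'):.2f}")  # prints "0.65"
```

Passing the full gene sequence to the same function gives a value that can be compared with the reported GC content.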
 
Protein sequence
MTRYPRDMRG YGATPPHPAW PNGAKIAVQF VLNYEEGGEN CTLHGDAASE AFLSDIPGAA 
QWPGQRHWNM ESIYEYGARA GFWRLHRLFT GAGIPLTIYG VASALARSPE QLKAMKDADW
EIASHGLKWV EHKDMAEDDE RASIKEAIRL HTEVVGTRPR GWYTGRCSAN TVRLVAEEGG
FDYISDTYDD DLPYWLEVGA RDQLIIPYTL EANDMRFATA PGWVTGSDFG DYLTDAFDTL
YSEGAAGAPK MMTIGLHCRL IGRPGKIAAL KRFIDHIQSH PGVWCPRRID IAEHWATEHP
HQRRQRPSQM DRDTFVGAYG SIFEHSPWIA DRAFDLELGP AHDCAAGVHN ALCRIFRSAS
EDERLGVLTA HPDLAGKLAS AGRLTAESTS EQASAGLNLL TDAERETFTA LNTAYVEKHG
FPFIIAVRDH DKASIMAAFK RRIDNDRAAE FDEACRQVER IAEFRLMDLL P
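
The protein sequence follows from the gene sequence under translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below is one way to reproduce that step without external libraries. It assumes the input is a complete coding sequence in frame, and it translates an alternative start codon such as GTG as methionine when it occurs in the first position. The names `CODON_TABLE`, `START_CODONS` and `translate_cds` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; table 11 uses the same
# codon-to-residue assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and newlines
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for index in range(0, len(seq), 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
print(translate_cds("GTGACACGCT ATCCCCGCGA CATGCGCGGC"))  # prints "MTRYPRDMRG"
```

The output matches the first ten residues of the protein sequence listed above; the initial GTG codon is why the protein begins with M rather than V.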