Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3475 |
Symbol | |
ID | 4075109 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | - |
Start bp | 499399 |
End bp | 500814 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638004984 |
Product | polysaccharide deacetylase |
Protein accession | YP_611709 |
Protein GI | 99078451 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0726] Predicted xylanase/chitin deacetylase [COG3195] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR03164] OHCU decarboxylase [TIGR03212] putative urate catabolism protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.40619 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACACGCT ATCCCCGCGA CATGCGCGGC TACGGCGCAA CGCCCCCCCA CCCTGCCTGG CCAAATGGCG CAAAGATCGC CGTGCAATTT GTCCTGAACT ACGAGGAAGG GGGCGAGAAC TGCACCCTGC ACGGGGATGC GGCCTCCGAG GCGTTTCTCT CCGACATCCC CGGCGCTGCG CAATGGCCGG GCCAGCGCCA CTGGAACATG GAGTCGATCT ATGAATATGG CGCGCGCGCA GGCTTTTGGC GTCTGCACCG CCTGTTCACC GGCGCGGGCA TCCCGCTGAC CATCTACGGC GTCGCCAGTG CGCTTGCCCG CAGCCCCGAG CAGCTGAAGG CGATGAAGGA CGCCGACTGG GAAATCGCCT CTCATGGTCT CAAATGGGTC GAACACAAGG ACATGGCCGA GGACGACGAG CGCGCCTCCA TCAAAGAGGC GATCCGCCTA CATACCGAAG TGGTCGGCAC CCGCCCCCGC GGCTGGTACA CCGGGCGCTG CAGCGCCAAT ACGGTGCGGC TCGTCGCCGA GGAAGGCGGA TTTGACTATA TCTCCGACAC CTATGATGAC GACCTGCCCT ATTGGCTCGA GGTGGGCGCG CGCGATCAGC TCATCATTCC CTACACGCTT GAAGCCAACG ACATGCGCTT TGCCACCGCG CCGGGCTGGG TCACGGGATC TGATTTTGGG GACTATCTGA CCGACGCCTT TGATACGCTC TACTCCGAAG GCGCGGCGGG GGCGCCCAAG ATGATGACCA TCGGTTTGCA CTGCCGCCTG ATCGGGCGTC CGGGCAAGAT CGCCGCGCTC AAACGCTTTA TCGACCATAT CCAGAGCCAT CCGGGCGTCT GGTGCCCGCG CCGCATCGAT ATCGCCGAAC ATTGGGCCAC AGAGCATCCG CATCAGCGCC GCCAGCGCCC GAGCCAGATG GACCGAGACA CATTTGTGGG CGCTTATGGG TCAATCTTTG AGCACTCCCC CTGGATTGCT GATCGCGCCT TTGATCTCGA ACTTGGACCC GCGCATGATT GCGCGGCGGG CGTGCATAAT GCGCTCTGCC GGATCTTCCG CAGCGCATCC GAGGACGAAC GCCTCGGCGT TTTGACCGCG CACCCGGATC TTGCGGGCAA ACTCGCCTCT GCCGGACGCC TCACCGCCGA GAGCACCTCG GAACAGGCCA GTGCCGGGCT CAACCTTCTG ACCGACGCGG AGCGCGAGAC CTTTACCGCG CTCAACACCG CCTACGTGGA AAAGCACGGC TTTCCCTTCA TCATCGCGGT GCGCGATCAC GACAAGGCGT CGATCATGGC GGCCTTCAAG CGCCGCATCG ACAATGACCG CGCCGCGGAA TTTGACGAGG CCTGCAGACA GGTCGAGCGC ATCGCAGAGT TTCGCCTGAT GGACCTCCTG CCATGA
|
Protein sequence | MTRYPRDMRG YGATPPHPAW PNGAKIAVQF VLNYEEGGEN CTLHGDAASE AFLSDIPGAA QWPGQRHWNM ESIYEYGARA GFWRLHRLFT GAGIPLTIYG VASALARSPE QLKAMKDADW EIASHGLKWV EHKDMAEDDE RASIKEAIRL HTEVVGTRPR GWYTGRCSAN TVRLVAEEGG FDYISDTYDD DLPYWLEVGA RDQLIIPYTL EANDMRFATA PGWVTGSDFG DYLTDAFDTL YSEGAAGAPK MMTIGLHCRL IGRPGKIAAL KRFIDHIQSH PGVWCPRRID IAEHWATEHP HQRRQRPSQM DRDTFVGAYG SIFEHSPWIA DRAFDLELGP AHDCAAGVHN ALCRIFRSAS EDERLGVLTA HPDLAGKLAS AGRLTAESTS EQASAGLNLL TDAERETFTA LNTAYVEKHG FPFIIAVRDH DKASIMAAFK RRIDNDRAAE FDEACRQVER IAEFRLMDLL P
|
| |