Gene Information |
Locus tag | Dshi_3204 |
Symbol | |
ID | 5712260 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 3368449 |
End bp | 3369861 |
Gene Length | 1413 bp |
Protein Length | 470 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641269131 |
Product | polysaccharide deacetylase |
Protein accession | YP_001534538 |
Protein GI | 159045744 |
COG category | [G] Carbohydrate transport and metabolism; [S] Function unknown |
COG ID | [COG0726] Predicted xylanase/chitin deacetylase; [COG3195] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR03164] OHCU decarboxylase; [TIGR03212] putative urate catabolism protein |
|
|
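The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon of the CDS is a stop codon; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3368449
end_bp = 3369861

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1413 0 470
```

Both derived values match the record: 1413 bp and 470 aa.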
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.933502 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.3249 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACCGCT ATCCCCGAGA TTTGACAGGA TATGGCGCGA CCCCGCCGAC CGTAACATGG CCCGGTGGGG CGAAGATCGC CGTTCAGATC GTGCTGAACT ATGAAGAGGG GGGCGAAAAC AACATCCTCC ACGGGGATGC GGCGTCAGAG GCTTTCCTGT CCGAGATCAC GGGAGCGGCC CCCTGGCCCG GGCAGCGTCA CTGGAACATG GAATCGATCT ATGAATACGG CGCGCGGGCC GGGTTCTGGC GTGTGCACCG GCTTCTGAAG GACCTGCCGG TCACCGTGTA TGGCGTGGCC ACCGCCCTGG CGCGGGCCCC GGCGCAGGTT GCCGCGATGA AGGCCAGCGG GTGGGAGATC GCCTCGCACG GGCTGAAATG GGTCGAGCAC AAGGATATGG ACCCCGAGCA AGAGCGGGCG CAGATCCGCG AGGCCATCCA GCTGCATACG GAAGTGACCG GCAGCGCGCC GCGCGGGTGG TATACCGGGC GCTGTTCCAT GAACACGGTC GACCTGGCGG CCGAAGAGGC CGATTTGGCC TATATCGCGG ACAGTTACGC CGATGATCTG CCCTACTGGA TCCGCGCGGG CGGCAAGGAC CAGCTCATCG TGCCCTATAC GATGGATTGC AACGACATGC GCTTTGCGAT CCAGGCGGGC TACACCAATG GCGACCAGTT CGAGAGCTAT CTCAAGGACA GTTTCGACAT GCTTTATGCC GAAGGAGAGG CAGGGCATCC GGCGATCCTG TCCATCGGTC TGCATTGCCG CCTGATCGGG CGGCCCGGGC GGGCCATGGC GCTGAAACGT GCGCTGGAGC ATTTCCGCAA ACACGAGGGC GTCTGGTTCG CCACGCGCGA ACAGATCGCC GACCATTGGG CGCAGCAACA CCCCCCCGCC GCGCAGCGAC GGCCCTCCGA GATGGATCGC GACAGCTTCG TGGCCGCGTT CGGCGGCATT TTTGAGCACA GCCCCTGGAT CGCCGAAGGC GCCCACGCGC TGGAGCTTGG TCCGACCCAT GACAATGCCG CCGGGGTGCA CAACGCCCTG TGCCGGATTT TCCGCGGCGC GTCCGAAGCG CAACGGCTGA GCGTTCTGAC CGCGCACCCG GATCTTGCGG GCAAACTGGC GGCGGCGGGC AAGCTCACCG CGGAAAGCAC GGCGGAACAG GCGGGCGCGG GGCTCGACCT GCTGACCGAC GCGGAGCGCG CGACATTCCA GAAGCTCAAC GCGCAGTACG TGGCGCGGCA CGGGTTTCCC TTCATCATCG CGGTCAAGGA CAACACCAAG GCCACGATCC TCGACGCGTT CCACCGCCGT ATCGAAAACG ACCGGGAGAC CGAGTTCGCT GAGGCCTGCC GCCAGGTGGA GCGTATCGCG GAACTGCGCC TGATCGAGAA GCTGGGCGCA TGA
|
Protein sequence | MNRYPRDLTG YGATPPTVTW PGGAKIAVQI VLNYEEGGEN NILHGDAASE AFLSEITGAA PWPGQRHWNM ESIYEYGARA GFWRVHRLLK DLPVTVYGVA TALARAPAQV AAMKASGWEI ASHGLKWVEH KDMDPEQERA QIREAIQLHT EVTGSAPRGW YTGRCSMNTV DLAAEEADLA YIADSYADDL PYWIRAGGKD QLIVPYTMDC NDMRFAIQAG YTNGDQFESY LKDSFDMLYA EGEAGHPAIL SIGLHCRLIG RPGRAMALKR ALEHFRKHEG VWFATREQIA DHWAQQHPPA AQRRPSEMDR DSFVAAFGGI FEHSPWIAEG AHALELGPTH DNAAGVHNAL CRIFRGASEA QRLSVLTAHP DLAGKLAAAG KLTAESTAEQ AGAGLDLLTD AERATFQKLN AQYVARHGFP FIIAVKDNTK ATILDAFHRR IENDRETEFA EACRQVERIA ELRLIEKLGA
|
| |
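The gene sequence begins with GTG while the protein begins with M, which is expected when an alternative start codon is read as methionine under translation table 11. The following is a minimal sketch of how the two sequences relate, not the database's own method. The function names `translate_cds` and `gc_percent` are hypothetical, the start codon set is a deliberately small subset chosen for illustration, and the input is assumed to be given in coding orientation, as the listed sequence appears to be.

```python
from itertools import product

# Amino acid assignments for codons ordered with bases T, C, A, G
# (these assignments are shared by the standard code and table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a small subset of initiation codons, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a coding-strand CDS, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            # An initiation codon is read as methionine, whatever its
            # usual internal meaning (GTG is valine elsewhere).
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence above.
print(translate_cds("GTGAACCGCT ATCCCCGAGA TTTGACAGGA"))  # prints: MNRYPRDLTG
```

Passing the full gene sequence to `translate_cds` and `gc_percent` gives values that can be compared with the Protein sequence and GC content fields of the record.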