Gene Dshi_3204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_3204 
Symbol 
ID5712260 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp3368449 
End bp3369861 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content66% 
IMG OID641269131 
Productpolysaccharide deacetylase 
Protein accessionYP_001534538 
Protein GI159045744 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0726] Predicted xylanase/chitin deacetylase
[COG3195] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR03164] OHCU decarboxylase
[TIGR03212] putative urate catabolism protein 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.933502 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.3249 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACCGCT ATCCCCGAGA TTTGACAGGA TATGGCGCGA CCCCGCCGAC CGTAACATGG 
CCCGGTGGGG CGAAGATCGC CGTTCAGATC GTGCTGAACT ATGAAGAGGG GGGCGAAAAC
AACATCCTCC ACGGGGATGC GGCGTCAGAG GCTTTCCTGT CCGAGATCAC GGGAGCGGCC
CCCTGGCCCG GGCAGCGTCA CTGGAACATG GAATCGATCT ATGAATACGG CGCGCGGGCC
GGGTTCTGGC GTGTGCACCG GCTTCTGAAG GACCTGCCGG TCACCGTGTA TGGCGTGGCC
ACCGCCCTGG CGCGGGCCCC GGCGCAGGTT GCCGCGATGA AGGCCAGCGG GTGGGAGATC
GCCTCGCACG GGCTGAAATG GGTCGAGCAC AAGGATATGG ACCCCGAGCA AGAGCGGGCG
CAGATCCGCG AGGCCATCCA GCTGCATACG GAAGTGACCG GCAGCGCGCC GCGCGGGTGG
TATACCGGGC GCTGTTCCAT GAACACGGTC GACCTGGCGG CCGAAGAGGC CGATTTGGCC
TATATCGCGG ACAGTTACGC CGATGATCTG CCCTACTGGA TCCGCGCGGG CGGCAAGGAC
CAGCTCATCG TGCCCTATAC GATGGATTGC AACGACATGC GCTTTGCGAT CCAGGCGGGC
TACACCAATG GCGACCAGTT CGAGAGCTAT CTCAAGGACA GTTTCGACAT GCTTTATGCC
GAAGGAGAGG CAGGGCATCC GGCGATCCTG TCCATCGGTC TGCATTGCCG CCTGATCGGG
CGGCCCGGGC GGGCCATGGC GCTGAAACGT GCGCTGGAGC ATTTCCGCAA ACACGAGGGC
GTCTGGTTCG CCACGCGCGA ACAGATCGCC GACCATTGGG CGCAGCAACA CCCCCCCGCC
GCGCAGCGAC GGCCCTCCGA GATGGATCGC GACAGCTTCG TGGCCGCGTT CGGCGGCATT
TTTGAGCACA GCCCCTGGAT CGCCGAAGGC GCCCACGCGC TGGAGCTTGG TCCGACCCAT
GACAATGCCG CCGGGGTGCA CAACGCCCTG TGCCGGATTT TCCGCGGCGC GTCCGAAGCG
CAACGGCTGA GCGTTCTGAC CGCGCACCCG GATCTTGCGG GCAAACTGGC GGCGGCGGGC
AAGCTCACCG CGGAAAGCAC GGCGGAACAG GCGGGCGCGG GGCTCGACCT GCTGACCGAC
GCGGAGCGCG CGACATTCCA GAAGCTCAAC GCGCAGTACG TGGCGCGGCA CGGGTTTCCC
TTCATCATCG CGGTCAAGGA CAACACCAAG GCCACGATCC TCGACGCGTT CCACCGCCGT
ATCGAAAACG ACCGGGAGAC CGAGTTCGCT GAGGCCTGCC GCCAGGTGGA GCGTATCGCG
GAACTGCGCC TGATCGAGAA GCTGGGCGCA TGA
 
Protein sequence
MNRYPRDLTG YGATPPTVTW PGGAKIAVQI VLNYEEGGEN NILHGDAASE AFLSEITGAA 
PWPGQRHWNM ESIYEYGARA GFWRVHRLLK DLPVTVYGVA TALARAPAQV AAMKASGWEI
ASHGLKWVEH KDMDPEQERA QIREAIQLHT EVTGSAPRGW YTGRCSMNTV DLAAEEADLA
YIADSYADDL PYWIRAGGKD QLIVPYTMDC NDMRFAIQAG YTNGDQFESY LKDSFDMLYA
EGEAGHPAIL SIGLHCRLIG RPGRAMALKR ALEHFRKHEG VWFATREQIA DHWAQQHPPA
AQRRPSEMDR DSFVAAFGGI FEHSPWIAEG AHALELGPTH DNAAGVHNAL CRIFRGASEA
QRLSVLTAHP DLAGKLAAAG KLTAESTAEQ AGAGLDLLTD AERATFQKLN AQYVARHGFP
FIIAVKDNTK ATILDAFHRR IENDRETEFA EACRQVERIA ELRLIEKLGA