Gene Rsph17029_0206 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_0206 
Symbol 
ID4897476 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp224482 
End bp225897 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content69% 
IMG OID640110789 
Productpolysaccharide deacetylase 
Protein accessionYP_001042097 
Protein GI126460983 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0726] Predicted xylanase/chitin deacetylase
[COG3195] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR03164] OHCU decarboxylase
[TIGR03212] putative urate catabolism protein 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGAT ATCCCCGAGA CATGCGCGGC CATGGGCCCA CTCCGCCCGA TGCGGCCTGG 
CCCGGCGGAG CCCGGATCGC CGTCTCGATC GTGCTGAACT ACGAGGAGGG CGGCGAGAAC
TGCCTCCTGC ATGGAGATGC GCAGTCGGAG GCCTTCCTGT CCGACATCGC CGGCGCCCAG
CCCTGGCCGG GTCAGCGGCA CTGGAACATG GAATCGATCT ACGATTACGG TGCCCGCGCG
GGCTTCTGGC GGCTGCACCG GCTCTTCACC GGGCTGAACA TCCCGGTGAC GGTCTATGGC
GTGGCCACCG CGCTGGCCCG CTCGCCCGAG CAGGTGGCGG CGATGAAATC CGCCGGGTGG
GAGATCGCCT CTCACGGGCT GAAATGGGTC GAGCACCGCG ACATGCCCGA GGAGGAGGAG
CGCCGCCAGA TCGCCGAGGC GATCCGGCTG CATACCGAGG TGGTGGGCGA GCGACCGCGC
GGCTGGTATA CGGGGCGCTG CTCGGTCAAT ACCGTGCGGC TCACCGCCGA AGAGGGCGGC
TTCGACTGGA TCTCGGACAC CTATGACGAC GACCTGCCCT ACTGGCTCGA GACCGGCACG
CGCGACCAGC TGGTGATCCC CTACACGCTC GAGGCCAACG ACATGCGCTT CGCCACGGCG
CCGGGCTATA TCGAGGGCGA GCAGTTCTTC ACCTATCTGC GCGACAGTTT CGACACGCTC
TATGCCGAAG GCTGTGCGGG GCAGGCGAAG ATGTTCTCGA TCGGGCTCCA TTGCCGGCTG
ATCGGGCGAC CCGGCAAGAT CGCGGGGCTG AAGCGCTTTC TGGACTATGC CCGCACCCAC
GAGCGGGTCT GGTTCCCGCG CCGGGGCGAC ATCGCCCGCC ACTGGCACGA GACCCATCCC
CATCGCCGCC GCCCGCGGCC CTCGCGCATG GACCGCGAGA GCTTCGAGGC CCATTTCGGC
GGGATCTACG AACATTCGCC CTGGATCGCC GAGCGCGCCT TCGAGCTGGA ACTGGGCCCC
GCCCATGACA GCCCTGCGGG CCTCGCCAAT GCGCTCGCCC GCATCTTCCG CAGCGCCACG
CCCGCGGAGC GGCTGAGCGT GCTCAAGGCC CACCCCGATC TCGCCGGCAA GCTCGCGCAG
GCGCGGCGGC TGACGGCCTC CTCCTCCTTC GAACAGTCGA GCGCCGGGCT CGATGCGCTC
ACCGACGCCG AGCGGGCCGA GTTCGCCACC CTCAACGCCG ACTATGTGGC CAAGCACGGC
TTCCCCTTCA TCATCGCCGT GCGCGACCAC GACAAGGCCG GCATCCTCGC CGCCTTCGAG
ACCCGGCTCG CCCACGACAG CGCCACCGAA TTCGCCACCG CCTGCCGGCA GGTGGAACGT
ATCGCCGAAC TCCGGCTTCA GGACATGCTG AAATGA
 
Protein sequence
MKRYPRDMRG HGPTPPDAAW PGGARIAVSI VLNYEEGGEN CLLHGDAQSE AFLSDIAGAQ 
PWPGQRHWNM ESIYDYGARA GFWRLHRLFT GLNIPVTVYG VATALARSPE QVAAMKSAGW
EIASHGLKWV EHRDMPEEEE RRQIAEAIRL HTEVVGERPR GWYTGRCSVN TVRLTAEEGG
FDWISDTYDD DLPYWLETGT RDQLVIPYTL EANDMRFATA PGYIEGEQFF TYLRDSFDTL
YAEGCAGQAK MFSIGLHCRL IGRPGKIAGL KRFLDYARTH ERVWFPRRGD IARHWHETHP
HRRRPRPSRM DRESFEAHFG GIYEHSPWIA ERAFELELGP AHDSPAGLAN ALARIFRSAT
PAERLSVLKA HPDLAGKLAQ ARRLTASSSF EQSSAGLDAL TDAERAEFAT LNADYVAKHG
FPFIIAVRDH DKAGILAAFE TRLAHDSATE FATACRQVER IAELRLQDML K