Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_0206 |
Symbol | |
ID | 4897476 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 224482 |
End bp | 225897 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640110789 |
Product | polysaccharide deacetylase |
Protein accession | YP_001042097 |
Protein GI | 126460983 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0726] Predicted xylanase/chitin deacetylase [COG3195] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR03164] OHCU decarboxylase [TIGR03212] putative urate catabolism protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCGAT ATCCCCGAGA CATGCGCGGC CATGGGCCCA CTCCGCCCGA TGCGGCCTGG CCCGGCGGAG CCCGGATCGC CGTCTCGATC GTGCTGAACT ACGAGGAGGG CGGCGAGAAC TGCCTCCTGC ATGGAGATGC GCAGTCGGAG GCCTTCCTGT CCGACATCGC CGGCGCCCAG CCCTGGCCGG GTCAGCGGCA CTGGAACATG GAATCGATCT ACGATTACGG TGCCCGCGCG GGCTTCTGGC GGCTGCACCG GCTCTTCACC GGGCTGAACA TCCCGGTGAC GGTCTATGGC GTGGCCACCG CGCTGGCCCG CTCGCCCGAG CAGGTGGCGG CGATGAAATC CGCCGGGTGG GAGATCGCCT CTCACGGGCT GAAATGGGTC GAGCACCGCG ACATGCCCGA GGAGGAGGAG CGCCGCCAGA TCGCCGAGGC GATCCGGCTG CATACCGAGG TGGTGGGCGA GCGACCGCGC GGCTGGTATA CGGGGCGCTG CTCGGTCAAT ACCGTGCGGC TCACCGCCGA AGAGGGCGGC TTCGACTGGA TCTCGGACAC CTATGACGAC GACCTGCCCT ACTGGCTCGA GACCGGCACG CGCGACCAGC TGGTGATCCC CTACACGCTC GAGGCCAACG ACATGCGCTT CGCCACGGCG CCGGGCTATA TCGAGGGCGA GCAGTTCTTC ACCTATCTGC GCGACAGTTT CGACACGCTC TATGCCGAAG GCTGTGCGGG GCAGGCGAAG ATGTTCTCGA TCGGGCTCCA TTGCCGGCTG ATCGGGCGAC CCGGCAAGAT CGCGGGGCTG AAGCGCTTTC TGGACTATGC CCGCACCCAC GAGCGGGTCT GGTTCCCGCG CCGGGGCGAC ATCGCCCGCC ACTGGCACGA GACCCATCCC CATCGCCGCC GCCCGCGGCC CTCGCGCATG GACCGCGAGA GCTTCGAGGC CCATTTCGGC GGGATCTACG AACATTCGCC CTGGATCGCC GAGCGCGCCT TCGAGCTGGA ACTGGGCCCC GCCCATGACA GCCCTGCGGG CCTCGCCAAT GCGCTCGCCC GCATCTTCCG CAGCGCCACG CCCGCGGAGC GGCTGAGCGT GCTCAAGGCC CACCCCGATC TCGCCGGCAA GCTCGCGCAG GCGCGGCGGC TGACGGCCTC CTCCTCCTTC GAACAGTCGA GCGCCGGGCT CGATGCGCTC ACCGACGCCG AGCGGGCCGA GTTCGCCACC CTCAACGCCG ACTATGTGGC CAAGCACGGC TTCCCCTTCA TCATCGCCGT GCGCGACCAC GACAAGGCCG GCATCCTCGC CGCCTTCGAG ACCCGGCTCG CCCACGACAG CGCCACCGAA TTCGCCACCG CCTGCCGGCA GGTGGAACGT ATCGCCGAAC TCCGGCTTCA GGACATGCTG AAATGA
|
Protein sequence | MKRYPRDMRG HGPTPPDAAW PGGARIAVSI VLNYEEGGEN CLLHGDAQSE AFLSDIAGAQ PWPGQRHWNM ESIYDYGARA GFWRLHRLFT GLNIPVTVYG VATALARSPE QVAAMKSAGW EIASHGLKWV EHRDMPEEEE RRQIAEAIRL HTEVVGERPR GWYTGRCSVN TVRLTAEEGG FDWISDTYDD DLPYWLETGT RDQLVIPYTL EANDMRFATA PGYIEGEQFF TYLRDSFDTL YAEGCAGQAK MFSIGLHCRL IGRPGKIAGL KRFLDYARTH ERVWFPRRGD IARHWHETHP HRRRPRPSRM DRESFEAHFG GIYEHSPWIA ERAFELELGP AHDSPAGLAN ALARIFRSAT PAERLSVLKA HPDLAGKLAQ ARRLTASSSF EQSSAGLDAL TDAERAEFAT LNADYVAKHG FPFIIAVRDH DKAGILAAFE TRLAHDSATE FATACRQVER IAELRLQDML K
|
| |