Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RSP_1554 |
Symbol | |
ID | 3718621 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007493 |
Strand | - |
Start bp | 144267 |
End bp | 145682 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640069704 |
Product | polysaccharide deacetylase |
Protein accession | YP_351598 |
Protein GI | 77462094 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0726] Predicted xylanase/chitin deacetylase [COG3195] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR03164] OHCU decarboxylase [TIGR03212] putative urate catabolism protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.442375 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCGAT ATCCCCGAGA CATGCGCGGC CACGGGCCCA CTCCGCCCGA TGCGGCCTGG CCCGGCGGAG CCCGGATCGC CGTCTCGATC GTGCTGAACT ACGAGGAGGG CGGCGAGAAC TGCCTCCTGC ATGGAGATGC GCAGTCGGAG GCCTTCCTGT CCGACATCGC CGGCGCCCAG CCCTGGCCGG GTCAGCGGCA CTGGAACATG GAATCGATCT ACGATTACGG TGCCCGCGCG GGCTTCTGGC GGCTGCACCG GCTCTTCACC GGGCTGAACA TCCCGGTGAC GGTCTATGGC GTGGCCACCG CGCTGGCCCG CTCGCCCGAG CAGGTGGCGG CGATGAAATC CGCCGGGTGG GAGATCGCCT CGCACGGGCT GAAATGGGTC GAGCATCGCG ACATGCCCGA GGAGGAGGAG CGCCGCCAGA TCGCCGAGGC GATCCGGCTG CATGCCGAGG TGGTGGGCGA GCGACCGCGC GGCTGGTATA CGGGGCGCTG CTCGGTCAAT ACGGTGCGGC TCACCGCCGA AGAGGGCGGC TTCGACTGGA TCTCGGACAC CTATGACGAC GACCAGCCCT ACTGGCTCGA GACCGGCACA CGCGACCAGC TGGTGATCCC CTACACGCTC GAGGCCAATG ACATGCGCTT CGCCACGGCG CCGGGCTACA TCGAGGGCGA GCAGTTCTTC ACCTATCTGC GCGACAGTTT CGACACGCTC TATGCCGAAG GCTGTGCTGG GCAGGCGAAG ATGTTCTCGA TCGGGCTCCA TTGCCGGCTG ATCGGGCGGC CCGGCAAGAT CGCGGGGCTG AAGCGCTTTC TGGACTATGC CCGCACCCAT GAGCGGGTCT GGTTCCCGTG CCGGGGCGAC ATCGCCCGCC ACTGGCACGA GACCCATCCC CATCGTCGCC GCCCGCGGCC CTCGCGCATG GATCGCGAGA GCTTCGTGGC CCATTTCGGC GGGATCTACG AACATTCGCC CTGGATCGCC GAGCGCGCCT TCGAGCTGGA ACTAGGCCCC GCCCATGACA GCCCTGCGGG CCTCGCCAAT GCGCTCGCCC GCATCTTCCG CAGCGCCACG CCCGCGGAGC GGCTGAGCGT GCTCAAGGCC CACCCCGATC TCGCCGGCAA GCTCGCGCAG GCGCGGCGGC TGACGGCCTC CTCCTCCTTC GAACAGTCGA GCGCCGGGCT CGATGCGCTC ACCGACGCCG AGCGGGCCGA GTTCGCCACC CTCAACGCCG ACTATGTGGC CAAGCACGGC TTCCCCTTCA TCATCGCCGT GCGCGACCAC GACAAGGCCG GCATCCTCGC CGCCTTCGAG ACCCGGCTCG CCCACGACAG CGCCACCGAA TTCGCCACCG CCTGCCGTCA GGTGGAACGT ATCGCCGAAC TCCGGCTTCA GGACATGCTG AAATGA
|
Protein sequence | MKRYPRDMRG HGPTPPDAAW PGGARIAVSI VLNYEEGGEN CLLHGDAQSE AFLSDIAGAQ PWPGQRHWNM ESIYDYGARA GFWRLHRLFT GLNIPVTVYG VATALARSPE QVAAMKSAGW EIASHGLKWV EHRDMPEEEE RRQIAEAIRL HAEVVGERPR GWYTGRCSVN TVRLTAEEGG FDWISDTYDD DQPYWLETGT RDQLVIPYTL EANDMRFATA PGYIEGEQFF TYLRDSFDTL YAEGCAGQAK MFSIGLHCRL IGRPGKIAGL KRFLDYARTH ERVWFPCRGD IARHWHETHP HRRRPRPSRM DRESFVAHFG GIYEHSPWIA ERAFELELGP AHDSPAGLAN ALARIFRSAT PAERLSVLKA HPDLAGKLAQ ARRLTASSSF EQSSAGLDAL TDAERAEFAT LNADYVAKHG FPFIIAVRDH DKAGILAAFE TRLAHDSATE FATACRQVER IAELRLQDML K
|
| |