Gene RSP_1554 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_1554 
Symbol 
ID3718621 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_007493 
Strand
Start bp144267 
End bp145682 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content68% 
IMG OID640069704 
Productpolysaccharide deacetylase 
Protein accessionYP_351598 
Protein GI77462094 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0726] Predicted xylanase/chitin deacetylase
[COG3195] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR03164] OHCU decarboxylase
[TIGR03212] putative urate catabolism protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.442375 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCGAT ATCCCCGAGA CATGCGCGGC CACGGGCCCA CTCCGCCCGA TGCGGCCTGG 
CCCGGCGGAG CCCGGATCGC CGTCTCGATC GTGCTGAACT ACGAGGAGGG CGGCGAGAAC
TGCCTCCTGC ATGGAGATGC GCAGTCGGAG GCCTTCCTGT CCGACATCGC CGGCGCCCAG
CCCTGGCCGG GTCAGCGGCA CTGGAACATG GAATCGATCT ACGATTACGG TGCCCGCGCG
GGCTTCTGGC GGCTGCACCG GCTCTTCACC GGGCTGAACA TCCCGGTGAC GGTCTATGGC
GTGGCCACCG CGCTGGCCCG CTCGCCCGAG CAGGTGGCGG CGATGAAATC CGCCGGGTGG
GAGATCGCCT CGCACGGGCT GAAATGGGTC GAGCATCGCG ACATGCCCGA GGAGGAGGAG
CGCCGCCAGA TCGCCGAGGC GATCCGGCTG CATGCCGAGG TGGTGGGCGA GCGACCGCGC
GGCTGGTATA CGGGGCGCTG CTCGGTCAAT ACGGTGCGGC TCACCGCCGA AGAGGGCGGC
TTCGACTGGA TCTCGGACAC CTATGACGAC GACCAGCCCT ACTGGCTCGA GACCGGCACA
CGCGACCAGC TGGTGATCCC CTACACGCTC GAGGCCAATG ACATGCGCTT CGCCACGGCG
CCGGGCTACA TCGAGGGCGA GCAGTTCTTC ACCTATCTGC GCGACAGTTT CGACACGCTC
TATGCCGAAG GCTGTGCTGG GCAGGCGAAG ATGTTCTCGA TCGGGCTCCA TTGCCGGCTG
ATCGGGCGGC CCGGCAAGAT CGCGGGGCTG AAGCGCTTTC TGGACTATGC CCGCACCCAT
GAGCGGGTCT GGTTCCCGTG CCGGGGCGAC ATCGCCCGCC ACTGGCACGA GACCCATCCC
CATCGTCGCC GCCCGCGGCC CTCGCGCATG GATCGCGAGA GCTTCGTGGC CCATTTCGGC
GGGATCTACG AACATTCGCC CTGGATCGCC GAGCGCGCCT TCGAGCTGGA ACTAGGCCCC
GCCCATGACA GCCCTGCGGG CCTCGCCAAT GCGCTCGCCC GCATCTTCCG CAGCGCCACG
CCCGCGGAGC GGCTGAGCGT GCTCAAGGCC CACCCCGATC TCGCCGGCAA GCTCGCGCAG
GCGCGGCGGC TGACGGCCTC CTCCTCCTTC GAACAGTCGA GCGCCGGGCT CGATGCGCTC
ACCGACGCCG AGCGGGCCGA GTTCGCCACC CTCAACGCCG ACTATGTGGC CAAGCACGGC
TTCCCCTTCA TCATCGCCGT GCGCGACCAC GACAAGGCCG GCATCCTCGC CGCCTTCGAG
ACCCGGCTCG CCCACGACAG CGCCACCGAA TTCGCCACCG CCTGCCGTCA GGTGGAACGT
ATCGCCGAAC TCCGGCTTCA GGACATGCTG AAATGA
 
Protein sequence
MKRYPRDMRG HGPTPPDAAW PGGARIAVSI VLNYEEGGEN CLLHGDAQSE AFLSDIAGAQ 
PWPGQRHWNM ESIYDYGARA GFWRLHRLFT GLNIPVTVYG VATALARSPE QVAAMKSAGW
EIASHGLKWV EHRDMPEEEE RRQIAEAIRL HAEVVGERPR GWYTGRCSVN TVRLTAEEGG
FDWISDTYDD DQPYWLETGT RDQLVIPYTL EANDMRFATA PGYIEGEQFF TYLRDSFDTL
YAEGCAGQAK MFSIGLHCRL IGRPGKIAGL KRFLDYARTH ERVWFPCRGD IARHWHETHP
HRRRPRPSRM DRESFVAHFG GIYEHSPWIA ERAFELELGP AHDSPAGLAN ALARIFRSAT
PAERLSVLKA HPDLAGKLAQ ARRLTASSSF EQSSAGLDAL TDAERAEFAT LNADYVAKHG
FPFIIAVRDH DKAGILAAFE TRLAHDSATE FATACRQVER IAELRLQDML K