Gene Rsph17025_3035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_3035 
Symbol 
ID5084374 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp3104336 
End bp3105751 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content70% 
IMG OID640484606 
Productchitin deacetylase 
Protein accessionYP_001169224 
Protein GI146279065 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0726] Predicted xylanase/chitin deacetylase
[COG3195] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR03164] OHCU decarboxylase
[TIGR03212] putative urate catabolism protein 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCGAT ACCCCCGAGA CATGCGCGGT CATGGTCCCA ATCCGCCCGA TGCGGCCTGG 
CCCGGCGGCG CCCGGATCGC CGTCTCGATC GTGCTGAACT ACGAGGAGGG CGGCGAGAAC
TGCCTCCTGC ACGGCGATGC GCAGTCGGAA GCCTTCCTGT CGGACATCGC CGGCGCGCAG
CCCTGGCCGG GCCAGCGCCA CTGGAACATG GAATCGATCT ACGACTACGG CGCGCGCGCG
GGCTTCTGGC GGCTGCACCG GCTCTTCACC GGCCTGAACA TCCCCGTGAC GGTCTATGGC
GTGGCCACGG CCCTCGCCCG CTCACCCGAG CAGGTGGCGG CGATGAAGGC CGCGGGCTGG
GAGATCGCCT CGCACGGGCT GAAATGGGTC GAGCACCGCG ACATGCCCGA GGAGGAAGAG
CGGCGCCAGA TCGCCGAAGC GATCCGCCTG CACACCGAGG TGGTGGGCGA GCGCCCGCGC
GGCTGGTACA CCGGGCGCTG CTCGATCCGT ACGGTGCGGC TCACCGCTGA AGAGGGCGGC
TTCGACTGGA TCTCGGACAC CTACGACGAT GATCTGCCCT ACTGGATCGA GGCGGGCGAG
CGCGACCAGC TCGTGATCCC CTACACGCTC GAGGCCAACG ACATGCGCTT CGCCACCGCC
CCCGGCTACA TCGAGGGCGA GCAGTTCTTC ACCTACCTGC GCGACAGCTT CGACACGCTC
TACGCCGAGG GCTGCGCGGG TCAGGCGAAG ATGTTCTCGA TCGGCCTCCA TTGCCGACTG
ATCGGGCGGC CGGGAAAGAT CGCGGGGCTG AAGCGGTTCC TCGACTACGC GCGCGGGCAT
GACGGCGTCT GGTTCCCCCG CCGCGGCGAC ATCGCCCGCC ACTGGCGCGC GGTCCACCCT
CACCGCCGCC GCCCGCGCCC CTCGCGGATG GACCGCGAGA ACTTCGTCGG CCGCTTCGGC
GGGATCTACG AACATTCCCC CTGGGTGGCC GAGCGCGCCT TCGAGCTGGA ACTCGGACCC
GCCCATGACA GCGCCGCGGG CCTTGCCAAC GCGCTCGCAC GCGCCTTCCG CACGGCCACC
CCCGCCGAGA GGCTCGACGT GCTCAAGGCC CACCCCGATC TCGCCGGCAA GCTCGCCCAG
GCGCGGCGCC TGACGGCCGC CTCCTCCGCC GAGCAGCAAG GCGCCGGCCT CGACGCGCTG
ACGGATGACG AGCGGGCCCG CTTCACCGCC CTCAATGGAG ACTATGTCGC CCGACACGGC
TTCCCCTTCG TCATCGCGGT GCGCGACCAC GACAAGGAGG GCATCCTCGC CGCCTTCCAG
ACCCGCCTCG CCAACGACAG CGCCACCGAA TTTTCCACCG CCTGCCGGCA GGTGGAACGC
ATCGCCGAAC TCCGGCTCCA GGACATGCTG AAATGA
 
Protein sequence
MNRYPRDMRG HGPNPPDAAW PGGARIAVSI VLNYEEGGEN CLLHGDAQSE AFLSDIAGAQ 
PWPGQRHWNM ESIYDYGARA GFWRLHRLFT GLNIPVTVYG VATALARSPE QVAAMKAAGW
EIASHGLKWV EHRDMPEEEE RRQIAEAIRL HTEVVGERPR GWYTGRCSIR TVRLTAEEGG
FDWISDTYDD DLPYWIEAGE RDQLVIPYTL EANDMRFATA PGYIEGEQFF TYLRDSFDTL
YAEGCAGQAK MFSIGLHCRL IGRPGKIAGL KRFLDYARGH DGVWFPRRGD IARHWRAVHP
HRRRPRPSRM DRENFVGRFG GIYEHSPWVA ERAFELELGP AHDSAAGLAN ALARAFRTAT
PAERLDVLKA HPDLAGKLAQ ARRLTAASSA EQQGAGLDAL TDDERARFTA LNGDYVARHG
FPFVIAVRDH DKEGILAAFQ TRLANDSATE FSTACRQVER IAELRLQDML K