Gene RPD_1948 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1948 
Symbol 
ID4022430 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2186339 
End bp2187385 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content66% 
IMG OID637962141 
Productpolysaccharide deacetylase 
Protein accessionYP_569084 
Protein GI91976425 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0726] Predicted xylanase/chitin deacetylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.171917 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTAAAG TTGTGGCGTT GACGGCAGGC TGCAGTGCCC TGGCCGTTCT GGTTGGCCTC 
GGTGCGGGAC GCGCTTATTT CTCGGCCCCG AGCGCGCAGA CTGTTGCGGC CTCCTCCGAA
CTCACCACCG GCGCGATCGC GTCGCGCTGG CCGACCCCGA CCTCCGAGTC GTCCAAGGCG
CCGGCGCCGA CAATTCAGCC CGCCGTCGCC AGGGAGCCCG CAGCGATGCC TGCGCCTGCG
CCGGCCCCCG CGCCTGTCCA AGCCTGCAAC AATCCAGACG CGCTCGGCAT CTCGCGCACC
GTCGAGATCG ACACCAATGG CGGACCGGGT CTCGGCATGT CGCAATATCG CGACTACGAC
TTCCTGCAGC CCGGCGAAGT CGCGCTGACC TTCGACGACG GTCCGTGGCC AGTGAACACG
CCGGCGGTTC TCGCGGCGTT GGCGGCGCAG TGCGTCAAGG CGGTGTTCTT CCCGATCGGC
AAACATGCCA GTTGGCATCC TGCGATCCTC AAGCAGGTCG TGGCCGCGGG CCACACCGTG
GGTTCGCACA CCTGGTCGCA CGTCAATCTC GCGACCAAGC CGTTTGCGGA CGCCAAGACC
GAGATTGAAA AGGGCATCAG CGGCGTCGCG CTCTCGGCCG GGCAGCCGAC CTCGCCGTTC
TTCCGCTTCC CGCAGCTTCG GCAGACGCAG GATCTCAAGG CCTATCTCGG CGAGCGCAAC
ATCGCGACGT TCTCGATCGA CATCGACTCC GAGGATTTCC GCATCCACAA GCCGGATCAA
CTGATCACCG CGGTGATGAC CAAGCTGAAG AAGGCCGGCA AGGGCATCCT GCTGATGCAC
GATTTCCAGC AATCGACCGC GCAGGCGCTG CCCGAGTTAC TCGCGCAACT CAAGGCCGGC
GGCTACAAGA TCGTGTTCAT CACCGCCAAG GACAAAGTGA ACACGCTGCC GGAATACGAC
GCACAGGTCG CCCCTGCGCA GCCGGCCGTC AGCAATGCGC GGCCGATCGG CAACGTGATC
CGCACCGTCG GCGGCAACGC GAAGTAA
 
Protein sequence
MRKVVALTAG CSALAVLVGL GAGRAYFSAP SAQTVAASSE LTTGAIASRW PTPTSESSKA 
PAPTIQPAVA REPAAMPAPA PAPAPVQACN NPDALGISRT VEIDTNGGPG LGMSQYRDYD
FLQPGEVALT FDDGPWPVNT PAVLAALAAQ CVKAVFFPIG KHASWHPAIL KQVVAAGHTV
GSHTWSHVNL ATKPFADAKT EIEKGISGVA LSAGQPTSPF FRFPQLRQTQ DLKAYLGERN
IATFSIDIDS EDFRIHKPDQ LITAVMTKLK KAGKGILLMH DFQQSTAQAL PELLAQLKAG
GYKIVFITAK DKVNTLPEYD AQVAPAQPAV SNARPIGNVI RTVGGNAK