Gene RPD_3077 details


Gene Information

Locus tag: RPD_3077
Symbol:
ID: 4023580
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 3424636
End bp: 3425958
Gene Length: 1323 bp
Protein Length: 440 aa
Translation table: 11
GC content: 70%
IMG OID: 637963276
Product: polysaccharide deacetylase
Protein accession: YP_570204
Protein GI: 91977545
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0726] Predicted xylanase/chitin deacetylase
TIGRFAM ID:
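
The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the function names are hypothetical and not part of this record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the terminal stop codon


# Values copied from the record above.
start_bp, end_bp = 3424636, 3425958
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))  # prints "1323 440"
```

Both results agree with the Gene Length and Protein Length fields.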


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00148769
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.638845
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAGTG AACTGTTTCG ACGAAGGGCC CGGACGTGGT CCCTGCTTGG CATCGCCTGT 
GTGACCGTAG CGCTGGCCCT CCCCGCCAAC CGGTCGATGG CGGCCGATTG CCCGGGAAAC
CCCGGCGCGC TCGGGACCTC CCGGACGCTG GTTGTGGATC CGCGCGAGCA TCCGCGGATC
GGCACGATGC AATACGCCGA GACGCTGCCG CTCCAGGACC ATGAGGTGGT GCTGACCTTC
GACGACGGCC CGCTGCCGCG TCACAGCAAT GCCGTGCTCG ATATTCTCGC CAAAGAATGC
GTCAAGGCGA CGTTCTTCGT GGTCGGTCGG ATGGCGAAGG CCTATCCGGA CGGCGTGCGC
AGAATGCACG ACGCCGGCCA CAGCATCGGC ACCCACAGCC ACAACCATCC GCTCAGCTTC
CACCGGATGA CGGTCGAGCA GGCCAAGCAG GAAGTCGATG AGGGCATCGA CGCGGTCGCG
ACCGCGCTTG GCGACCGCGC GGCGGTGGCG CCGTTCTTCC GGATTCCCGG CCTGCTGCGG
GCCGAGGCGG TCGAAGGCTA TCTCGGCTCC CAGGGCATCC AGACCTGGAG CGCCGACTTC
CCCGCCGACG ACTGGCGGCA CATTTCGCCG GCCGCCGTCT ATGGCCTAGC CATGAGCCGG
CTGCAGGCCA AGGGGCGCGG CGTGCTGCTG CTCCACGACA TCCAGCCGCG CACGGTGGCG
GCGCTGCCGC AGATCCTGCA CGAGCTGAAG GCCCGCGGCT TCCGCATCGT GCATGTTGTG
CCGGCGACGC CGGATCGGCC GAAGACGCCG ACCGAGCCGT CGCAATGGCG GCTGCGTCCG
ATCACCGAAC AGGTTGCGAT CTCGCGTTGG CCGAAGGTGC CGAGCTTCAG CTTCGCCAAC
GCCGAGATGC TGCCAGGGCC CGCGGTCTCC GATCTTGGTC TCAATGCCGG CCACATGACG
GAGTCGGTCG GTGGCCCCAG GCATCTGGCG CGCGGCCAGA CGCCGCTGCC CAAGCGCGCG
CCGTGGCCGC GACAGACCCC GATGGCGGCG TCCGCGAATC TGATCGCCTT CCCGATTCCT
GCGGGAGTCC TGTTCAGCAT CCCGGAAAAG AGCCAGCCGG CGATCCGGGC GATGATTCCC
GTCGCGTCGC ATCATGCGAC GGCCGGCGCC GGGCTGGGCG CGACCAAGCT GGACGAAGCT
GCGCCAGCGA CCGCCGCCGG CGCCCCGGTC CGCCAGATCT CCGGCAGCGC CGCGATCGCG
CCCGCCGCCC TGCAGCGCGG CCCGATCGGC CTCACCGCAC GGCCGCGACC ACTCAATCAT
TAG
 
Protein sequence
MASELFRRRA RTWSLLGIAC VTVALALPAN RSMAADCPGN PGALGTSRTL VVDPREHPRI 
GTMQYAETLP LQDHEVVLTF DDGPLPRHSN AVLDILAKEC VKATFFVVGR MAKAYPDGVR
RMHDAGHSIG THSHNHPLSF HRMTVEQAKQ EVDEGIDAVA TALGDRAAVA PFFRIPGLLR
AEAVEGYLGS QGIQTWSADF PADDWRHISP AAVYGLAMSR LQAKGRGVLL LHDIQPRTVA
ALPQILHELK ARGFRIVHVV PATPDRPKTP TEPSQWRLRP ITEQVAISRW PKVPSFSFAN
AEMLPGPAVS DLGLNAGHMT ESVGGPRHLA RGQTPLPKRA PWPRQTPMAA SANLIAFPIP
AGVLFSIPEK SQPAIRAMIP VASHHATAGA GLGATKLDEA APATAAGAPV RQISGSAAIA
PAALQRGPIG LTARPRPLNH
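
The protein sequence can be related back to the gene sequence with a minimal sketch. It assumes the sequences are pasted in the 10-character blocks shown above, and it uses the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code. Alternative start codons are not handled, which is sufficient here because this gene begins with ATG. The helper names `clean`, `gc_content` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGGCAAGTG AACTGTTTCG ACGAAGGGCC CGGACGTGGT CCCTGCTTGG CATCGCCTGT"
print(translate(first_line))  # prints "MASELFRRRARTWSLLGIAC"
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the GC content field.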