Gene RPB_1733 details

Gene Information

Locus tag: RPB_1733
Symbol:
ID: 3908258
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 1978443
End bp: 1980092
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 68%
IMG OID: 637883627
Product: glucose-methanol-choline oxidoreductase
Protein accession: YP_485352
Protein GI: 86748856
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2303] Choline dehydrogenase and related flavoproteins
TIGRFAM ID:
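
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS whose final codon is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp, has_stop=True):
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, if has_stop is True,
    that the last codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop else codons


length = gene_length(1978443, 1980092)
print(length)                           # 1650
print(expected_protein_length(length))  # 549
```

Both values agree with the Gene Length and Protein Length fields of this record.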


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0167412
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0176419
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGTGGC AAACCACACG TGGAACGGGA CGGATGACGC AGATCGAGGG CGAATTCGAC 
TATATCGTGG TGGGCGCCGG CACTGCCGGC TGCATCGTCG CCAACCGGCT GTCGGCCGAT
CCGAATTGTC GGGTGCTGCT GCTGGAAGCC GGCGGCCGCG ACAACTGGAT CTGGTTTCAC
ATCCCGGTCG GCTATCTGTT CGCGATCGGC AATCCGCGCT CGGACTGGAT GTTCAGGACC
GAGCCGGAGC CCGGCCTCAA TGGTCGGTCA TTGGCCTATC CGCGCGGCAA GGTGATCGGC
GGCTCCTCGG CCATCAACGC GATGATCTCG ATGCGCGGCC AGGCCGCCGA TTACGACCAC
TGGCGCCAGC TCGGCCTGTC CGGTTGGGGC TGGGACGACG TTCGCCAGGT GTTCCGGCGG
CTGGAGGATC ACTTCCTCGG CGACAGCGAG CATCACGGTA AAGGCGGCGG CTGGCGGATC
GAGGCGGCGC GGCTGTCCTG GCCGATCCTC GACGCGGTGG CCGACGCCGC CGGCGAGATG
GGCATCCCGC GCAGCGCCGA TTTCAACACC GGCGACAACG AGGGCGTCGG CTATTTCCAC
GTCAACCAGA AGCGCGGCCG GCGCTGGTCG TCGGCGCGCG GTTTCCTCAA GCCGGCGCTG
CACCGGCCGA ATCTGAGGCT GGAAACCGGT GTCGTCACCG ACCGCGTGAT AGTCGAGAAC
GGCCGAGCCG TCGGCGTGCG GTTCCAGCAA GGCGGCGGCG TGGTCGAGGC GCGGGCGCGG
CGCGAGGTGG TGCTGTGCGC GGGATCGATC GGCTCGGTGC AAGTGCTGCA GCGTTCAGGC
ATCGGGCCGG CGGAATGGCT CACCCCGCTC GGCATCGATC CGGTGCTCGA TCGCCCGGGC
GTCGGCCGCA ATCTGCAGGA CCATCTGCAG CAGCGCGCGA TCTATCGCGT CAGCGGCGGC
CGCACCCTGA ACGAGATCTA TCACTCGCTG CCGCGGCGCG CCTGGATGGG CATGGACTAC
GCGCTGCGTC GGCGCGGCCC GCTGACGATG GCGCCATCGC AGCTCGGCAT CTTCACCCGC
TCCGATCCGC ATCAGGAGCG CGCCAACATC CAGTTTCACG TCCAGCCATT GTCTCTGGAT
AAATTCGGCG ACCCGCTGCA CCGCTTCCCG GCGATCACCG TCAGCGCCTG CAACCTGCGG
CCGACCTCGC GCGGCGAGAT CAAGCTGAAA TCCACCGCGC TCGACGCCGC CCCCTCGATT
GCGCCGCATT ATCTGACGAC CGCCGACGAC CGCCGCGTCG CCGCCGACGC GATCCGCTGT
ACGCGCCGGC TGATGCAACA GCAGGCGCTG GCGAAGTATC AACCCGAGGA GTATCTGCCC
GGCCGCGCGG TGGGCGACGA CGACGCCTCG TTGGCGAAAG CCGCCGGCGA CATCGGCACC
ACGATCTTCC ATCCGGTCGG CACCGCCAAG ATGGGCCTCG CCAGCGATCC GATGGCGGTG
GTCGACGAAC GCTTGCGCCT GCACGGCCTC GACGGCCTGC GCGTCGTCGA CGCCTCGGTG
ATGCCGACGA TCACCTCCGG CAACACCAAT ACGCCGACCG CGATGATCGC CGAGAAAGGC
GCGACGATGA TGCTGGAGGA TGGGAAGTAA
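
A GC content figure such as the one reported in this record can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as text in the 10-base blocks shown above; `gc_content` is a hypothetical helper name, and how the database rounds its reported value is not stated in the record.

```python
def gc_content(dna):
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (the spaces and line breaks used in the block layout)
    is ignored; the sequence is treated case-insensitively.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Usage on a toy input; paste the full gene sequence to check the record.
print(gc_content("GGCC AATT GC"))  # 60.0
```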
 
Protein sequence
MSWQTTRGTG RMTQIEGEFD YIVVGAGTAG CIVANRLSAD PNCRVLLLEA GGRDNWIWFH 
IPVGYLFAIG NPRSDWMFRT EPEPGLNGRS LAYPRGKVIG GSSAINAMIS MRGQAADYDH
WRQLGLSGWG WDDVRQVFRR LEDHFLGDSE HHGKGGGWRI EAARLSWPIL DAVADAAGEM
GIPRSADFNT GDNEGVGYFH VNQKRGRRWS SARGFLKPAL HRPNLRLETG VVTDRVIVEN
GRAVGVRFQQ GGGVVEARAR REVVLCAGSI GSVQVLQRSG IGPAEWLTPL GIDPVLDRPG
VGRNLQDHLQ QRAIYRVSGG RTLNEIYHSL PRRAWMGMDY ALRRRGPLTM APSQLGIFTR
SDPHQERANI QFHVQPLSLD KFGDPLHRFP AITVSACNLR PTSRGEIKLK STALDAAPSI
APHYLTTADD RRVAADAIRC TRRLMQQQAL AKYQPEEYLP GRAVGDDDAS LAKAAGDIGT
TIFHPVGTAK MGLASDPMAV VDERLRLHGL DGLRVVDASV MPTITSGNTN TPTAMIAEKG
ATMMLEDGK
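
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below builds the codon table from the standard NCBI amino acid string, which translation table 11 shares with table 1 for elongation; it is an illustration, not the database's own pipeline, and `translate` is a hypothetical helper. It does not handle the alternative start codons that table 11 permits, which is not an issue here because this gene starts with ATG.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G) paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is ignored. Alternative start codons (e.g. GTG, TTG) are
    NOT converted to M; this is a simplification of translation table 11.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last 30 bases of the gene sequence above.
print(translate("ATGTCGTGGC AAACCACACG TGGAACGGGA"))  # MSWQTTRGTG
print(translate("GCGACGATGA TGCTGGAGGA TGGGAAGTAA"))  # ATMMLEDGK
```

The two fragments match the first ten and last nine residues of the protein sequence listed in this record.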