Gene RPB_4062 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4062 
Symbol 
ID3911869 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4633199 
End bp4634794 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content68% 
IMG OID637885966 
Productglucose-methanol-choline oxidoreductase 
Protein accessionYP_487666 
Protein GI86751170 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGCGC AGGGTTGGGA CTACATCATC GTCGGCGCCG GCTCCGCGGG CTGCATCGTC 
GCCAACCGGC TGTCGGCCGA TCCGGCCTGC CGCGTGCTGC TGCTCGAGGC CGGCGGCTCG
GATCGCAACA TCTGGCTGAA GCTGCCGGTC GGCTATTATC GCTCGATCTA CGACGACCGC
TTCTCGCGCA AATTCATCAC CGAGCCGAGC GACGTCACCG GCGGCCGCGC CATCGTCTGG
CCGCGCGGCC GCGTGCTCGG GGGCTCGTCG TCGATCAACG GGCTGATCTT CATCCGCGGC
GAGCCGGCCG GCTTCGACGA TTGGGAGCGG CTCGGCGCCA AAGGCTGGAG CTATCAGGAG
CTGCTGCCGT ATTTCCGGCG CTACGAGCGC TATCGCGGCG GCGACAGCCA GTATCACGGC
GGTTTCGGCG AGTTCGAAGT CTCCGATCTG CGCACCGGCA GCGAGGCCGC CGCCGCATGG
GTGCAGGCCG GCATCGAATT CGGCCTGCCG CGCAATCCGG ACTTCAACGC GGAGACGACT
TACGGCGTCG GCGCCTATCA GCTCGGCATC GGCCGGCGCT GGCGCTCGAG CTCGGCTTCC
GCTTTTCTGC ATCCGGTCAT GCACCGCACG AATCTGACGG TGATCACCCG GGCACACGCG
AGCCGCGTGC TGTTCGACGG CACCACCGCC ACCGGCGTCG AATGGATCAG GGACGGGCAA
CGGATCCAGG CGCGCGCCGA ACGTGAAGTA ATCCTCTCGG CCGGCGCGCT GCAGTCGCCG
CAATTGCTAC AGCTCTCCGG TATCGGCCCT GCGGCGCTGC TGCGCGGCCT CGGCATCGAA
ATCGTGGCCG ATGCGCCCGA GGTCGGGCGC AACCTGCAGG ATCATTATCA GGCGCGGATG
ATCGTGCGGC TGAAGCAGAA GCACTCGCTC AACGATCAGG TGCGCAGCCC GGTCGGGCTC
GCGAAGATGG GCCTGCAATG GCTGCTCGCC GGCAACGGGC CGCTCACCGC CGGCGCCGGC
CAGGTCGGCG GCGCCGCCTG CACGCGCTAT GCGAAGAACG GCCGCCCCGA CGTGCAGTTC
AACGTCATGC CGCTGTCGGT CGACAAGCCC GGCGAGCCGC TGCACAGCTA CTCGGGCTTC
ACCGCTTCGG TGTGGCAGTG CCACGCCGAA TCGCGGGGCC ATCTGGCGAT CCGTTCGACC
GACCCGTTCG AGCAGCCGAC CATCGTACCG AACTATTTCG AGCGCGAGAT CGATCGTAAC
ACCATCGTCG CCGGGCTCGA GATCCTGCGC GAGATCTATC GGCAGCCGTC GTTCCGCGAG
CGCTGGGACC TCGACGTGGT GCCGGGCGAG AACATCAACG ACCCTGCCGG GCTGTGGGAG
TTCGCCCGCA CCACCGGCGG CACGGTGTTC CATGCCTGCG GCACCTGCCG GATGGGTTCC
GACGACGGCG CGGTGGTCGA TCCGCGCCTG CGCGTGCGCG GCGTCGAGCG GCTGCGCGTG
GTCGACGCCT CGGTGATGCC ACTGATCACC TCGGCCAATA CCAACGCCGC CAGCCTGATG
ATCGGCGAGA AGGGCGCCGC CCTGATCGCC TCATGA
 
Protein sequence
MAAQGWDYII VGAGSAGCIV ANRLSADPAC RVLLLEAGGS DRNIWLKLPV GYYRSIYDDR 
FSRKFITEPS DVTGGRAIVW PRGRVLGGSS SINGLIFIRG EPAGFDDWER LGAKGWSYQE
LLPYFRRYER YRGGDSQYHG GFGEFEVSDL RTGSEAAAAW VQAGIEFGLP RNPDFNAETT
YGVGAYQLGI GRRWRSSSAS AFLHPVMHRT NLTVITRAHA SRVLFDGTTA TGVEWIRDGQ
RIQARAEREV ILSAGALQSP QLLQLSGIGP AALLRGLGIE IVADAPEVGR NLQDHYQARM
IVRLKQKHSL NDQVRSPVGL AKMGLQWLLA GNGPLTAGAG QVGGAACTRY AKNGRPDVQF
NVMPLSVDKP GEPLHSYSGF TASVWQCHAE SRGHLAIRST DPFEQPTIVP NYFEREIDRN
TIVAGLEILR EIYRQPSFRE RWDLDVVPGE NINDPAGLWE FARTTGGTVF HACGTCRMGS
DDGAVVDPRL RVRGVERLRV VDASVMPLIT SANTNAASLM IGEKGAALIA S