Gene Information |
Locus tag | RPB_4062 |
Symbol | |
ID | 3911869 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4633199 |
End bp | 4634794 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637885966 |
Product | glucose-methanol-choline oxidoreductase |
Protein accession | YP_487666 |
Protein GI | 86751170 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | |
|
|
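The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (which matches the listed gene length) and that the CDS is unspliced and ends in a single stop codon. The function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information table above.
START_BP = 4633199
END_BP = 4634794


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues in an unspliced CDS: one per codon, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1596
print(protein_length(length_bp))  # 531
```

Both results agree with the "Gene Length" and "Protein Length" rows of the record.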
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGCGC AGGGTTGGGA CTACATCATC GTCGGCGCCG GCTCCGCGGG CTGCATCGTC GCCAACCGGC TGTCGGCCGA TCCGGCCTGC CGCGTGCTGC TGCTCGAGGC CGGCGGCTCG GATCGCAACA TCTGGCTGAA GCTGCCGGTC GGCTATTATC GCTCGATCTA CGACGACCGC TTCTCGCGCA AATTCATCAC CGAGCCGAGC GACGTCACCG GCGGCCGCGC CATCGTCTGG CCGCGCGGCC GCGTGCTCGG GGGCTCGTCG TCGATCAACG GGCTGATCTT CATCCGCGGC GAGCCGGCCG GCTTCGACGA TTGGGAGCGG CTCGGCGCCA AAGGCTGGAG CTATCAGGAG CTGCTGCCGT ATTTCCGGCG CTACGAGCGC TATCGCGGCG GCGACAGCCA GTATCACGGC GGTTTCGGCG AGTTCGAAGT CTCCGATCTG CGCACCGGCA GCGAGGCCGC CGCCGCATGG GTGCAGGCCG GCATCGAATT CGGCCTGCCG CGCAATCCGG ACTTCAACGC GGAGACGACT TACGGCGTCG GCGCCTATCA GCTCGGCATC GGCCGGCGCT GGCGCTCGAG CTCGGCTTCC GCTTTTCTGC ATCCGGTCAT GCACCGCACG AATCTGACGG TGATCACCCG GGCACACGCG AGCCGCGTGC TGTTCGACGG CACCACCGCC ACCGGCGTCG AATGGATCAG GGACGGGCAA CGGATCCAGG CGCGCGCCGA ACGTGAAGTA ATCCTCTCGG CCGGCGCGCT GCAGTCGCCG CAATTGCTAC AGCTCTCCGG TATCGGCCCT GCGGCGCTGC TGCGCGGCCT CGGCATCGAA ATCGTGGCCG ATGCGCCCGA GGTCGGGCGC AACCTGCAGG ATCATTATCA GGCGCGGATG ATCGTGCGGC TGAAGCAGAA GCACTCGCTC AACGATCAGG TGCGCAGCCC GGTCGGGCTC GCGAAGATGG GCCTGCAATG GCTGCTCGCC GGCAACGGGC CGCTCACCGC CGGCGCCGGC CAGGTCGGCG GCGCCGCCTG CACGCGCTAT GCGAAGAACG GCCGCCCCGA CGTGCAGTTC AACGTCATGC CGCTGTCGGT CGACAAGCCC GGCGAGCCGC TGCACAGCTA CTCGGGCTTC ACCGCTTCGG TGTGGCAGTG CCACGCCGAA TCGCGGGGCC ATCTGGCGAT CCGTTCGACC GACCCGTTCG AGCAGCCGAC CATCGTACCG AACTATTTCG AGCGCGAGAT CGATCGTAAC ACCATCGTCG CCGGGCTCGA GATCCTGCGC GAGATCTATC GGCAGCCGTC GTTCCGCGAG CGCTGGGACC TCGACGTGGT GCCGGGCGAG AACATCAACG ACCCTGCCGG GCTGTGGGAG TTCGCCCGCA CCACCGGCGG CACGGTGTTC CATGCCTGCG GCACCTGCCG GATGGGTTCC GACGACGGCG CGGTGGTCGA TCCGCGCCTG CGCGTGCGCG GCGTCGAGCG GCTGCGCGTG GTCGACGCCT CGGTGATGCC ACTGATCACC TCGGCCAATA CCAACGCCGC CAGCCTGATG ATCGGCGAGA AGGGCGCCGC CCTGATCGCC TCATGA
|
Protein sequence | MAAQGWDYII VGAGSAGCIV ANRLSADPAC RVLLLEAGGS DRNIWLKLPV GYYRSIYDDR FSRKFITEPS DVTGGRAIVW PRGRVLGGSS SINGLIFIRG EPAGFDDWER LGAKGWSYQE LLPYFRRYER YRGGDSQYHG GFGEFEVSDL RTGSEAAAAW VQAGIEFGLP RNPDFNAETT YGVGAYQLGI GRRWRSSSAS AFLHPVMHRT NLTVITRAHA SRVLFDGTTA TGVEWIRDGQ RIQARAEREV ILSAGALQSP QLLQLSGIGP AALLRGLGIE IVADAPEVGR NLQDHYQARM IVRLKQKHSL NDQVRSPVGL AKMGLQWLLA GNGPLTAGAG QVGGAACTRY AKNGRPDVQF NVMPLSVDKP GEPLHSYSGF TASVWQCHAE SRGHLAIRST DPFEQPTIVP NYFEREIDRN TIVAGLEILR EIYRQPSFRE RWDLDVVPGE NINDPAGLWE FARTTGGTVF HACGTCRMGS DDGAVVDPRL RVRGVERLRV VDASVMPLIT SANTNAASLM IGEKGAALIA S
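The relationship between the two sequences above can be illustrated with a minimal, dependency-free sketch that translates a DNA string and computes its GC fraction. It assumes the sequence is pasted in as shown, with spaces between 10-base blocks. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons. Because this gene begins with ATG, the sketch does not handle alternative start codons. The helper names (`clean`, `translate`, `gc_fraction`) are illustrative only.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (expects a non-empty sequence)."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGGCCGCGC AGGGTTGGGA CTACATCATC"))  # MAAQGWDYII
print(gc_fraction("ATGGCCGCGC"))                      # 0.8
```

Passing the full gene sequence to `translate` gives a string that can be compared with the listed protein sequence once its spaces are removed. Likewise, `gc_fraction` on the full sequence can be compared with the "GC content" row.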