Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3847 |
Symbol | |
ID | 3911651 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4398972 |
End bp | 4400576 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637885748 |
Product | glucose-methanol-choline oxidoreductase |
Protein accession | YP_487451 |
Protein GI | 86750955 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.917916 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGACA CCTATGACTT TGTCGTGGTC GGCGGCGGCT CGGGCGGCTG CGCGGTCGCA GGAAGGCTGT CGGAAGATCC CGGCACGTCG GTGGCTTTGC TGGAAGCCGG CGGCACCGCC GACAATTGGG TGGTGAAGAC ACCCTATGCG CTGTCGCTGA TGGTGCCGAG CAAACTCAAC AACTGGCACT TCGAGACGGT GCCGCAGCCC GGCCTCAACG GCCGGATCGG CTATCAGCCG CGCGGCAAGG GGCTCGGCGG ATCGTCGGCG ATCAACGCCA TGGTCTACAT TCGCGGCCAT CAATGGGACT ATGACCACTG GGCCGAACTC GGCGCCACCG GCTGGTCCTA TGCCGACGTG CTGCCGTATT TCAAACGTTC GGAGAGCAAT TCCGATTTCA ACGGCGCGTA TCACGGCCAG AGCGGCCCGC TGCACGTCAA CAAACTCCGC ACCGACAATC CGGTGCACGA GATCTTCCTG CAGGCGGCGC GCGAGGCGCA ATTCCGCATT CGCGACGATT TCAACGGCGA GGAGCAGGAA GGCCTCGGCC TGTATCAGCT CACCCAGCAC AATGGCGAGC GCTGGAGCGC GGCCCGCGCC TATGTGCATC CCTACATGGC GACGCGGCCC AATCTGCGCG TCGAGACGCA GGCGCAGGCC ACCCGCATCC TGTTCGAGGG CGGTCGCGCG GTCGGCGTCG AATATCGCCA GAACGACGAA GCCCGGCAGA TCCGCGCGCG GCGCGAAGTC ATCGTCGCGT CCGGCGCGTT CCAGTCGCCG CAACTGCTGA TGCTGTCCGG CATCGGCGAC GCCGCCACGC TGCAGCAGCA CGGCATCGCG CCGACGCATC ATCTGCCCGG CGTCGGGCAG AATTTGCAGG ATCACCCCGA CTTCATCTTC GCCTATCAAT CCGACAGCCC ATATTTCACC GGCACCAGCT TCACCGGCAT CGGCCGGTTG CTGTCGCGGA TCGGCCAGTA CCGCCGCGAG GGCCGCGGCC CGCTCACGAC CAACTTCGCC GAATGCGGCG GCTTCCTGAA AACGCGACCG GACCTCGACG TGCCCGACGT GCAGTTGCAT TTCGGCATGG CGATGGTCGA CGACCACGGC CGCAAACGGC GCTGGGGCAC GGGCTTTTCC TGCCATGTCT GCCTGCTGCG GCCGAAAAGC CGCGGCAGCG TCGGCCTCGC AAGCGCCGAT CCGCTGGCGC CGCCGCTGAT CGACCCCAAC TTTCTCGGCG AGGCGGACGA TCTCGAGGCG ATGGTCGCAG GCTACAAGAC CACGCGGCGG TTGATGGAAG CCCCCGCGCT CCGCGCGCTG CAGCAGAAGG ATCTGTTCAC CGCCGACGTC CGCACCGATG ACGACATCCG CGCCATCCTG CGCGCCCGCG TCGACACCGT GTATCACCCG GTCGGCACCT GCAGGATGGG CAGCGATCCG ATGGCGGTGG TCGATCCGCA GCTTCGTGTG CACGGAATCG GCGGGCTGCG CATCGTCGAC GCCTCGGTCA TGCCGACCCT GATCGGCGGC AACACCAACG CGCCGACGAT CATGATCGGC GAGAAGGCCG CGGACATGAT CCGGGAGGAG ATCCGGGCGA ACTGA
|
Protein sequence | MTDTYDFVVV GGGSGGCAVA GRLSEDPGTS VALLEAGGTA DNWVVKTPYA LSLMVPSKLN NWHFETVPQP GLNGRIGYQP RGKGLGGSSA INAMVYIRGH QWDYDHWAEL GATGWSYADV LPYFKRSESN SDFNGAYHGQ SGPLHVNKLR TDNPVHEIFL QAAREAQFRI RDDFNGEEQE GLGLYQLTQH NGERWSAARA YVHPYMATRP NLRVETQAQA TRILFEGGRA VGVEYRQNDE ARQIRARREV IVASGAFQSP QLLMLSGIGD AATLQQHGIA PTHHLPGVGQ NLQDHPDFIF AYQSDSPYFT GTSFTGIGRL LSRIGQYRRE GRGPLTTNFA ECGGFLKTRP DLDVPDVQLH FGMAMVDDHG RKRRWGTGFS CHVCLLRPKS RGSVGLASAD PLAPPLIDPN FLGEADDLEA MVAGYKTTRR LMEAPALRAL QQKDLFTADV RTDDDIRAIL RARVDTVYHP VGTCRMGSDP MAVVDPQLRV HGIGGLRIVD ASVMPTLIGG NTNAPTIMIG EKAADMIREE IRAN
|
| |