Gene RPC_2481 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_2481 
Symbol 
ID3971238 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp2689471 
End bp2691075 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content65% 
IMG OID637925589 
Productglucose-methanol-choline oxidoreductase 
Protein accessionYP_532351 
Protein GI90423981 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.236507 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.407022 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGATG ATCCCGTCGT CAACGATAAG CCTGTCGATT TGACCGCGGC GCGCGAAGCG 
CAACATCGCA ACGACCATTA CGACTTCATC GTCTGCGGAG CCGGTCCCGC GGGTTCGGTG
GTCGCCGCGC GGCTGGCCGA AAATCCCTCC GTGCGGGTGC TCCTGATTGA AGCCGGCGGC
AGCGACGACG TTGCCGACGT CGTGAACCCC GCGCACTGGC CGCTGAATCT CGGCTCGGAG
CGGGACTGGG GTTTCAGCGC GCAACCCAAT CCGCATCTCA ATGGCCGCTC GATCCCGCTC
AGCATGGGCA AGCTGCTGGG CGGCAGTTCG AGCATCAATG TCGGGGTGTG GGCGCGCGGC
CACAAGAACG ATTGGGAGCA TTTCGCCGCG GAGGCGGGCG ATCCGGCGTG GGGATATCAG
TCGATCCTTG CGATCTACCG GCGCATCGAG GACTGGCAGG GCGCCGACGA TCCCTTGCGC
CGCGGTCGCG GCGGCCCGGT TCACGTCCAG TCGGCCAGCG ATCCGCAGCC GGTCGCCTGC
GCCATGGTGG AGGCCGCCAA ATCGCTCGGG CTACCGACCT TCGACAGCCC CAACGGCGAG
ATGATGGAAG GTCGCGGCGG CGTCGCCATC AACGACCTGT TGGTTCGCGA CCGGCTGCGC
TCCTCGATCT ATCGCTCGTA TGTTCATCCG CGCCTCACGC AACCCAACCT GACGGTGCTG
ACGCGCGCGA TCGTCAGCAA GTTGTTATTC GAAGGCACCG CGGTGGTCGG CGTCGAGACG
GTCGAGCGAT CGGGGCGGCG TCGTTATCTG TCGGATCACG AAGTGGTGCT CTCGCTCGGC
GCGATCAATA CGCCAAAGCT GCTGATGCAG TCCGGGATCG GACCGCAAGA ACAACTTCGG
CGCCATGGCA TTGCGGTGGT TCAGCATCTG CCCGGCGTCG GTCAGAACCA TCAGGACCAT
GTGTCGTTCA ATTGCATTTT CGAATGCAAC GAGCCGCAGC CGGTCGGCCA TGGCGGCTCC
GAAGCGACCT TGTATTGGAC CAGCGAAACC GGCCTCGCGC TGCCAGACCT GTTTCAGTGC
CAAGTCGAAT TTCCCGTGCC CAGCGCCGAA ACCGCGGCGT TGGGCGTGCC GGAGCAGGGC
TGGACGATGT TTGCGGGATT GGCGCATCCC AAGAGCCGCG GTGAACTGCG GCTGTCCGGC
GCGGATGTGT TCGATCCGAT CCTGATCGAG GCCAACACGC TGTCGCACCC GGATGATCTC
GTGAACGCGC GGACCACCAT CGAACTATGC CGCGAGCTCG GCAACAGCTC CGCCTTTTCC
GGACTGGTGC GGCGCGAGGC CCTGCCGGGC AAGCTCGGCC CCGGCGCGAT GGAGGAGTTT
GCCCGCAATG CCGCCGTCAC TTTCTGGCAT CAGTCATGCA CCGCCAAGAT GGGACGCGAC
CCGATGGCCG TGGTCGACCA CCGCCTGAAG GTCTATGGGA TCGACCGGCT GCGCATCGCC
GACGCCTCGA TCATGCCTGA CGTCACCAGC GGCAACACCA TGGCGCCCTG CGTGGTGATC
GGCGAAAGAG CGGCCGAGAT GATCAAGGCC GCGCACCGGA TCTAG
 
Protein sequence
MTDDPVVNDK PVDLTAAREA QHRNDHYDFI VCGAGPAGSV VAARLAENPS VRVLLIEAGG 
SDDVADVVNP AHWPLNLGSE RDWGFSAQPN PHLNGRSIPL SMGKLLGGSS SINVGVWARG
HKNDWEHFAA EAGDPAWGYQ SILAIYRRIE DWQGADDPLR RGRGGPVHVQ SASDPQPVAC
AMVEAAKSLG LPTFDSPNGE MMEGRGGVAI NDLLVRDRLR SSIYRSYVHP RLTQPNLTVL
TRAIVSKLLF EGTAVVGVET VERSGRRRYL SDHEVVLSLG AINTPKLLMQ SGIGPQEQLR
RHGIAVVQHL PGVGQNHQDH VSFNCIFECN EPQPVGHGGS EATLYWTSET GLALPDLFQC
QVEFPVPSAE TAALGVPEQG WTMFAGLAHP KSRGELRLSG ADVFDPILIE ANTLSHPDDL
VNARTTIELC RELGNSSAFS GLVRREALPG KLGPGAMEEF ARNAAVTFWH QSCTAKMGRD
PMAVVDHRLK VYGIDRLRIA DASIMPDVTS GNTMAPCVVI GERAAEMIKA AHRI