Gene Rpal_4251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4251 
Symbol 
ID6411935 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4567689 
End bp4569308 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content68% 
IMG OID642714133 
Productglucose-methanol-choline oxidoreductase 
Protein accessionYP_001993222 
Protein GI192292617 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.773505 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGCAC GGATCGAAGG CGAATTCGAC ACCATCGTGG TGGGCGCGGG CACCGCAGGT 
TGCATCGTCG CCAACCGCCT ATCCGCCGAT CCGAGCCGGA AGGTGCTGCT GCTCGAAGCC
GGCGGCCGCG ACAACTGGAT CTGGTTTCAC ATTCCGGTCG GCTATCTGTT CGCGATCGGC
AATCCGCGCT CCGACTGGAT GTTCAAGACC GAGCCCGAGC CGGGCCTGAA TGGTCGGTCA
TTGGCCTATC CGCGCGGCAA GGTGATCGGC GGCTCCTCGG CGATCAACGC GATGATCTCG
ATGCGCGGCC AGGCCGCCGA TTACGACCAT TGGCGTCAGC TCGGCCTCGC CGGCTGGGGC
TGGGACGATG TCCGCAAGGT GTTCCGCCGG CTCGAAGACC ACTTCCTCGG CGACAGCGAG
CATCACGGCG CCGGCGGCGG CTGGCGGATC GAGGCGGCGC GGCTGTCATG GCCGATCCTC
GATGCGGTGG CGAACGCCGC CTGCGAGATG GGGATCCCGC GCAGCGCCGA CTTCAACACC
GGCGACAACG AGGGCGTCGG CTATTTCCAC GTCAACCAGA AGCGCGGCCG GCGCTGGTCG
TCGGCGCGCG GCTTTCTCAA GCCCGCGCTG CATCGCTCCA ACCTGCGGCT CGAAACCAAT
GTCGTGGTCG ACCGCGTGCT GGTCGAGAAC GGCCGCGCCG TCGGCGTGCG CTTCCTGCAG
AACGGCGTGC CGATCGAAGC CCGCGCGCGC CGTGAGGTGG TGCTGTGCGC CGGCGCGATC
GGCTCGATCC AGGTGCTGCA TCGCTCCGGC ATCGGCCCGG CCGAATGGCT GAAGCCGCTC
GGCATCGAGC CGGTGCTCGA TCGCCCCGGC GTCGGGCGCA ACCTGCAGGA CCATCTGCAG
CAGCGCGCGA TCTACAAGGT CAGCGGTGGC CGCACGCTGA ACGAGATCTA TCACTCGCTG
CCGCGCCGCG CCTGGATGGG ACTCGACTAC GCGCTGCGCC GGCGCGGGCC GCTCACCATG
GCGCCGTCGC AGCTCGGCAT CTTCACCCGC TCCGATCCGC ATCAGGAGCG CGCCAACATC
CAGTTCCACG TGCAGCCGCT GTCGCTCGAC AAGTTCGGCG ATCCGCTGCA TCGCTTCCCG
GCGATCACCG TGAGCGCCTG CAACCTGCGG CCGACCTCGC GCGGCGAGAT CAAGCTGAAA
TCCACCGCGC TCGACGCCGC GCCGTCGATC GCGCCGCATT ATCTGTCGAC CGCGGACGAC
TGCCGCGTCG CAGCCGATGC GATCCGCGTC ACGCGGCGGC TAATGAAGCA GCACGCGCTG
GCGACGTATC ACCCGGAGGA GTATCTGCCC GGCCCGTCGG TCGGCGACGA CGACGCCTCG
CTCGCCAAGG CCGCCGGTGA CATCGGCACT ACGATCTTCC ATCCCGTCGG CACCGCCAAA
ATGGGCCGCG CCGACGATCC GCTCGCGGTC GTCGATGAAA GACTTCGCTT CCACGGCCTC
GAAGCCTTGC GCGTCGTCGA CGCCTCGATC ATGCCGACGA TCACCTCCGG CAACACCAAC
ACCCCCACCG CAATGATCGC CGAGAAGGGC GCGACGATGA TCCTGGAGGA CGGGAAGTAA
 
Protein sequence
MTARIEGEFD TIVVGAGTAG CIVANRLSAD PSRKVLLLEA GGRDNWIWFH IPVGYLFAIG 
NPRSDWMFKT EPEPGLNGRS LAYPRGKVIG GSSAINAMIS MRGQAADYDH WRQLGLAGWG
WDDVRKVFRR LEDHFLGDSE HHGAGGGWRI EAARLSWPIL DAVANAACEM GIPRSADFNT
GDNEGVGYFH VNQKRGRRWS SARGFLKPAL HRSNLRLETN VVVDRVLVEN GRAVGVRFLQ
NGVPIEARAR REVVLCAGAI GSIQVLHRSG IGPAEWLKPL GIEPVLDRPG VGRNLQDHLQ
QRAIYKVSGG RTLNEIYHSL PRRAWMGLDY ALRRRGPLTM APSQLGIFTR SDPHQERANI
QFHVQPLSLD KFGDPLHRFP AITVSACNLR PTSRGEIKLK STALDAAPSI APHYLSTADD
CRVAADAIRV TRRLMKQHAL ATYHPEEYLP GPSVGDDDAS LAKAAGDIGT TIFHPVGTAK
MGRADDPLAV VDERLRFHGL EALRVVDASI MPTITSGNTN TPTAMIAEKG ATMILEDGK