Gene RPB_0200 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0200 
Symbol 
ID3909441 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp223625 
End bp224761 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content63% 
IMG OID637882081 
Productoxalate decarboxylase 
Protein accessionYP_483822 
Protein GI86747326 
COG category[G] Carbohydrate transport and metabolism
[R] General function prediction only 
COG ID[COG2140] Thermophilic glucose-6-phosphate isomerase and related metalloenzymes 
TIGRFAM ID[TIGR03404] bicupin, oxalate decarboxylase family 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.284083 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGATCA CCACCGATTT CCAGCCGGTT CGCGGCGCGT ACGGCGCGAG CGATCCCGGG 
CCGCGCAATC TGGCGCTCGA TCTGCAGAAC CCCGACATTT TCATTCCGCC GCCCACCGAC
AATGGGTCGC TTCCCAACCT GAAATTCTCG TTCGGCATGG CGCATAACCG GCTCGAGGCC
GGAGGCTGGG CGCGCGAGGT GACGGTCCGA GAACTGCCGG CCGCAAAGGG CATGGCCGGA
GTCGACATGC GCCTTGGTCC CGGCGTGGTC CGCGAGTTGC ACTGGCACAA GGAAGCCGAA
TGGGGCTACG TGCTCGACGG TCGCTGCCGC GTCACCGTCG TCGACCCCGA CCGCGGGGTC
TATGTCGACG ACCTTCAGGC CGGCGATATC TGGCTGTTCC CCTCCGGCAT ACCGCATTCG
ATCCAGGCCC TGGAAGAGGG TTGCGAATTC CTGCTGGTGT TCGACGACGG CAATTTCTCC
GAGAACGAGA CCCTGCTGGT GACGGAGTTG ATGGCGCATA TGCCGATCGA CGTGGTCGCC
AAGAATTTCG GCATTCCGCA GCAGCATTTC GCCAATCTCC CGCCGAAGGA GAAGTACATC
TTCCCGCTTC CCGTCCCGCC GCCGCTCGAC GAGGTTCTCG CCAAACTGCC GCCGAAGCGG
CCCGCGATGC CGTTCACGGT CCATCGTGGT GACTTCACGC CGACGCAATG GGATGGCGGC
AAGACCACGA TAATCGATGT CCGGAACTTT CCGGTAACCA ACATGGCGGC GCTGATCATC
GATCTGGAGC CGGGCGCATT GCGCGAAATC CACTGGCATC CCGACGCGGA CGAATGGCAA
TACTACATTC AGGGCGAAGC GCGGATGACC GTGTTCGACG CCACCTCGAA GGCGCGGACG
TTCAACTATC GCGCCGGCGA CGTCGGCTAC GTGCCGAAGA CTCTCGCGCA CTACATCGAG
AATATCGGCA CGACCCCGGT CCGGGTGCTG AACGTCTTCA ACAAGCCGCT GTTCAAGGAC
GTCCCGCTGA ATCAGTGGCT GGCTTTGACT CCGCCCGACC TGGTCCGGGG CCATCTCGGT
CTCGATGACG TGGCCATGGC GGCGCTCGAT CGGAATCCAC GCTCAGTCGT GCGTTGA
 
Protein sequence
MPITTDFQPV RGAYGASDPG PRNLALDLQN PDIFIPPPTD NGSLPNLKFS FGMAHNRLEA 
GGWAREVTVR ELPAAKGMAG VDMRLGPGVV RELHWHKEAE WGYVLDGRCR VTVVDPDRGV
YVDDLQAGDI WLFPSGIPHS IQALEEGCEF LLVFDDGNFS ENETLLVTEL MAHMPIDVVA
KNFGIPQQHF ANLPPKEKYI FPLPVPPPLD EVLAKLPPKR PAMPFTVHRG DFTPTQWDGG
KTTIIDVRNF PVTNMAALII DLEPGALREI HWHPDADEWQ YYIQGEARMT VFDATSKART
FNYRAGDVGY VPKTLAHYIE NIGTTPVRVL NVFNKPLFKD VPLNQWLALT PPDLVRGHLG
LDDVAMAALD RNPRSVVR