Gene RPB_0228 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0228 
Symbol 
ID3909470 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp257855 
End bp261163 
Gene Length3309 bp 
Protein Length1102 aa 
Translation table11 
GC content68% 
IMG OID637882110 
Productpyruvate carboxylase 
Protein accessionYP_483850 
Protein GI86747354 
COG category[C] Energy production and conversion
[I] Lipid transport and metabolism 
COG ID[COG1038] Pyruvate carboxylase
[COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit
[COG4799] Acetyl-CoA carboxylase, carboxyltransferase component (subunits alpha and beta) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGTTTC GCAAGCTGCT GATCGCCAAC CGCGGCGAGA TCGCGATCCG CATCGCGCGG 
GCCGCGGCCG ACGCCGGGCT CGCCACCGTC GCGATCCATC CCGCCGACGA TGCGGCGTCG
CTGCATGTGC GGATCGCCGA CGAGGCGCGG GAGATTCCCG GGCGCGGCGC GCGGGCTTAT
CTGGATATCG AGGGCGTGAT CGCGGCCGCC AAGGCGGCGC ATTGCGACGC GCTGCATCCC
GGCTACGGCT TCCTCAGCGA GAACCCCGAT CTGGCGCGGC GCTGCGCCGA GGAAGGTATC
CGGTTCATCG GTCCGTCGCC AGAAGCCTTG CGGCTATTCG GTGACAAAGT TGCGGCGAAG
GAATTGGCCA AGAAATCCGG CGTGCCGATC ATCGACGGCA CCCAGGGTCC TTCGACTCTC
GATGAGGTGA AGGCGTTCTT CGGCTCGCTC GGCGATCATG CCGCGGTAAT GATCAAGGCG
ATGGCCGGCG GCGGCGGCCG CGGTATGCGT GTGGTCGAGC GCGCCGAGGA TCTCGACGAG
GCCTATGCGC GCTGCCAGTC CGAGGCGAAG GCGGCGTTCG GCAGCGACGG CGTCTATGCC
GAGCGGCTGA TCCGCAATGC GCGGCATATC GAGGTGCAGA TCATCGGCGA TCGTCACGGC
GGCATCAGTC AGTTGTGGGA GCGCGAATGC ACGATCCAGC GCCGCAACCA GAAGCTGGTC
GAGATCGCGC CGAGCCCGTC GCTGAGTGAT AGCTTGCGCG GCCGCATTCT CGAAGCCGCC
AAGACGCTGG CTGGTGCGGC GAACTACGAC AGTCTCGGTA CGTTCGAATT CCTGGTCGAC
GGCGAAGCCG GCGAGGGCGA CGGCGCGTTC GCGTTCATCG AGGCCAATCC GCGGCTGCAG
GTCGAGCACA CCGTGACCGA GGAAGTCCTT GCGATCGATC TGGTGCGATC GCAGATCGCG
GTGGCCGGCG GCGCGACGCT GGAATCGCTC GGGCTCGATC AGGCCGCGGT GCCGCGCCCG
CGCGGCTTCG CGATGCAGCT CCGTATCAAC ATGGAGACGA TGGACGCGAG CGGCGCGACC
CATCCGACCG GCGGCACGCT TTCGATCTTC GAGCCGCCGT CGGGTCCGGG CGTGCGCGTC
GATACGTTCG GCTATGCCGG CTACAGGACC AGCGCGGCGT TCGACTCGCT GCTGGCAAAA
GTGATCGTGC ACACGCCGTC GCAATGGCCG GATGTCGTCG CCAAGGCGGC GCGCAGCCTG
CGTGAGTTCC GCATCGACGG CGTCGCCACC AACATCCCGT TCATCCAGGC GATCCTGGCG
CATCCGGACT TCAAGGCCAA CAAGGTCAGC ACCAGCTTCA TCGACCGCAA CGTCGCCGAG
CTGGTCGGCG CCGCGGACAA GCTCGCCGCG CCACTGATCG CGTTGCCGGG TGGTGATGCG
CAGCACGGTG GCGCGAAGGC CGCGGTCGAA GCCGCGCCCG AAGGCGCGGT CGTGATCGCC
GCGCCGCTGC AGGGCACCGT GGTGGCGATC ACGGTCGCGG AAGGCGACGT GGTGCGGCCG
GGGCAGCAGC TCGCGGTGAT CGAATCGATG AAGATGGAGC ATCTCGTCGC CGCCGAGCAG
GGCGGCCGGA TTCGCCGCAT CGTCACCGCC GACGGCGTGA CGCTGATGCA GGGCGAGGCG
ATCCTCTATC TCGAGCCGCA GGACGTCGAG GGCGATCTGG CCGTCAAGGA GGCCGAAGTC
GATCTCGACC ATATCCGTCC CGACCTCGCC GAGATGCTGG CGCGGCAGGG CAATACGCTC
GACGAGAACC GGCCCGATTC GGTCGCGCGC CGGCGCAAGA CCAATCAGCG CACCGCGCGC
GAGAACATCG CCCAGCTGGT CGACGACGGC TCCTTCATGG AATACGGCAG CCTCGCGATC
GCAGGCCAGC GTCGCCGCCG CGCGCTCGAT GACCTGATCA AGAACACCCC GGCCGACGGT
CTCGTCACCG GCGTCGCCAC CGTCAACGCC GCGCAATTCG GCGAGCACGA TGCGCGCTGC
ATGGTGATCG CCTACGACTA CACGGTTCTG GCCGGCACCC AGGGCCATAT GAACCACAAG
AAGATCGACC GGATGCTGAC GCTGGTCGAG CAATGGAAGA TGCCGCTGGT GTTCTACGCC
GAAGGCGGCG GCGGCCGTCC CGGCGACACC GACCGGCTCG GCCTCACCGG CCTCGACGGG
CCGTCCTTCG TGCAGTTCGC GCGATTGTCC GGCCTGGTGC CGGTGATCGG CGTGGTTTCC
GGCTATTGCT TCGCCGGCAA TGCGGCGATG CTCGGCTGCT GCGACGTGAT CATCGCGACG
CAAAACGCCT CGATCGGCAT GGGCGGCCCG GCGATGATCG AGGGCGGCGG CCTCGGCGTG
TATCACCCGG CCGAAGTCGG CCCGGTGTCG TTCCAGTCGC CGAACGGCGT GGTCGATATT
CTGGTCGAGG ACGAGGAAGA GGCGACGCGG GTCGCGCAGA AATATCTGTC CTACTTCCAG
GGCCCGGTGA AGGACTGGCG CGCGGCGGAC CAGCGGCTGC TGCGCCGCGC GATTCCCGAA
AATCGTCTGC GGGTCTACGA CGTCCGCCAC GTCATCGATC TGATCGCGGA CGAAGATTCG
GTGCTGGAAA TCCGCCGCGA CTTCGGCGTC GGCATGGTCA CCGCCTTCAT CCGCATCGAG
GGCAAGCCGT TCGGCCTGAT CGCCAACAAT CCGAAGCATC TCGGCGGCGC GATCGACGCG
GCGGCCGGCG ACAAGGCGGC GCGTTTCCTG CAGCTGTGCG ACGCCTTCGA TATTCCGATC
GTGTCGCTGT GCGATACGCC CGGCTTCATG GTCGGCCCCG AAGCCGAGAA GACCGCGATC
GTACGGCACG TCGCGCGGAT GTTCGTCACC GGCGCCAGCC TGACGGTGCC GCTGTTCGGC
ATCGTGCTGC GCAAGGGCTA CGGGCTCGGC GCGCAGTCGA TGATCGGCGG CGGCTTCCAC
GCCTCGTTTT TCACCGCGGC GTGGCCGACC GGCGAATTCG GCGGCATGGG GCTGGAGGGC
TATGTCCGCC TCGGCTTCCG CAAGGAGATG GAAGCGATCG CCGACCCGGT CGAGCGCGAG
ACCTACTACA AGAACAAGGT CGCCGAGATG TACGCCAACG GCAAGGCGGT CTCGATCGCG
TCGGTGTGTG AGATCGACAA CGTGATCGAT CCCGCCGAGA CGCGGCGCTG GATCATGGCC
GGGCTGCGCT CGGTACCGAC GCCACCGCAG CGCGAGGGGC GCAAGCGGCC CTGCATCGAC
GCCTGGTAG
 
Protein sequence
MPFRKLLIAN RGEIAIRIAR AAADAGLATV AIHPADDAAS LHVRIADEAR EIPGRGARAY 
LDIEGVIAAA KAAHCDALHP GYGFLSENPD LARRCAEEGI RFIGPSPEAL RLFGDKVAAK
ELAKKSGVPI IDGTQGPSTL DEVKAFFGSL GDHAAVMIKA MAGGGGRGMR VVERAEDLDE
AYARCQSEAK AAFGSDGVYA ERLIRNARHI EVQIIGDRHG GISQLWEREC TIQRRNQKLV
EIAPSPSLSD SLRGRILEAA KTLAGAANYD SLGTFEFLVD GEAGEGDGAF AFIEANPRLQ
VEHTVTEEVL AIDLVRSQIA VAGGATLESL GLDQAAVPRP RGFAMQLRIN METMDASGAT
HPTGGTLSIF EPPSGPGVRV DTFGYAGYRT SAAFDSLLAK VIVHTPSQWP DVVAKAARSL
REFRIDGVAT NIPFIQAILA HPDFKANKVS TSFIDRNVAE LVGAADKLAA PLIALPGGDA
QHGGAKAAVE AAPEGAVVIA APLQGTVVAI TVAEGDVVRP GQQLAVIESM KMEHLVAAEQ
GGRIRRIVTA DGVTLMQGEA ILYLEPQDVE GDLAVKEAEV DLDHIRPDLA EMLARQGNTL
DENRPDSVAR RRKTNQRTAR ENIAQLVDDG SFMEYGSLAI AGQRRRRALD DLIKNTPADG
LVTGVATVNA AQFGEHDARC MVIAYDYTVL AGTQGHMNHK KIDRMLTLVE QWKMPLVFYA
EGGGGRPGDT DRLGLTGLDG PSFVQFARLS GLVPVIGVVS GYCFAGNAAM LGCCDVIIAT
QNASIGMGGP AMIEGGGLGV YHPAEVGPVS FQSPNGVVDI LVEDEEEATR VAQKYLSYFQ
GPVKDWRAAD QRLLRRAIPE NRLRVYDVRH VIDLIADEDS VLEIRRDFGV GMVTAFIRIE
GKPFGLIANN PKHLGGAIDA AAGDKAARFL QLCDAFDIPI VSLCDTPGFM VGPEAEKTAI
VRHVARMFVT GASLTVPLFG IVLRKGYGLG AQSMIGGGFH ASFFTAAWPT GEFGGMGLEG
YVRLGFRKEM EAIADPVERE TYYKNKVAEM YANGKAVSIA SVCEIDNVID PAETRRWIMA
GLRSVPTPPQ REGRKRPCID AW