Gene Information |
Locus tag | RPB_0228 |
Symbol | |
ID | 3909470 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 257855 |
End bp | 261163 |
Gene Length | 3309 bp |
Protein Length | 1102 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637882110 |
Product | pyruvate carboxylase |
Protein accession | YP_483850 |
Protein GI | 86747354 |
COG category | [C] Energy production and conversion [I] Lipid transport and metabolism |
COG ID | [COG1038] Pyruvate carboxylase [COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit [COG4799] Acetyl-CoA carboxylase, carboxyltransferase component (subunits alpha and beta) |
TIGRFAM ID | |
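The coordinates and lengths listed in this record can be cross-checked with a short sketch. It assumes that Start bp and End bp are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names are illustrative and not part of the source database.

```python
# Values copied from the record above.
START_BP = 257855
END_BP = 261163


def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))
```

Under those assumptions the result agrees with the Gene Length and Protein Length fields of the record.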
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCCGTTTC GCAAGCTGCT GATCGCCAAC CGCGGCGAGA TCGCGATCCG CATCGCGCGG GCCGCGGCCG ACGCCGGGCT CGCCACCGTC GCGATCCATC CCGCCGACGA TGCGGCGTCG CTGCATGTGC GGATCGCCGA CGAGGCGCGG GAGATTCCCG GGCGCGGCGC GCGGGCTTAT CTGGATATCG AGGGCGTGAT CGCGGCCGCC AAGGCGGCGC ATTGCGACGC GCTGCATCCC GGCTACGGCT TCCTCAGCGA GAACCCCGAT CTGGCGCGGC GCTGCGCCGA GGAAGGTATC CGGTTCATCG GTCCGTCGCC AGAAGCCTTG CGGCTATTCG GTGACAAAGT TGCGGCGAAG GAATTGGCCA AGAAATCCGG CGTGCCGATC ATCGACGGCA CCCAGGGTCC TTCGACTCTC GATGAGGTGA AGGCGTTCTT CGGCTCGCTC GGCGATCATG CCGCGGTAAT GATCAAGGCG ATGGCCGGCG GCGGCGGCCG CGGTATGCGT GTGGTCGAGC GCGCCGAGGA TCTCGACGAG GCCTATGCGC GCTGCCAGTC CGAGGCGAAG GCGGCGTTCG GCAGCGACGG CGTCTATGCC GAGCGGCTGA TCCGCAATGC GCGGCATATC GAGGTGCAGA TCATCGGCGA TCGTCACGGC GGCATCAGTC AGTTGTGGGA GCGCGAATGC ACGATCCAGC GCCGCAACCA GAAGCTGGTC GAGATCGCGC CGAGCCCGTC GCTGAGTGAT AGCTTGCGCG GCCGCATTCT CGAAGCCGCC AAGACGCTGG CTGGTGCGGC GAACTACGAC AGTCTCGGTA CGTTCGAATT CCTGGTCGAC GGCGAAGCCG GCGAGGGCGA CGGCGCGTTC GCGTTCATCG AGGCCAATCC GCGGCTGCAG GTCGAGCACA CCGTGACCGA GGAAGTCCTT GCGATCGATC TGGTGCGATC GCAGATCGCG GTGGCCGGCG GCGCGACGCT GGAATCGCTC GGGCTCGATC AGGCCGCGGT GCCGCGCCCG CGCGGCTTCG CGATGCAGCT CCGTATCAAC ATGGAGACGA TGGACGCGAG CGGCGCGACC CATCCGACCG GCGGCACGCT TTCGATCTTC GAGCCGCCGT CGGGTCCGGG CGTGCGCGTC GATACGTTCG GCTATGCCGG CTACAGGACC AGCGCGGCGT TCGACTCGCT GCTGGCAAAA GTGATCGTGC ACACGCCGTC GCAATGGCCG GATGTCGTCG CCAAGGCGGC GCGCAGCCTG CGTGAGTTCC GCATCGACGG CGTCGCCACC AACATCCCGT TCATCCAGGC GATCCTGGCG CATCCGGACT TCAAGGCCAA CAAGGTCAGC ACCAGCTTCA TCGACCGCAA CGTCGCCGAG CTGGTCGGCG CCGCGGACAA GCTCGCCGCG CCACTGATCG CGTTGCCGGG TGGTGATGCG CAGCACGGTG GCGCGAAGGC CGCGGTCGAA GCCGCGCCCG AAGGCGCGGT CGTGATCGCC GCGCCGCTGC AGGGCACCGT GGTGGCGATC ACGGTCGCGG AAGGCGACGT GGTGCGGCCG GGGCAGCAGC TCGCGGTGAT CGAATCGATG AAGATGGAGC ATCTCGTCGC CGCCGAGCAG GGCGGCCGGA TTCGCCGCAT CGTCACCGCC GACGGCGTGA CGCTGATGCA GGGCGAGGCG ATCCTCTATC TCGAGCCGCA GGACGTCGAG GGCGATCTGG CCGTCAAGGA GGCCGAAGTC GATCTCGACC ATATCCGTCC CGACCTCGCC GAGATGCTGG CGCGGCAGGG CAATACGCTC 
GACGAGAACC GGCCCGATTC GGTCGCGCGC CGGCGCAAGA CCAATCAGCG CACCGCGCGC GAGAACATCG CCCAGCTGGT CGACGACGGC TCCTTCATGG AATACGGCAG CCTCGCGATC GCAGGCCAGC GTCGCCGCCG CGCGCTCGAT GACCTGATCA AGAACACCCC GGCCGACGGT CTCGTCACCG GCGTCGCCAC CGTCAACGCC GCGCAATTCG GCGAGCACGA TGCGCGCTGC ATGGTGATCG CCTACGACTA CACGGTTCTG GCCGGCACCC AGGGCCATAT GAACCACAAG AAGATCGACC GGATGCTGAC GCTGGTCGAG CAATGGAAGA TGCCGCTGGT GTTCTACGCC GAAGGCGGCG GCGGCCGTCC CGGCGACACC GACCGGCTCG GCCTCACCGG CCTCGACGGG CCGTCCTTCG TGCAGTTCGC GCGATTGTCC GGCCTGGTGC CGGTGATCGG CGTGGTTTCC GGCTATTGCT TCGCCGGCAA TGCGGCGATG CTCGGCTGCT GCGACGTGAT CATCGCGACG CAAAACGCCT CGATCGGCAT GGGCGGCCCG GCGATGATCG AGGGCGGCGG CCTCGGCGTG TATCACCCGG CCGAAGTCGG CCCGGTGTCG TTCCAGTCGC CGAACGGCGT GGTCGATATT CTGGTCGAGG ACGAGGAAGA GGCGACGCGG GTCGCGCAGA AATATCTGTC CTACTTCCAG GGCCCGGTGA AGGACTGGCG CGCGGCGGAC CAGCGGCTGC TGCGCCGCGC GATTCCCGAA AATCGTCTGC GGGTCTACGA CGTCCGCCAC GTCATCGATC TGATCGCGGA CGAAGATTCG GTGCTGGAAA TCCGCCGCGA CTTCGGCGTC GGCATGGTCA CCGCCTTCAT CCGCATCGAG GGCAAGCCGT TCGGCCTGAT CGCCAACAAT CCGAAGCATC TCGGCGGCGC GATCGACGCG GCGGCCGGCG ACAAGGCGGC GCGTTTCCTG CAGCTGTGCG ACGCCTTCGA TATTCCGATC GTGTCGCTGT GCGATACGCC CGGCTTCATG GTCGGCCCCG AAGCCGAGAA GACCGCGATC GTACGGCACG TCGCGCGGAT GTTCGTCACC GGCGCCAGCC TGACGGTGCC GCTGTTCGGC ATCGTGCTGC GCAAGGGCTA CGGGCTCGGC GCGCAGTCGA TGATCGGCGG CGGCTTCCAC GCCTCGTTTT TCACCGCGGC GTGGCCGACC GGCGAATTCG GCGGCATGGG GCTGGAGGGC TATGTCCGCC TCGGCTTCCG CAAGGAGATG GAAGCGATCG CCGACCCGGT CGAGCGCGAG ACCTACTACA AGAACAAGGT CGCCGAGATG TACGCCAACG GCAAGGCGGT CTCGATCGCG TCGGTGTGTG AGATCGACAA CGTGATCGAT CCCGCCGAGA CGCGGCGCTG GATCATGGCC GGGCTGCGCT CGGTACCGAC GCCACCGCAG CGCGAGGGGC GCAAGCGGCC CTGCATCGAC GCCTGGTAG
Protein sequence | MPFRKLLIAN RGEIAIRIAR AAADAGLATV AIHPADDAAS LHVRIADEAR EIPGRGARAY LDIEGVIAAA KAAHCDALHP GYGFLSENPD LARRCAEEGI RFIGPSPEAL RLFGDKVAAK ELAKKSGVPI IDGTQGPSTL DEVKAFFGSL GDHAAVMIKA MAGGGGRGMR VVERAEDLDE AYARCQSEAK AAFGSDGVYA ERLIRNARHI EVQIIGDRHG GISQLWEREC TIQRRNQKLV EIAPSPSLSD SLRGRILEAA KTLAGAANYD SLGTFEFLVD GEAGEGDGAF AFIEANPRLQ VEHTVTEEVL AIDLVRSQIA VAGGATLESL GLDQAAVPRP RGFAMQLRIN METMDASGAT HPTGGTLSIF EPPSGPGVRV DTFGYAGYRT SAAFDSLLAK VIVHTPSQWP DVVAKAARSL REFRIDGVAT NIPFIQAILA HPDFKANKVS TSFIDRNVAE LVGAADKLAA PLIALPGGDA QHGGAKAAVE AAPEGAVVIA APLQGTVVAI TVAEGDVVRP GQQLAVIESM KMEHLVAAEQ GGRIRRIVTA DGVTLMQGEA ILYLEPQDVE GDLAVKEAEV DLDHIRPDLA EMLARQGNTL DENRPDSVAR RRKTNQRTAR ENIAQLVDDG SFMEYGSLAI AGQRRRRALD DLIKNTPADG LVTGVATVNA AQFGEHDARC MVIAYDYTVL AGTQGHMNHK KIDRMLTLVE QWKMPLVFYA EGGGGRPGDT DRLGLTGLDG PSFVQFARLS GLVPVIGVVS GYCFAGNAAM LGCCDVIIAT QNASIGMGGP AMIEGGGLGV YHPAEVGPVS FQSPNGVVDI LVEDEEEATR VAQKYLSYFQ GPVKDWRAAD QRLLRRAIPE NRLRVYDVRH VIDLIADEDS VLEIRRDFGV GMVTAFIRIE GKPFGLIANN PKHLGGAIDA AAGDKAARFL QLCDAFDIPI VSLCDTPGFM VGPEAEKTAI VRHVARMFVT GASLTVPLFG IVLRKGYGLG AQSMIGGGFH ASFFTAAWPT GEFGGMGLEG YVRLGFRKEM EAIADPVERE TYYKNKVAEM YANGKAVSIA SVCEIDNVID PAETRRWIMA GLRSVPTPPQ REGRKRPCID AW
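The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the gene sequence is given in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for internal codons. The helper name `translate` is hypothetical, and the snippet handles only unambiguous A/C/G/T bases.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, copied from the record above.
print(translate("ATGCCGTTTC GCAAGCTGCT GATCGCCAAC"))
```

Applied to the first three 10-base groups of the gene sequence, this yields the first ten residues of the listed protein sequence.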