Gene RPD_2188 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2188 
Symbol 
ID4022673 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2442099 
End bp2445650 
Gene Length3552 bp 
Protein Length1183 aa 
Translation table11 
GC content66% 
IMG OID637962383 
Productallophanate hydrolase subunit 2 
Protein accessionYP_569324 
Protein GI91976665 
COG category[C] Energy production and conversion
[E] Amino acid transport and metabolism
[I] Lipid transport and metabolism 
COG ID[COG1038] Pyruvate carboxylase
[COG1984] Allophanate hydrolase subunit 2
[COG2049] Allophanate hydrolase subunit 1
[COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit 
TIGRFAM ID[TIGR00724] biotin-dependent carboxylase uncharacterized domain
[TIGR02712] urea carboxylase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGTCA AGGTTCTGAT TGCCAACCGC GGCGAAATCG CCTCGCGGAT CGGCCGCACG 
CTGTGCCGGA TGGGGATCGC GTCGGTGGCG GTCTATTCCG ACGCCGACCG CTTCACCCGC
GCCGTGCGCG ACGCCGATGA GGCGGTGCGG GTCGGGCTTG CGCCCGCGGC GGAGAGCTAT
CTCAATGTCG ACGCCATCGT CGACGCCTGC CTCGCCACTG GCGCGCAAGC GGTGCATCCC
GGCTATGGCT TTCTGTCGGA GAACCGTGGC TTCGCCGAAC GACTCGCGCA GCACGGCATC
GCCTTTATCG GCCCGCGGCC GGAGCATCTC GATGCCTTCG GCCTGAAGCA CAAGGCTCGT
GAGATCGCGC AGCAAAGTGG CGTTCCACTG CTGCCGGGCT CGGATTTGCT GGAGACGATC
GATGACGCGC TGGTGGAGGC CGCGCGGATC GGCTTTCCCT TGATGTTGAA GAGCACCGCC
GGTGGCGGCG GCATCGGCAT GCAGCTTTGC CATGATGAAG CAACGCTGCG CGAGCGCTTC
GCCACCGTGC AGCGCACCGC GCGCGCCAGT TTTGGCGATG CGCGGGTCTA TCTCGAACGC
TACGTCGCGG TCGCCCGCCA TATCGAGGTG CAGATTTTTG GCGACGGCAA AGGTCATGTG
GTGGCGCTCG GCGAGCGCGA CTGCTCGCTG CAACGGCGCA ATCAGAAAGT GGTCGAAGAA
ACCCCGGCGC CGGGCATTTC GGAAGAGATG CGCCGACGGT TGCATCAGGC CGCGGTGACG
CTCGGACAGA GCGTCGCTTA CGAATCCGCC GGCACCGTGG AGTTCATCTA CGACGTCGAG
CGTGAGGATT TCTATTTCCT CGAGGTCAAC ACCCGACTGC AGGTCGAGCA TCCGGTTACA
GAGGCGGTGT TCGGTATCGA CCTGGTGGAA TGGATGGTGC GGCAGGCCGC TGGAGATAGT
CCGCTCGCCA GCTACAATCC GCGCGAACCC GAGGGCGCGG CGATCGAAGT GCGGCTCTAC
GCCGAAAATC CCAACGCCGG CTTCCGCCCC AGCGCTGGCC GACTGACCCG AGTGTCGTTT
CCGGAAACTG CGCGGATCGA CGGTTGGATC GAGACGGGGG TCGAGGTGAC GCCGTTCTAC
GATCCGATGC TGGCGAAGAT CATCGTTCAT GCCGAGGACC GACCGCGCGC GATCCGGAAG
CTGATCGATG CTCTTGATCG GTCGATCGTG GCAGGCATCG AAACCAACAT CGATTACTTG
CGTGCGATCG CCGGCGCCGA GGTGTTTCAG AGCGGCCGTG TCGCCACCAA TGTGCTGGCG
AATTTCGGAT TTGCGCCGAG CACCGTCGAC GTGCTGGCGC CGGGCGCTCA ATCGGGGCTG
CAGGAATTGC CGGGGCGGCT GCATCTATGG CATGTCGGCG TGCCGCCGAG CGGGCCGATG
GACGAGCGCT CGTTCCGGCT CGCCAACCGC ATCGTCGGCA ATCCGGAGAC CACCGCCGCG
CTGGAATTGA CGGTGAACGG CCCGACGCTG CGCTTCAATA CCGAGGCGGT GGTGGCGCTG
GCCGGTGCGC GGATGATTGC GAAGCTGGAT GGCGTCGAGG TTCCGCATCA CGCGCCGCTC
GCCATCAAGC CCGGCCAGAC GCTGGCGATC GGCAAGGTCG ATGGACCCGG CCAGCGCTGC
TACCTCGCGG TGCGCGGCGG CTTCGATGCG CCGGAATTCC TCGGCTCGCG CGCGGTGTTC
ATGCTTGGTG CGTTCGGAGG CCACGCGACC GGCGCGCTGA AGGCCGGTGA CGTGCTGCAC
ATCGCGACGC CTCACGTCTC GCTGCCGGCG CCGCGCGCCG CAGCGCCGGC AGAGCTTCCG
CCGCTCACCC GCGAATGGCG TCTCGGTGTG ATCTACGGCC CGCACGGCGC GCCGGACTTC
TTCCGTGAGG ACGACATCGC GACGCTGTTC GAGGCGAATT ACGAGGTGCA TTTCAATTCG
GCGCGGACCG GCGTGCGGTT GATCGGTCCG AAGCCGCAAT GGGCGCGGAC CGATGGCGGC
GAGGCAGGGC TGCATCCGTC CAATATCCAC GACAATGCTT ATGCGATCGG GTCGATCGAT
TTCACCGGCG ACATGCCGAT CATCCTCGGT CCCGACGGGC CGAGTCTCGG CGGCTTCGTC
TGTCCGGCGG TGGTGGCGCG CGAGGAGTTG TGGAAGATTG GTCAGCTCAA GCCCGGCGAC
AAGCTTCGCT TCGTGCCGGT GCAGCGCGCG GACGATCCGG TGGTGGGGCC TACGGTGATG
CGTGCGCCGG CCGAACTCGG CTCGGCGATC GTCGGCCGCC ACGACGATAG CGAAATTCCA
GTGGTGTATC GCCGCGCCGG CGACGACAAT CTGCTGGTTG AATACGGGCC GATGGAGCTG
GACATCGCGC TTCGTCTGCG GGTGCATGTG CTCGCAGAGG CGGTGACGCG GGCGCAGCTG
CCTGGGCTGA TCGATCTCAC ACCGGGGATC CGCTCGTTGC AGATTCACTA TGACAGCACC
CGGCTGTCGC GCCGCAAGCT GCTCGATACG CTGGCCGAGC TGGAGTGCGA CCTGCCGCCG
GTCGATGCGA TGCAGGTGCC GAGCCGCATC GTGCATCTGC CGCTGTCGTG GAACGATCCG
CAGGCGGTGC TGGCGATGCG CAAATATCAG GAGCTGGTGC GGCCGGACGC GCCCTGGTGC
CCGTCCAACA TCGAGTTCAT CCGCCGCATC AATGGTCTTG CCGACGAGGA GGCGGTCAAG
CGCATCGTAT TCGACGCGAG CTATTTGGTG CTCGGTCTCG GCGACGTCTA TCTCGGCGCG
CCGGTGGCGA CGCCGGTCGA TCCGCGGCAC CGCCTGGTGA CCACAAAGTA CAATCCGGCG
CGGACCTGGA CGCCGGAGAA CGCGGTCGGC ATCGGCGGCG CTTACATGTG CATCTACGGG
ATGGAAGGGC CGGGCGGTTA TCAGCTGTTC GGCCGCACCA TCCAGATGTG GAACTCGTGG
CGCTCGACGC CGGAATTCGC GCCCGGTCAT CCCTGGCTGC TGCGGTTCTT CGACCAGATC
AGGTTCTTCC CGGTGACGGC GGACGAATTG CTCGAGGCGC GTGCGGCGTT TCCGCATGGC
GCCTATCCGT TGAAGATCGA GGAGACCACG TTCTGCTACG CGGACTACAA GGCGTTCCTG
GCGCGCGAGG TCGACAGCGT CACGACCGTC AAGGCACGTC AGCAGGCTGC GTTCGAGGCC
GAGCGGCAGC GCTGGCGCGA CACCCGGATC GAAGAAGTGG TGGACGATGA ATCCGCCTCT
GCGCTCGGCT CCGGCGGCGA CATCCCCGAC GGATGCATCG GCCAGTTCAC CGAGGCGCCG
GGCAATGTCT GGAAGCTCTC TGTCGCCGAA GGCGAGCGCG TCGAGATCGG TCAGACGCTG
GCCGTGATCG AATCGATGAA GATGGAGATC GCCATCGCTG CAACCGCCTG CGGCATCGTC
CGGGTGCTGC ACACCCGGCC GGGCCAGACC CTGCGCGCCG GCGACCTCCT GTGCGCTCTG
GAGACGGCGT GA
 
Protein sequence
MFVKVLIANR GEIASRIGRT LCRMGIASVA VYSDADRFTR AVRDADEAVR VGLAPAAESY 
LNVDAIVDAC LATGAQAVHP GYGFLSENRG FAERLAQHGI AFIGPRPEHL DAFGLKHKAR
EIAQQSGVPL LPGSDLLETI DDALVEAARI GFPLMLKSTA GGGGIGMQLC HDEATLRERF
ATVQRTARAS FGDARVYLER YVAVARHIEV QIFGDGKGHV VALGERDCSL QRRNQKVVEE
TPAPGISEEM RRRLHQAAVT LGQSVAYESA GTVEFIYDVE REDFYFLEVN TRLQVEHPVT
EAVFGIDLVE WMVRQAAGDS PLASYNPREP EGAAIEVRLY AENPNAGFRP SAGRLTRVSF
PETARIDGWI ETGVEVTPFY DPMLAKIIVH AEDRPRAIRK LIDALDRSIV AGIETNIDYL
RAIAGAEVFQ SGRVATNVLA NFGFAPSTVD VLAPGAQSGL QELPGRLHLW HVGVPPSGPM
DERSFRLANR IVGNPETTAA LELTVNGPTL RFNTEAVVAL AGARMIAKLD GVEVPHHAPL
AIKPGQTLAI GKVDGPGQRC YLAVRGGFDA PEFLGSRAVF MLGAFGGHAT GALKAGDVLH
IATPHVSLPA PRAAAPAELP PLTREWRLGV IYGPHGAPDF FREDDIATLF EANYEVHFNS
ARTGVRLIGP KPQWARTDGG EAGLHPSNIH DNAYAIGSID FTGDMPIILG PDGPSLGGFV
CPAVVAREEL WKIGQLKPGD KLRFVPVQRA DDPVVGPTVM RAPAELGSAI VGRHDDSEIP
VVYRRAGDDN LLVEYGPMEL DIALRLRVHV LAEAVTRAQL PGLIDLTPGI RSLQIHYDST
RLSRRKLLDT LAELECDLPP VDAMQVPSRI VHLPLSWNDP QAVLAMRKYQ ELVRPDAPWC
PSNIEFIRRI NGLADEEAVK RIVFDASYLV LGLGDVYLGA PVATPVDPRH RLVTTKYNPA
RTWTPENAVG IGGAYMCIYG MEGPGGYQLF GRTIQMWNSW RSTPEFAPGH PWLLRFFDQI
RFFPVTADEL LEARAAFPHG AYPLKIEETT FCYADYKAFL AREVDSVTTV KARQQAAFEA
ERQRWRDTRI EEVVDDESAS ALGSGGDIPD GCIGQFTEAP GNVWKLSVAE GERVEIGQTL
AVIESMKMEI AIAATACGIV RVLHTRPGQT LRAGDLLCAL ETA