Gene RPC_4531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4531 
Symbol 
ID3972080 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp5059801 
End bp5063352 
Gene Length3552 bp 
Protein Length1183 aa 
Translation table11 
GC content67% 
IMG OID637927642 
Productallophanate hydrolase subunit 2 
Protein accessionYP_534372 
Protein GI90426002 
COG category[C] Energy production and conversion
[E] Amino acid transport and metabolism
[I] Lipid transport and metabolism 
COG ID[COG1038] Pyruvate carboxylase
[COG2049] Allophanate hydrolase subunit 1
[COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit 
TIGRFAM ID[TIGR00724] biotin-dependent carboxylase uncharacterized domain
[TIGR02712] urea carboxylase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGACA AGGTCTTGAT TGCCAACCGC GGCGAAATCG CCTCGCGGAT CGGCAAAACG 
TTGCGCCGCA TCGGCATCGC TTCGGTGGCG GTGTATTCCG ACGCCGACCG CTTCACCCGC
GCGGTGCTCG ACGCCGACCA GGCGGTGCGG GTCGGCGCTT CGCCTGCGGC GGAGAGCTAT
CTCAACATCG ATGCGATCAT CAAAGCGTGT CGAGAGACCG GCGCGCAGGC GGTGCATCCT
GGCTACGGCT TTCTCTCCGA GAACCGCGGC TTTGCCGAGC GCCTGGCCTC GCACGGCATC
GTCTTCATCG GGCCGCGGCC GGAGCATCTC GATGCCTTTG GCTTGAAACA CAAGGCGCGC
GAATTGGCGC AGCACAGCAA CGTGCCGCTG CTGCCGGGCT CCGGCCTGTT GGAGACGATC
GATGAGGCGA TGCGCGAGGC CGAGCGCATC GGCTTTCCCC TGATGCTGAA GAGCACCGCG
GGCGGCGGCG GCATCGGCAT GCAGCTGTGC CACGACGCCG ACACGCTGCG CGAGCGCTTC
GCCACCGTGC AGCGCACCGC AAAGGCGAGC TTCGGCGACG CCCGAGTCTA TCTGGAGCGT
TTCGTCGCCG AGGCGCGCCA CATCGAGGTG CAGATATTCG GCGACGGTCA GGGCAAAGTG
ATCGCGCTCG GCGAGCGCGA CTGCTCGCTG CAGCGGCGCA ACCAGAAAGT GATCGAGGAG
ACCCCGGCGC CGGGCCTCAG TGACGAACTC CGTACGCGGC TGCATCGCGC TGCGGTGGCG
TTGGGCGAGA GCGTCGCCTA TCAGTCCGCC GGCACCGTCG AATTCATCTA CGACGTCGCG
CGCGAGGACT TCTATTTTCT CGAGGTGAAC ACAAGGCTGC AGGTCGAGCA TCCGGTGACC
GAAGCGGTGT TCGGCGTCGA TCTGGTGGAA TGGATGGTGC GGCAGGCGGC TGGCGACAGC
CCGCTTGCGA ACTACACGCC GCGCCCGCCG CAGGGCGCCG CGATCGAGGT GCGGCTCTAC
GCGGAAAATC CCAACGCCGG CTTTCGCCCC AGCGCCGGCC GGCTGACGGA TGTGGAATTT
CCGGACGGCG TGCGGGTCGA TGGTTGGATC GAGACCGGCG TCGAGGTGAC GCCGTTCTAC
GATCCGATGC TGGCCAAGCT GATCGTGCAT GCCGACAGCC GCGATCAGGC GATTGATAAA
CTACGCGACG CGCTGCAGCA ATCCCGCGTC GCCGGCATCG AGACCAATCT GGACTATCTG
CGCGCCATCG CCGGCTCCGA GCTGTTTCGC TCTGGCCGCG TCGCCACCAA TGTACTTGCG
AGTTTCGCGT TCGCACCCCG CACCATCGAC GTGCTGGCGC CCGGCGCGCA GTCCGGCTTG
CAGGAATTGC CGGGGCGGCT GCATCTGTGG CACGTCGGCG TGCCGCCGTC GGGGCCGATG
GACGAGCGCT CGTTCCGGCT TGCTAATATC ATTGTCGGCA ATCCCGAGGT CACCGCCGCG
CTCGAACTCA CCGTCAACGG GCCGACGCTG CGTTTCAACA CCGACGCGGT GATCGGGCTG
GCGGGCGCCC ACATGCTGGC GAAGCTCGAC GGCGTCGCGA TCGCGCATCA CGCGCCGGTA
GCTGTGAAAG CGGGGCAGAC GCTGCAAATC GGCAAGATCG ACGGCGCCGG GCAGCGCTGT
TACCTCGCGG TTCGCGGCGG CTTCGACGCG CCGCAGATTC TTGGCTCACG CGCGGTGTTC
ATGCTGGGCG CGTTCGGCGG CCATTCGACC GGTGCGCTGA AACCCGGCGA CGTGCTGCAT
ATCGTTGCGG AGAGCGACGC TCTCGCCGCA CCCCGCGCGG CTACGCCTGA CGAGATCCCG
CCGTTCACCC GCGCCTGGCA GATCGGCGTG ATCTACGGCC CGCACGGCGC GCCGGATTTC
TTCCGCGACG ACGATATCGC GACGTTGTTT TCCACCGACT ACGAGGTGCA TTTCAATTCC
GCGCGCACCG GCGTCCGGCT GATCGGGCCG AAGCCGCAAT GGGCGCGCGC CGACGGCGGC
GAGGCGGGCC TGCATCCCTC GAATATTCAC GACAACGCCT ATGCGGTCGG CTCGCTGGAT
TTCACCGGCG ACATGCCGAT CATTCTCGGT CCCGATGGCC CGAGCCTCGG CGGCTTCGTC
TGCCCTGCGG TGGTGGCGCG TGACGAATTG TGGAAGATCG GCCAGCTCAA GCCTGGCGAC
AAGGTTCGCT TCGTGCCGCT GCCGCGCGAT GACGACCCGG TCGCCGGCCC GACGGTGCTC
GCCGGGCCGC GCGAGCTTGG CTCGGCGATC GTGGCGGGGC GCGACGACGG CGCCATGCCA
GTGGTCTATC GCCGCGCCGG CGACGACAAT CTATTGGTCG AATACGGCCC GATGGAATTG
GACATCGCGC TGCGGTTGCG GGTGCAGCTG TTGGCCGACG CGGTGGCCGC GGCGAAATTG
CCCGGGCTGA TCGATCTCAC CCCAGGTATC CGCTCGTTGC AGATCCATTA CGACGGCGCC
ACGCTATCGC GCCGCAAACT GCTCGACGCG CTGGCCGCCA TCGAAGGCGA ACTGCCGGCG
GTCGATGCCA TGCGCGTGCC GAGCCGCGTC GTGCATCTGC CGCTGTCGTG GAACGACCCG
CAGGCGGTCA AGGCGATGCA CAAATATCAG GAACTGGTGC GGCCTGATGC GCCGTGGTGC
CCGTCGAACA TCGATTTCAT CCGGCGCATC AACGGGCTCG ACGATGAGGC CGCGGTGCAG
CGCATCGTGT TCGACGCCAG CTATTTGGTG CTTGGCCTCG GCGACGTCTA TCTCGGCGCG
CCGGTGGCGA CCCCGGTCGA TCCGCGGCAT CGGCTGGTGA CCACGAAGTA CAATCCAGCG
CGGACCTGGA CGCCGGAAAA CGCCGTCGGC ATCGGCGGCG CCTATATGTG CATCTACGGC
ATGGAAGGGC CGGGCGGTTA TCAACTGTTC GGCCGCACCA TCCAGATGTG GAATTCCTGG
CGCTCGACGC CGGAATTCAC CCCCGGCCAT CCGTGGCTGT TGCGGTTCTT CGACCAGATC
AGATTCTTCC CGGTCAGCGC CTCCGAATTG CTGGAGGCCC GCGAGGCGTT TCCGCACGGC
CAATATCCCC TGCGCATCGA GGAAACGGTG TTTTCCTATG CGGACTACGC CAAGGGGCTG
GCGCGGGATC AGGATAGCAT CGCGGCGTTC AAGCAGCGCC AGCAGGCGGC GTTCGAGGCC
GAGCGGCAGC GCTGGAAACA ATTGCGGCTC GACGCGGTTC AGGATGATGA GTCGGCCGGT
GCAGAAGCCG CGCCCGACGA CATCCCCGAC GGTGCGACCG GGGTGTTTTC CGAAGTGCCG
GGCAACGTCT GGAAGATTCT GGTCGACGAA GGCGCCATGG TCGCGGCCGG CGACACGCTG
GCGATCATCG AATCGATGAA GATGGAGATC AGCGTGCCGG CGCCGGTCGC CGGACGCTTG
GCGTCGATCC GCATCAAGCC GGGGCAGACG CTGCGCGCCG GAGACGTGGT GGCGGTGATT
GCGGAGGGGT GA
 
Protein sequence
MFDKVLIANR GEIASRIGKT LRRIGIASVA VYSDADRFTR AVLDADQAVR VGASPAAESY 
LNIDAIIKAC RETGAQAVHP GYGFLSENRG FAERLASHGI VFIGPRPEHL DAFGLKHKAR
ELAQHSNVPL LPGSGLLETI DEAMREAERI GFPLMLKSTA GGGGIGMQLC HDADTLRERF
ATVQRTAKAS FGDARVYLER FVAEARHIEV QIFGDGQGKV IALGERDCSL QRRNQKVIEE
TPAPGLSDEL RTRLHRAAVA LGESVAYQSA GTVEFIYDVA REDFYFLEVN TRLQVEHPVT
EAVFGVDLVE WMVRQAAGDS PLANYTPRPP QGAAIEVRLY AENPNAGFRP SAGRLTDVEF
PDGVRVDGWI ETGVEVTPFY DPMLAKLIVH ADSRDQAIDK LRDALQQSRV AGIETNLDYL
RAIAGSELFR SGRVATNVLA SFAFAPRTID VLAPGAQSGL QELPGRLHLW HVGVPPSGPM
DERSFRLANI IVGNPEVTAA LELTVNGPTL RFNTDAVIGL AGAHMLAKLD GVAIAHHAPV
AVKAGQTLQI GKIDGAGQRC YLAVRGGFDA PQILGSRAVF MLGAFGGHST GALKPGDVLH
IVAESDALAA PRAATPDEIP PFTRAWQIGV IYGPHGAPDF FRDDDIATLF STDYEVHFNS
ARTGVRLIGP KPQWARADGG EAGLHPSNIH DNAYAVGSLD FTGDMPIILG PDGPSLGGFV
CPAVVARDEL WKIGQLKPGD KVRFVPLPRD DDPVAGPTVL AGPRELGSAI VAGRDDGAMP
VVYRRAGDDN LLVEYGPMEL DIALRLRVQL LADAVAAAKL PGLIDLTPGI RSLQIHYDGA
TLSRRKLLDA LAAIEGELPA VDAMRVPSRV VHLPLSWNDP QAVKAMHKYQ ELVRPDAPWC
PSNIDFIRRI NGLDDEAAVQ RIVFDASYLV LGLGDVYLGA PVATPVDPRH RLVTTKYNPA
RTWTPENAVG IGGAYMCIYG MEGPGGYQLF GRTIQMWNSW RSTPEFTPGH PWLLRFFDQI
RFFPVSASEL LEAREAFPHG QYPLRIEETV FSYADYAKGL ARDQDSIAAF KQRQQAAFEA
ERQRWKQLRL DAVQDDESAG AEAAPDDIPD GATGVFSEVP GNVWKILVDE GAMVAAGDTL
AIIESMKMEI SVPAPVAGRL ASIRIKPGQT LRAGDVVAVI AEG