Gene Information |
Locus tag | RPC_4531 |
Symbol | |
ID | 3972080 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 5059801 |
End bp | 5063352 |
Gene Length | 3552 bp |
Protein Length | 1183 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637927642 |
Product | allophanate hydrolase subunit 2 |
Protein accession | YP_534372 |
Protein GI | 90426002 |
COG category | [C] Energy production and conversion [E] Amino acid transport and metabolism [I] Lipid transport and metabolism |
COG ID | [COG1038] Pyruvate carboxylase [COG2049] Allophanate hydrolase subunit 1 [COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit |
TIGRFAM ID | [TIGR00724] biotin-dependent carboxylase uncharacterized domain [TIGR02712] urea carboxylase |
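The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 5059801
END_BP = 5063352
GENE_LENGTH_BP = 3552
PROTEIN_LENGTH_AA = 1183


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = span_length(START_BP, END_BP)
    residues = expected_protein_length(length)
    print("gene length matches record:", length == GENE_LENGTH_BP)
    print("protein length matches record:", residues == PROTEIN_LENGTH_AA)
```

Under these assumptions the record is internally consistent: 5063352 - 5059801 + 1 = 3552 bp, and 3552 / 3 - 1 = 1183 aa.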
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCGACA AGGTCTTGAT TGCCAACCGC GGCGAAATCG CCTCGCGGAT CGGCAAAACG TTGCGCCGCA TCGGCATCGC TTCGGTGGCG GTGTATTCCG ACGCCGACCG CTTCACCCGC GCGGTGCTCG ACGCCGACCA GGCGGTGCGG GTCGGCGCTT CGCCTGCGGC GGAGAGCTAT CTCAACATCG ATGCGATCAT CAAAGCGTGT CGAGAGACCG GCGCGCAGGC GGTGCATCCT GGCTACGGCT TTCTCTCCGA GAACCGCGGC TTTGCCGAGC GCCTGGCCTC GCACGGCATC GTCTTCATCG GGCCGCGGCC GGAGCATCTC GATGCCTTTG GCTTGAAACA CAAGGCGCGC GAATTGGCGC AGCACAGCAA CGTGCCGCTG CTGCCGGGCT CCGGCCTGTT GGAGACGATC GATGAGGCGA TGCGCGAGGC CGAGCGCATC GGCTTTCCCC TGATGCTGAA GAGCACCGCG GGCGGCGGCG GCATCGGCAT GCAGCTGTGC CACGACGCCG ACACGCTGCG CGAGCGCTTC GCCACCGTGC AGCGCACCGC AAAGGCGAGC TTCGGCGACG CCCGAGTCTA TCTGGAGCGT TTCGTCGCCG AGGCGCGCCA CATCGAGGTG CAGATATTCG GCGACGGTCA GGGCAAAGTG ATCGCGCTCG GCGAGCGCGA CTGCTCGCTG CAGCGGCGCA ACCAGAAAGT GATCGAGGAG ACCCCGGCGC CGGGCCTCAG TGACGAACTC CGTACGCGGC TGCATCGCGC TGCGGTGGCG TTGGGCGAGA GCGTCGCCTA TCAGTCCGCC GGCACCGTCG AATTCATCTA CGACGTCGCG CGCGAGGACT TCTATTTTCT CGAGGTGAAC ACAAGGCTGC AGGTCGAGCA TCCGGTGACC GAAGCGGTGT TCGGCGTCGA TCTGGTGGAA TGGATGGTGC GGCAGGCGGC TGGCGACAGC CCGCTTGCGA ACTACACGCC GCGCCCGCCG CAGGGCGCCG CGATCGAGGT GCGGCTCTAC GCGGAAAATC CCAACGCCGG CTTTCGCCCC AGCGCCGGCC GGCTGACGGA TGTGGAATTT CCGGACGGCG TGCGGGTCGA TGGTTGGATC GAGACCGGCG TCGAGGTGAC GCCGTTCTAC GATCCGATGC TGGCCAAGCT GATCGTGCAT GCCGACAGCC GCGATCAGGC GATTGATAAA CTACGCGACG CGCTGCAGCA ATCCCGCGTC GCCGGCATCG AGACCAATCT GGACTATCTG CGCGCCATCG CCGGCTCCGA GCTGTTTCGC TCTGGCCGCG TCGCCACCAA TGTACTTGCG AGTTTCGCGT TCGCACCCCG CACCATCGAC GTGCTGGCGC CCGGCGCGCA GTCCGGCTTG CAGGAATTGC CGGGGCGGCT GCATCTGTGG CACGTCGGCG TGCCGCCGTC GGGGCCGATG GACGAGCGCT CGTTCCGGCT TGCTAATATC ATTGTCGGCA ATCCCGAGGT CACCGCCGCG CTCGAACTCA CCGTCAACGG GCCGACGCTG CGTTTCAACA CCGACGCGGT GATCGGGCTG GCGGGCGCCC ACATGCTGGC GAAGCTCGAC GGCGTCGCGA TCGCGCATCA CGCGCCGGTA GCTGTGAAAG CGGGGCAGAC GCTGCAAATC GGCAAGATCG ACGGCGCCGG GCAGCGCTGT TACCTCGCGG TTCGCGGCGG CTTCGACGCG CCGCAGATTC TTGGCTCACG CGCGGTGTTC ATGCTGGGCG CGTTCGGCGG CCATTCGACC GGTGCGCTGA AACCCGGCGA CGTGCTGCAT 
ATCGTTGCGG AGAGCGACGC TCTCGCCGCA CCCCGCGCGG CTACGCCTGA CGAGATCCCG CCGTTCACCC GCGCCTGGCA GATCGGCGTG ATCTACGGCC CGCACGGCGC GCCGGATTTC TTCCGCGACG ACGATATCGC GACGTTGTTT TCCACCGACT ACGAGGTGCA TTTCAATTCC GCGCGCACCG GCGTCCGGCT GATCGGGCCG AAGCCGCAAT GGGCGCGCGC CGACGGCGGC GAGGCGGGCC TGCATCCCTC GAATATTCAC GACAACGCCT ATGCGGTCGG CTCGCTGGAT TTCACCGGCG ACATGCCGAT CATTCTCGGT CCCGATGGCC CGAGCCTCGG CGGCTTCGTC TGCCCTGCGG TGGTGGCGCG TGACGAATTG TGGAAGATCG GCCAGCTCAA GCCTGGCGAC AAGGTTCGCT TCGTGCCGCT GCCGCGCGAT GACGACCCGG TCGCCGGCCC GACGGTGCTC GCCGGGCCGC GCGAGCTTGG CTCGGCGATC GTGGCGGGGC GCGACGACGG CGCCATGCCA GTGGTCTATC GCCGCGCCGG CGACGACAAT CTATTGGTCG AATACGGCCC GATGGAATTG GACATCGCGC TGCGGTTGCG GGTGCAGCTG TTGGCCGACG CGGTGGCCGC GGCGAAATTG CCCGGGCTGA TCGATCTCAC CCCAGGTATC CGCTCGTTGC AGATCCATTA CGACGGCGCC ACGCTATCGC GCCGCAAACT GCTCGACGCG CTGGCCGCCA TCGAAGGCGA ACTGCCGGCG GTCGATGCCA TGCGCGTGCC GAGCCGCGTC GTGCATCTGC CGCTGTCGTG GAACGACCCG CAGGCGGTCA AGGCGATGCA CAAATATCAG GAACTGGTGC GGCCTGATGC GCCGTGGTGC CCGTCGAACA TCGATTTCAT CCGGCGCATC AACGGGCTCG ACGATGAGGC CGCGGTGCAG CGCATCGTGT TCGACGCCAG CTATTTGGTG CTTGGCCTCG GCGACGTCTA TCTCGGCGCG CCGGTGGCGA CCCCGGTCGA TCCGCGGCAT CGGCTGGTGA CCACGAAGTA CAATCCAGCG CGGACCTGGA CGCCGGAAAA CGCCGTCGGC ATCGGCGGCG CCTATATGTG CATCTACGGC ATGGAAGGGC CGGGCGGTTA TCAACTGTTC GGCCGCACCA TCCAGATGTG GAATTCCTGG CGCTCGACGC CGGAATTCAC CCCCGGCCAT CCGTGGCTGT TGCGGTTCTT CGACCAGATC AGATTCTTCC CGGTCAGCGC CTCCGAATTG CTGGAGGCCC GCGAGGCGTT TCCGCACGGC CAATATCCCC TGCGCATCGA GGAAACGGTG TTTTCCTATG CGGACTACGC CAAGGGGCTG GCGCGGGATC AGGATAGCAT CGCGGCGTTC AAGCAGCGCC AGCAGGCGGC GTTCGAGGCC GAGCGGCAGC GCTGGAAACA ATTGCGGCTC GACGCGGTTC AGGATGATGA GTCGGCCGGT GCAGAAGCCG CGCCCGACGA CATCCCCGAC GGTGCGACCG GGGTGTTTTC CGAAGTGCCG GGCAACGTCT GGAAGATTCT GGTCGACGAA GGCGCCATGG TCGCGGCCGG CGACACGCTG GCGATCATCG AATCGATGAA GATGGAGATC AGCGTGCCGG CGCCGGTCGC CGGACGCTTG GCGTCGATCC GCATCAAGCC GGGGCAGACG CTGCGCGCCG GAGACGTGGT GGCGGTGATT GCGGAGGGGT GA
|
Protein sequence | MFDKVLIANR GEIASRIGKT LRRIGIASVA VYSDADRFTR AVLDADQAVR VGASPAAESY LNIDAIIKAC RETGAQAVHP GYGFLSENRG FAERLASHGI VFIGPRPEHL DAFGLKHKAR ELAQHSNVPL LPGSGLLETI DEAMREAERI GFPLMLKSTA GGGGIGMQLC HDADTLRERF ATVQRTAKAS FGDARVYLER FVAEARHIEV QIFGDGQGKV IALGERDCSL QRRNQKVIEE TPAPGLSDEL RTRLHRAAVA LGESVAYQSA GTVEFIYDVA REDFYFLEVN TRLQVEHPVT EAVFGVDLVE WMVRQAAGDS PLANYTPRPP QGAAIEVRLY AENPNAGFRP SAGRLTDVEF PDGVRVDGWI ETGVEVTPFY DPMLAKLIVH ADSRDQAIDK LRDALQQSRV AGIETNLDYL RAIAGSELFR SGRVATNVLA SFAFAPRTID VLAPGAQSGL QELPGRLHLW HVGVPPSGPM DERSFRLANI IVGNPEVTAA LELTVNGPTL RFNTDAVIGL AGAHMLAKLD GVAIAHHAPV AVKAGQTLQI GKIDGAGQRC YLAVRGGFDA PQILGSRAVF MLGAFGGHST GALKPGDVLH IVAESDALAA PRAATPDEIP PFTRAWQIGV IYGPHGAPDF FRDDDIATLF STDYEVHFNS ARTGVRLIGP KPQWARADGG EAGLHPSNIH DNAYAVGSLD FTGDMPIILG PDGPSLGGFV CPAVVARDEL WKIGQLKPGD KVRFVPLPRD DDPVAGPTVL AGPRELGSAI VAGRDDGAMP VVYRRAGDDN LLVEYGPMEL DIALRLRVQL LADAVAAAKL PGLIDLTPGI RSLQIHYDGA TLSRRKLLDA LAAIEGELPA VDAMRVPSRV VHLPLSWNDP QAVKAMHKYQ ELVRPDAPWC PSNIDFIRRI NGLDDEAAVQ RIVFDASYLV LGLGDVYLGA PVATPVDPRH RLVTTKYNPA RTWTPENAVG IGGAYMCIYG MEGPGGYQLF GRTIQMWNSW RSTPEFTPGH PWLLRFFDQI RFFPVSASEL LEAREAFPHG QYPLRIEETV FSYADYAKGL ARDQDSIAAF KQRQQAAFEA ERQRWKQLRL DAVQDDESAG AEAAPDDIPD GATGVFSEVP GNVWKILVDE GAMVAAGDTL AIIESMKMEI SVPAPVAGRL ASIRIKPGQT LRAGDVVAVI AEG
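The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any analysis. The following is a minimal sketch of how the gene sequence could be normalized and sanity-checked. The helper names are hypothetical, and the stop codon set is the standard one for translation table 11 (TAA, TAG, TGA). The listed gene sequence begins with ATG and ends with TGA, which suggests it is given in coding orientation even though the gene lies on the minus strand; that reading is an assumption.

```python
# Stop codons used by translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text):
    """Remove the spaces and line breaks that group the sequence in tens."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in a DNA string (0.0 to 1.0)."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def looks_like_complete_cds(dna):
    """True if the length is a multiple of 3 and the last codon is a stop."""
    dna = clean_sequence(dna)
    return len(dna) >= 6 and len(dna) % 3 == 0 and dna[-3:] in STOP_CODONS


if __name__ == "__main__":
    # Toy input in the same ten-character block layout; paste the full
    # "Gene sequence" field here to check it against the GC content field.
    toy = "ATGTTCGACA AGGTCTTGA"
    print(len(clean_sequence(toy)), round(gc_fraction(toy), 3))
```

Pasting the full gene sequence into `gc_fraction` should give a value close to the GC content reported in the record, provided the record rounds to a whole percent.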