Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_4321 |
Symbol | |
ID | 3971509 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 4819587 |
End bp | 4821425 |
Gene Length | 1839 bp |
Protein Length | 612 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637927430 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_534163 |
Protein GI | 90425793 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.103538 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.169432 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGCCT ATCGCTCCAG GACCACAACC CATGGCCGCA ATATGGCGGG TGCTCGCGGC TTGTGGCGTG CCACCGGCAT GAAGAACGAG GATTTCGGCA AGCCGATCAT CGCGGTGGTG AATTCCTTCA CCCAGTTCGT GCCCGGCCAC GTGCATCTGA AGGACCTCGG CCAATTGGTC GCCCGCGAGA TCGAGAACGC CGGCGGCGTC GCCAAGGAAT TCAACACCAT CGCGATCGAT GACGGCATCG CGATGGGCCA TGACGGCATG CTGTATTCGC TGCCGTCGCG CGAATTGATC GCCGACAGCG TCGAATACAT GGTCAACGGC CATTGCGCCG ACGCCATGGT GTGCATCTCG AATTGCGACA AGATCACCCC CGGCATGTTG ATGGCCTCGC TGCGGCTCAA CATTCCGACC ATCTTCGTTT CCGGCGGCCC GATGGAAGCC GGCAAGGTCA CGGTCGGCGG CAAGAAGCGC GCGGTCGACC TGATCGACGC CATGGTGGCG GCGGCCGATG ACCGGGTCAG CGACGCCGAC GTCGAGGCGA TCGAGCGCTC CGCCTGTCCG ACCTGCGGCT CCTGCTCCGG CATGTTCACC GCCAATTCGA TGAACTGCCT GACCGAGGCG CTCGGGCTGG CGCTGCCCGG CAACGGCTCG GTGCTCGCCA CCCACGCCGA CCGCAAGGCG CTGTTCGTCG AGGCCGGCCA CCTGATCGTC GATCTGGCCC GGCGTTACTA CGAGCAGGAC GACGAGACGG CGCTGCCGCG CAACATCGCC AGCTTCAAGG CGTTCGAGAA CGCCATGACG CTCGACATCG CGATGGGCGG CTCGACCAAC ACCGTGCTGC ATCTGTTGGC CGCGGCCTAT GAGGGCGAGA TCCCCTTCAC CATGCAGGAC ATCGACCGGC TGTCGCGCCG GGTGCCGGTG CTGTGCAAGG TGGCGCCGGC GGTGGCCGAC GTTCACGTCG AGGACGTGCA TCGCGCCGGC GGCGTGATGG GCATTCTCGG CGAACTCGAC CGCGCCCGGC TGATCAACGC CGAACTGCCG ACCGTGCACT CGACCTCGCT CGGCGAAGCG CTGAACCGCT GGGACGTGAT GCGCACCCAG AGCGACAGCG TGCGCAAGTT CTACAAGGCG GCGCCCGGCG GCGTGCCGAC GCAGGTCGCG TTCAGCCAGG AGCAGCGCTA TGACGACGTC GATACCGACC GCGCCAAAGG CTGCATCCGC GACGCCGAGC ACGCCTTCTC CAAGGACGGC GGCCTGGCGG TGCTGTCCGG CAACCTCGCG ATCGACGGCT GCATCGTCAA GACCGCCGGC GTCGACGCCA GCATCCTGAC CTTCCAGGGC CCGGCGCGGG TGTTCGAGAG CCAGGACGCC GCGGTCGAAG GCATTCTCGG CGGCAAGATC ACAGCCGGCG ATATCGTGGT GATCCGCTAT GAGGGGCCGC GCGGCGGCCC CGGCATGCAG GAAATGCTGT ATCCGACCAG CTACCTGAAA TCGAAAGGCT TGGGCAAAGC CTGCGCGCTG ATCACCGACG GCCGGTTCTC CGGCGGCTCC TCGGGGCTGT CGATCGGCCA CGTCTCGCCG GAAGCCGCCG AGGGTGGGCT GATCGGCCTC GTCGAAGAGG GCGACCGCAT CGAGATCGAC ATTCCGCAGC GCTCGATTCG GCTCGCGGTC GACGACGCCG TGCTGGCGGA ACGCCGCGTC GCCATGCTGG CGCGCAAGGA TGCTTGGAAG CCCGGCAAGC GCAGCCGCAA GGTCACCTCG GCGCTGAAGG CCTACGCCGC GATGACCACC AGCGCCGCCC GCGGCGCGGT GCGGGTGGTG AAGGATTAA
|
Protein sequence | MPAYRSRTTT HGRNMAGARG LWRATGMKNE DFGKPIIAVV NSFTQFVPGH VHLKDLGQLV AREIENAGGV AKEFNTIAID DGIAMGHDGM LYSLPSRELI ADSVEYMVNG HCADAMVCIS NCDKITPGML MASLRLNIPT IFVSGGPMEA GKVTVGGKKR AVDLIDAMVA AADDRVSDAD VEAIERSACP TCGSCSGMFT ANSMNCLTEA LGLALPGNGS VLATHADRKA LFVEAGHLIV DLARRYYEQD DETALPRNIA SFKAFENAMT LDIAMGGSTN TVLHLLAAAY EGEIPFTMQD IDRLSRRVPV LCKVAPAVAD VHVEDVHRAG GVMGILGELD RARLINAELP TVHSTSLGEA LNRWDVMRTQ SDSVRKFYKA APGGVPTQVA FSQEQRYDDV DTDRAKGCIR DAEHAFSKDG GLAVLSGNLA IDGCIVKTAG VDASILTFQG PARVFESQDA AVEGILGGKI TAGDIVVIRY EGPRGGPGMQ EMLYPTSYLK SKGLGKACAL ITDGRFSGGS SGLSIGHVSP EAAEGGLIGL VEEGDRIEID IPQRSIRLAV DDAVLAERRV AMLARKDAWK PGKRSRKVTS ALKAYAAMTT SAARGAVRVV KD
|
| |