Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_3648 |
Symbol | |
ID | 3972019 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 4057774 |
End bp | 4059573 |
Gene Length | 1800 bp |
Protein Length | 599 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637926757 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_533502 |
Protein GI | 90425132 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.458648 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACAAGC CGGTGACCGG GCGCAAGCTG CGCTCCAGCG AATGGTTCAA TGACCCTCAC AATCCGGCGA TGACCGCGCT CTATCTGGAG CGCTATCTGA ACTACGGACT GACACGCAAA GAGCTGCAGG CCGGCAAGCC GATCATCGGC ATCGCGCAGA CCGGCAACGA TTTGTCGCCG TGCAACAGGC ATCACCTGGA ACTGGCGCAG CGGGTGCGCG AGGGCATCCG CGAGGCCGGC GGCATCGCGA TGGAATTCCC GATGCACCCG ATCCAGGAAA CCGGCAAGCG GCCGACCGCG GCGCTCGACC GCAACCTGGC CTATCTCGGG CTGGTCGAAA TCCTGTTCAG CTACCCGCTC GACGGCGTGG TGCTGACCAC CGGCTGCGAC AAGACCACCC CGGCCTGCCT GATGGCGGCG GCGACCGTCA ACCTGCCGGC GATCGTACTG TCCGGCGGGC CGATGCTGAA CGGCTGGCAC GAGGGCGAGC GCACCGGCTC CGGCACGGTG ATCTGGAAAT CCCGCGAGCG GATGGCCGCG GGCGAGATCG ACTACGAGGA ATTCATGGAC ATCGTCGCCT CCTCGGCGCC CTCGGTCGGC CATTGCAACA CCATGGGCAC GGCGTCGACG ATGAATGCGC TGGCGGAAGC GCTGGGGATG TCGCTGCCCG GCTGTGCCGC GATCCCGGCG CCCTATCGCG AACGCGGCCA GATCGCTTAT CAGACCGGTT TGCGCGCGGT GCAAATGGTC TGGGAAGATC TCAAGCCCTC CGACATTCTC ACCAGGCAAG CCTTCGAGAA TGCCATCGTG GTGAACTCAG CGATTGGCGG CTCCACCAAC GCGCCGATCC ATCTCAACGC GCTGGCCCGC CATATCGGCG TGGAGCTCTC GATCGACGAC TGGCAGAGCG TCGGCCACAA GATCCCGCTG CTGGTCAACA TGCAGCCGGC GGGCTTCTAT CTCGGCGAGG AATTCCATCG CGCCGGCGGC GTGCCGGCCG TGGTGCGCGA ACTCATGAAG CACGGCAAGA TCCACAAGGA CGCGCTGACG GTGAATGGCC GCGGCATCGG CGTGAACTGC GCCAATGCGC CGTTGCCCGA CGGCGAGGTG ATCAAGACTT ACGACGGCCC GCTGGTGCAG GACGCCGGCT TCCTGGTGTT GCGCGGCAAC CTGTTCGATT CGGCGATCAT GAAGACCAGC GTGATCTCGC TGGAATTCCG CGAGCGCTAT CTGGCGACGC CGGGCGATCT CAACGCCTTC GAGGGCCGCG CCATCGTGTT CGAAGGCCCG GAGGACTATC ATGCCCGGAT CGACGACGAA GCGCTCGAGG TCGACGAGCA CTGCATCCTG TTCGTACGCG GCACCGGGCC GATCGGCTAT CCGGGCGGCG CCGAGGTGGT CAACATGCAG CCGCCGGCGG CCTTGATCAA ACGCGGCATC CACTCGCTGC CCTGCATCGG CGACGGACGG CAATCCGGCA CGTCCGGCTC GCCGTCGATC CTCAACGCTA CGCCGGAAGC CGCCGCCGAT GGCGGCCTCG CCATCCTGCG CACCGGCGAC AAGGTGCGCA TCGACCTCAA CCTCGGCAGC GCCAATATCC TGATCTCGGA TGAGGAGCTG GCGCAACGCC GCGCCGAGCT GAAAGCCCAT GGCGGATTCA AATATCCGGC GCACCAGACG CCGTGGCAGG AATTGTATCG CGCAACGGTC GGCCAACAGG CCACCGGCGC CTGCCTTGAG CTTGCGACGC GCTATCACGA CATCGCAGGC AAAGTCGGCG TCGCGAGACA TAATCATTAG
|
Protein sequence | MDKPVTGRKL RSSEWFNDPH NPAMTALYLE RYLNYGLTRK ELQAGKPIIG IAQTGNDLSP CNRHHLELAQ RVREGIREAG GIAMEFPMHP IQETGKRPTA ALDRNLAYLG LVEILFSYPL DGVVLTTGCD KTTPACLMAA ATVNLPAIVL SGGPMLNGWH EGERTGSGTV IWKSRERMAA GEIDYEEFMD IVASSAPSVG HCNTMGTAST MNALAEALGM SLPGCAAIPA PYRERGQIAY QTGLRAVQMV WEDLKPSDIL TRQAFENAIV VNSAIGGSTN APIHLNALAR HIGVELSIDD WQSVGHKIPL LVNMQPAGFY LGEEFHRAGG VPAVVRELMK HGKIHKDALT VNGRGIGVNC ANAPLPDGEV IKTYDGPLVQ DAGFLVLRGN LFDSAIMKTS VISLEFRERY LATPGDLNAF EGRAIVFEGP EDYHARIDDE ALEVDEHCIL FVRGTGPIGY PGGAEVVNMQ PPAALIKRGI HSLPCIGDGR QSGTSGSPSI LNATPEAAAD GGLAILRTGD KVRIDLNLGS ANILISDEEL AQRRAELKAH GGFKYPAHQT PWQELYRATV GQQATGACLE LATRYHDIAG KVGVARHNH
|
| |