Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_3191 |
Symbol | |
ID | 3972202 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 3531089 |
End bp | 3532789 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637926301 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_533052 |
Protein GI | 90424682 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.488817 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGATG GACTGCGCAA GGGCCTGACC TCTTACGGCG ACGCCGGGTT CTCGCTGTTC CTGCGCAAGG CCTTCATCAA GGCGATGGGC TATTCCGACG ACGCGCTGGA TCGGCCGATC GTCGGCATCA CCAACACCCA CAGCGACTAC AATCCCTGCC ACGGCAACGT GCCGCAGATC ATCGAGGCGG TGAAGCGCGG CGTGATGCTG GCGGGCGCGA TGCCGATGGT GTTTCCAACT ATCTCGATCG CCGAGAGTTT CGCGCATCCG ACCTCGATGT ATCTGCGCAA CCTGATGGCG ATGGACACCG AGGAGATGAT TCGCGCCCAG CCGATGGACG CGGTGGTGGT GATCGGCGGC TGCGACAAGA CGCTGCCGGC GCAGATCATG GCCGCGGTGT CGGCGGATCT GCCGACCGTG GTGATTCCGG TCGGGCCGAT GGTGGTCGGC CATCACAAGG GCGAGGTGCT GGGCGCTTGC ACCGACTGCC GCAGGTTGTG GGGCAAGTTT CGCGCCGGCG AGATGGATGA GGCCGAGATC GAGGCGGTCA ACGGCCGGCT GGCGCCCTCG GTCGGCACCT GCATGGTGAT GGGCACCGCC TCGACCATGG CCTGCATCAC CGAAGCGCTC GGGCTGTCGC TGCCGATGAG CGCGACGATT CCGGCGCCGC ACGCCGAGCG ATTTCGTTCC GCCGAACAAA GCGGCAAGCT CGCCGCGGCA ATGGCGGTGG CGAAGGGGCC GAAGCCCAGC GAGCTGTTGA CGCCGGCGGC ATTGCGCAAT GCGCAAGTGG TGCTGCAGGC GATCGGCGGC TCCACCAACG GGCTCATTCA TCTCACCGCG ATCGCCGGGC GAACGACGTA TCGCCTCGAT CTGGCGGCGT TCGATCGGCT GTCGCGCGAG GTGCCGGTGC TGGTCGATCT GAAGCCGTCG GGCGATCACT ACATGGAGCA CTTCCATCAC GCCGGCGGCG TGCCGAAACT GTTGGCGCAA CTCGGCGAGC TGATCGATCT CGACGCCAAA ACGATTTACG GCAGCTTGCG CGATGCGGTG GCCGCGGCCG AGGACGTGCC GGGGCAGGAC GTCATTCGCG CGCGCAACGA TCCGATCCGC AGCGAAGGCG CGATGGCGGT GCTATCCGGC AATCTGGCGC CGCGCGGCGC GGTGATCAAG CACTCGGCGG CGTCGCCAAA GCTCCTTCAG CACAGCGGCC GCGCCGTGGT GTTCGACAGT CTCGAGGACA TGGCGGCGCG GATCGACGAT CCGGGTCTCG ACGTTGCGGC CGATGACGTG CTGGTGCTGC GCAATGCCGG GCCGCAGGGC GCGCCGGGGA TGCCGGAGGC CGGCTATCTG CCGATTCCGC TGAAGCTGGC GCGCGCCGGC GTCAAGGACA TGGTGCGGAT TTCCGACGCC CGGATGAGCG GCACCGCGTT CGGCACCATC GTGCTGCACA TCACGCCGGA GAGCGCGGCC GGCGGGCCTT TGGCTTTGGT GCAAAACGGC GACGTGATCC GGCTCGACGT CGAAGCGCGC CGCATCGATC TGATGGTCGA GGACGATGAG TTAGCACGCC GCCGCCAAGC GCTGCCGGCG TCGCGGCAGC CCGCGCCGCT GCGCGGCTAT GCGCGGTTGT TCCACCAGAC GATCCTGCAG GCCGATCAAG GCTGCGATTT CGATTTTCTG ACCGGGCAGG GCGGGGATTA A
|
Protein sequence | MSDGLRKGLT SYGDAGFSLF LRKAFIKAMG YSDDALDRPI VGITNTHSDY NPCHGNVPQI IEAVKRGVML AGAMPMVFPT ISIAESFAHP TSMYLRNLMA MDTEEMIRAQ PMDAVVVIGG CDKTLPAQIM AAVSADLPTV VIPVGPMVVG HHKGEVLGAC TDCRRLWGKF RAGEMDEAEI EAVNGRLAPS VGTCMVMGTA STMACITEAL GLSLPMSATI PAPHAERFRS AEQSGKLAAA MAVAKGPKPS ELLTPAALRN AQVVLQAIGG STNGLIHLTA IAGRTTYRLD LAAFDRLSRE VPVLVDLKPS GDHYMEHFHH AGGVPKLLAQ LGELIDLDAK TIYGSLRDAV AAAEDVPGQD VIRARNDPIR SEGAMAVLSG NLAPRGAVIK HSAASPKLLQ HSGRAVVFDS LEDMAARIDD PGLDVAADDV LVLRNAGPQG APGMPEAGYL PIPLKLARAG VKDMVRISDA RMSGTAFGTI VLHITPESAA GGPLALVQNG DVIRLDVEAR RIDLMVEDDE LARRRQALPA SRQPAPLRGY ARLFHQTILQ ADQGCDFDFL TGQGGD
|
| |