Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3247 |
Symbol | |
ID | 3911048 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3709231 |
End bp | 3710940 |
Gene Length | 1710 bp |
Protein Length | 569 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637885149 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_486854 |
Protein GI | 86750358 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.652611 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGACG GATTGCGCAA GGGGCTGACC AGCTACGGCG ACGCCGGTTT TTCGCTGTTC CTGCGCAAGG CGTTCATCAA GGCGATGGGC TATTCCGACG ACGCGCTGGA GCGGCCGATC GTCGGCATCA CCAATACCCA CAGCGATTAC AATCCGTGCC ACGGCAACGT GCCGCAGATC ATCGAGGCGG TGAAGCGCGG CGTGATGCTG GCGGGCGCGA TGCCGATGGT GTTTCCGACC ATCTCGATCG CCGAGAGCTT CGCGCATCCG ACCTCGATGT ATCTGCGCAA TCTGATGGCG ATGGACACCG AGGAGATGAT CCGCGCCCAG CCGATGGATG CGGTGGTAGT GATCGGCGGC TGCGACAAGA CGCTGCCGGC GCAGATCATG GCGGCGGTGT CGGCGGATCT GCCGACGGTG GTGATCCCGG TCGGGCCGAT GGTGGTCGGC CATCACAAGG GCGAAGTGCT GGGTGCCTGC ACCGACTGCC GGCGGTTGTG GGCGAAGCAT CGCGCCGGTG AGATCGACGA GGCGGAGATC GAGGCCGTCA ACGGCCGGCT GGCGCCGTCG GTCGGCACCT GCATGGTGAT GGGCACCGCC TCGACGATGG CGTGTCTCAC CGAGGCGATG GGCCTGTCGC TGCCGATGAG CGCGACGATC CCGGCGCCGC ATGCCGAGCG GTTTCGCTCG GCGGAAGAAA GCGGCAGGGT CGCGGCTGCG ATGGCCAAGG CGAAAGGCCC GAAGCCGAGC GATCTGCTGA CCCCCGCCGC GTTCCGCAAC GCGCAAGTCG TGCTGCAGGC GATCGGCGGC TCGACCAACG GACTGATTCA TCTCACCGCG ATCGCCGGCC GCGTGCCGCA TAAGATCGAC CTCGACGGTT TCGACCGGAT CGGCCGCGAC GTGCCGGTGC TGGTCGATCT GAAGCCGTCG GGCGATCACT ACATGGAGCA TTTTCATCAC GCCGGCGGCG TGCCGAAGCT GATGGCGCAG CTCGGCGAAC TGATCGATCT CGACGCGCGG ACGATCACCG GCGCGCCGCT GCGCGACATC GTCGCCAGGG CCGAACACGT GCAGGGCCAG GACGTGATCC GCTCGCGCGA CAATCCGATC CGGCGCGAGG GCGGGCTCGC GATGCTCACC GGCAATCTGG CGCCGCGCGG CGCGGTGATC AAACACGCCG CCGCGTCGCC GCAACTGATG CAGCACACCG GCCGCGCCGT GGTGTTCGAC TCGGTCGAGG ACATGACGCT GCGGATCGAC GATCCCGATC TCGACGTTGC GGCCGACGAC GTGCTGGTGC TGCGCAATGC CGGGCCGCGC GGCGCGCCGG GGATGCCGGA GGCGGGCTAT CTGCCGATCC CGATGAAGCT GGCGCGGGCG GGCATCAAAG ACATGGTGCG CATTTCGGAC GCGCGGATGA GCGGCACCGC GTTCGGTACC ATCGTGCTGC ACATCACGCC AGAGAGCGCG GATGGTGGGC CGCTGGCGCT GGTCGAAACC GGCGACCGGA TCGCGCTGGA TGTCGCGGCG CGGCGGATCG ATCTGTTGGT TGACGAAAGC GAACTCGCGC GCCGCCGTGC CGCATTGTCG TCGTCAGCCG CGGCGCGGCC GACGCGCGGC TATGCGCAAC TGTTTCACGA CACCATCCTG CAGGCCGACG AGGGCTGCGA TTTCGATTTT CTCACCGCAG CCGGGCGCAG CGAGCGTTGA
|
Protein sequence | MADGLRKGLT SYGDAGFSLF LRKAFIKAMG YSDDALERPI VGITNTHSDY NPCHGNVPQI IEAVKRGVML AGAMPMVFPT ISIAESFAHP TSMYLRNLMA MDTEEMIRAQ PMDAVVVIGG CDKTLPAQIM AAVSADLPTV VIPVGPMVVG HHKGEVLGAC TDCRRLWAKH RAGEIDEAEI EAVNGRLAPS VGTCMVMGTA STMACLTEAM GLSLPMSATI PAPHAERFRS AEESGRVAAA MAKAKGPKPS DLLTPAAFRN AQVVLQAIGG STNGLIHLTA IAGRVPHKID LDGFDRIGRD VPVLVDLKPS GDHYMEHFHH AGGVPKLMAQ LGELIDLDAR TITGAPLRDI VARAEHVQGQ DVIRSRDNPI RREGGLAMLT GNLAPRGAVI KHAAASPQLM QHTGRAVVFD SVEDMTLRID DPDLDVAADD VLVLRNAGPR GAPGMPEAGY LPIPMKLARA GIKDMVRISD ARMSGTAFGT IVLHITPESA DGGPLALVET GDRIALDVAA RRIDLLVDES ELARRRAALS SSAAARPTRG YAQLFHDTIL QADEGCDFDF LTAAGRSER
|
| |