Gene Information |
Locus tag | Rpal_1648 |
Symbol | |
ID | 6409305 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 1765529 |
End bp | 1767385 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642711537 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_001990652 |
Protein GI | 192290047 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
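The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the constant and function names are illustrative and not part of the record.

```python
# Values copied from the record above; the names are illustrative only.
START_BP = 1765529
END_BP = 1767385
GENE_LENGTH_BP = 1857
PROTEIN_LENGTH_AA = 618


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Residue count for a CDS whose length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1857
print(protein_length(GENE_LENGTH_BP))  # 618
```

Under these assumptions the span matches the reported gene length, and 1857 bp corresponds to 619 codons, that is, 618 residues plus the stop codon.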
| |
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTGCTT ATCGCTCGAG AACGACGACC CACGGCCGCA ACATGGCTGG CGCGCGCGGC CTGTGGCGCG CCACCGGTAT GAAGGATTCC GACTTCGGCA AGCCGATCAT CGCGGTGGTC AACTCGTTCA CCCAGTTCGT GCCCGGCCAC GTCCATCTGA AGGACCTCGG CCAGCTTGTT GCGCGTGAGA TCGAAGCGGC CGGCGGCGTT GCCAAGGAGT TCAACACCAT CGCGGTCGAT GACGGCATCG CGATGGGCCA CGACGGCATG CTGTATTCGC TGCCGTCGCG TGAACTGATC GCCGACAGCG TCGAGTACAT GGTCAATGCC CATTGCGCCG ACGCGATGGT GTGCATCTCC AACTGCGACA AGATCACGCC CGGCATGCTG ATGGCGGCGA TGCGGCTTAA CATTCCGGCC GTGTTCGTCT CCGGCGGCCC GATGGAAGCT GGCAAGGTGG TGCTGAAGGG TAAGACACAC GCGGTCGACC TGATCGACGC GATGGTCGCC GCCGCCGACA GCGCCATGAG CGACGAAGAC GTGCAGACGA TGGAACGCTC GGCATGTCCG ACCTGCGGCT CGTGCTCGGG CATGTTCACC GCCAATTCGA TGAACTGCCT CACAGAAGCG CTCGGCCTGT CGTTGCCGGG CAACGGCTCG GTGCTCGCCA CCCATTCCGA TCGCAAGCGG CTGTTCGTCG AGGCCGGCCA CACCATCGTC GACCTAGCGC GCCGCTACTA CGAAGGCGAC GATGCCTCGG TGCTGCCGCG CAACATCGCG AACTTCAAGG CGTTCGAGAA CGCGATGACG CTCGACATCG CGATGGGCGG CTCAACCAAC ACCGTGCTGC ATCTGCTGGC CGCGGCGCGC GAAGCCGAAC TCGACTTCTC GATGAAGGAT ATCGATCGGC TGTCGCGCCG GGTGCCGTGC CTCAGCAAGA TCGCTCCGTC GGTGTCCGAC GTGCACATGG AAGACGTGCA TCGCGCCGGC GGCATCATGG CGATCCTCGG CGAGCTCGAC CGCGCCGGGC TGCTCGACAC CTCGTGCACC ACGGTGCATT CCGAAACCCT CGGTGCGGCC TTGGCGCGGT GGGACATCCG CCAGAGCAAC AGCGAGAGCG TCCGCACCTT CTTCCGTGCG GCGCCGGGCG GCGTGCCGTC GCAGACCGCG TTCAGCCAGG ACCGCCGCTA CGACGAGCTC GATCTCGACC GTGAGAAGGG TGTGATCCGC GACGCCGCGC ATGCCTTCAG CAAGGACGGC GGCCTCGCCG TGCTGTACGG CAACATCGCG CTGGACGGCT GCATTGTGAA GACCGCCGGC GTCGACGCCT CGATCCTGAC CTTCTCGGGT CCGGTGAAGG TGTTCGAGAG CCAGGACGAC GCGGTGTCGG CGATCCTGAC CAACAAGATC GTCGCCGGCG ATGTGGTCGT GATCCGCTAC GAAGGTCCGC GCGGTGGTCC GGGCATGCAG GAGATGCTGT ATCCGACCAG CTATCTGAAA TCGAAGGGCC TCGGCAAAGC CTGCGCGCTG ATCACTGACG GCCGGTTCTC GGGCGGCACC TCGGGCCTGT CGATCGGCCA CGTCTCGCCG GAAGCGGCGG AAGGCGGCCT GATCGGCCTG GTACGCGACG GCGACCGGAT CTCGATCGAC ATCCCGAACC GTACCATCAG CCTCGACGTC TCCGAAGCCG AACTCGCCAA GCGCGGCGAG GAAGAGCGGG CGCGCGGCGA AGCGGCGTGG ACGCCCAAGG ACCGCAAGCG CAACGTCTCG GCTGCGCTGC AGGCCTACGC GATGCTGACC 
ACCAGCGCCG CCAACGGCGC GGTGCGCGAT GTGAAGCGTC GCTTCGGTAA GAACTAA
|
Protein sequence | MPAYRSRTTT HGRNMAGARG LWRATGMKDS DFGKPIIAVV NSFTQFVPGH VHLKDLGQLV AREIEAAGGV AKEFNTIAVD DGIAMGHDGM LYSLPSRELI ADSVEYMVNA HCADAMVCIS NCDKITPGML MAAMRLNIPA VFVSGGPMEA GKVVLKGKTH AVDLIDAMVA AADSAMSDED VQTMERSACP TCGSCSGMFT ANSMNCLTEA LGLSLPGNGS VLATHSDRKR LFVEAGHTIV DLARRYYEGD DASVLPRNIA NFKAFENAMT LDIAMGGSTN TVLHLLAAAR EAELDFSMKD IDRLSRRVPC LSKIAPSVSD VHMEDVHRAG GIMAILGELD RAGLLDTSCT TVHSETLGAA LARWDIRQSN SESVRTFFRA APGGVPSQTA FSQDRRYDEL DLDREKGVIR DAAHAFSKDG GLAVLYGNIA LDGCIVKTAG VDASILTFSG PVKVFESQDD AVSAILTNKI VAGDVVVIRY EGPRGGPGMQ EMLYPTSYLK SKGLGKACAL ITDGRFSGGT SGLSIGHVSP EAAEGGLIGL VRDGDRISID IPNRTISLDV SEAELAKRGE EERARGEAAW TPKDRKRNVS AALQAYAMLT TSAANGAVRD VKRRFGKN
|
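The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch and not the method used by the database: `translate` is a hypothetical helper, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ in which start codons they permit, which this sketch ignores). Whitespace in the input, such as the 10-base grouping used above, is stripped before translation.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases and last 27 bases of the gene sequence above.
print(translate("ATGCCTGCTT ATCGCTCGAG AACGACGACC"))           # MPAYRSRTTT
print(translate("GTGAAGCGTC GCTTCGGTAA GAACTAA").rstrip("*"))  # VKRRFGKN
```

Passing the full gene sequence to `translate` and stripping the trailing `*` should yield a string that can be compared directly with the protein sequence once its spaces are removed.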