Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_4066 |
Symbol | |
ID | 3911873 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4638581 |
End bp | 4640440 |
Gene Length | 1860 bp |
Protein Length | 619 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637885970 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_487670 |
Protein GI | 86751174 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.405184 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGCAT ATCGATCCCG AACGACCACT CACGGCCGCA ACATGGCCGG CGCGCGCGGC CTCTGGCGCG CCACCGGGAT GAAGGATTCC GACTTCGGCA AGCCGATCAT CGCCGTCGTC AACTCCTTCA CGCAGTTCGT GCCGGGGCAC GTGCATCTGA AGGATCTCGG CCAGCTCGTC GCCCGGGAGA TCGAGGCCGC CGGCGGCGTC GCCAAGGAAT TCAACACCAT CGCGGTCGAC GACGGCATCG CGATGGGCCA TGACGGCATG CTGTACAGCC TGCCGTCGCG CGAACTGATC GCCGACAGCG TCGAATACAT GGTCAACGCG CACTGCGCCG ACGCGATGGT CTGCATCTCG AACTGCGACA AGATCACCCC CGGCATGCTG ATGGCCGCGA TGCGGCTCAA CATCCCAGCG GTGTTCGTCT CCGGCGGGCC GATGGAAGCG GGCAAAGTGG TGTTGAAGGG CAAGACCCAC GCCGTCGACC TGATCGACGC GATGGTCGCG GCCGCCGACA GCAGCATGAG CGACGAAGAC GTGCAGACGA TGGAGCGCTC GGCGTGCCCG ACCTGCGGCT CCTGCTCCGG CATGTTCACC GCCAATTCGA TGAACTGTCT CGCCGAGGCG CTGGGTCTGG CGCTGCCCGG CAACGGCTCG GTGCTCGCCA CCCATGCCGA TCGCAAGCGG CTGTTCGTCG AGGCCGGTCA CACCATCGTC GATCTGGCGC GGCGCTACTA CGAAGGCGAC GACGAATCCG TGCTGCCGCG CAAGGTCGCG AGCTTCGAGG CGTTCGAGAA CGCGATGACG CTCGACATCG CGATGGGCGG CTCGACCAAC ACGGTGCTGC ATCTGCTCGC CGCGGCGCGC GAGGCCGAAC TCGACTTCTC GATGAAGGAC ATCGACCGGC TGTCGCGCAA GGTGCCGTGC CTGAGCAAGA TCGCCCCGTC GGTGTCCGAC GTCCACATGG AGGACGTGCA TCGCGCCGGC GGCATCATGG CGATCCTCGG CGAGCTCGAT CGCGCCGGAC TGATCCACAA CTCATGCCCG ACTGTGCATT CGGAGACGCT CGGTGCCGCG CTGGCGCGCT GGGACATCCG CCAGAGCAAC AGCGAAGCGG TCCGCACCTT CTACCGCGCC GCGCCGGGCG GCGTGCCGAC CCAGGTTGCG TTCAGCCAGG ACCGCCGCTA CGACGAGCTC GACCTCGACC GGCAGAAGGG CGTGATCCGC GACGCCGAGC ACGCCTTCAG CAAGGACGGC GGCCTCGCGG TGCTGTACGG CAATATTGCG CTCGACGGCT GCATCGTGAA GACCGCCGGC GTCGACGCCT CGATCCTGAC CTTCTCCGGC CCGGCGAAAG TGTTCGAGAG CCAGGACGAC GCGGTGTCGG CGATCCTCGG CAACAAGATC GTCGCTGGCG ACGTCATCGT GATCCGCTAC GAAGGGCCAC GTGGCGGGCC GGGCATGCAG GAGATGCTGT ATCCGACCAG CTATCTGAAG TCGAAGGGCC TCGGCAAAGC CTGCGCGCTG ATCACCGACG GCCGCTTCTC CGGCGGCACC TCGGGCCTGT CGATCGGTCA CGTCTCGCCC GAGGCTGCGG AAGGCGGCCT GATCGGACTG GTGCGGAACG GCGACCGGAT TTCGATCGAC ATTCCCAATC GCGGCATCAC CCTTGACGTC GCCGCTGACG AGCTGTCGCG GCGCGCCGAG GAGGAAGAGG CGAAGGGCGA CAAGGCCTGG CAGCCGAAAG ACCGCAAGCG CAAGGTCTCG GCCGCGCTGC AGGCCTATGC CATGCTGACC ACCAGCGCTG CGAACGGCGC GGTGCGCGAC GTCAACCGCA GGCTCGGCAA AGGAAAGTAG
|
Protein sequence | MPAYRSRTTT HGRNMAGARG LWRATGMKDS DFGKPIIAVV NSFTQFVPGH VHLKDLGQLV AREIEAAGGV AKEFNTIAVD DGIAMGHDGM LYSLPSRELI ADSVEYMVNA HCADAMVCIS NCDKITPGML MAAMRLNIPA VFVSGGPMEA GKVVLKGKTH AVDLIDAMVA AADSSMSDED VQTMERSACP TCGSCSGMFT ANSMNCLAEA LGLALPGNGS VLATHADRKR LFVEAGHTIV DLARRYYEGD DESVLPRKVA SFEAFENAMT LDIAMGGSTN TVLHLLAAAR EAELDFSMKD IDRLSRKVPC LSKIAPSVSD VHMEDVHRAG GIMAILGELD RAGLIHNSCP TVHSETLGAA LARWDIRQSN SEAVRTFYRA APGGVPTQVA FSQDRRYDEL DLDRQKGVIR DAEHAFSKDG GLAVLYGNIA LDGCIVKTAG VDASILTFSG PAKVFESQDD AVSAILGNKI VAGDVIVIRY EGPRGGPGMQ EMLYPTSYLK SKGLGKACAL ITDGRFSGGT SGLSIGHVSP EAAEGGLIGL VRNGDRISID IPNRGITLDV AADELSRRAE EEEAKGDKAW QPKDRKRKVS AALQAYAMLT TSAANGAVRD VNRRLGKGK
|
| |