Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_3804 |
Symbol | |
ID | 4024320 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 4244391 |
End bp | 4246250 |
Gene Length | 1860 bp |
Protein Length | 619 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637964008 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_570926 |
Protein GI | 91978267 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGTCT ATCGATCCCG AACGACCACT CACGGCCGCA ACATGGCCGG CGCGCGCGGC CTGTGGCGCG CCACCGGCAT GAAGGATTCC GATTTCGGCA AGCCGATCAT CGCCGTCGTC AACTCGTTCA CGCAGTTCGT GCCGGGCCAC GTTCATCTGA AGGACCTCGG GCAGCTCGTT GCGCGCGAGA TCGAGGCCGC CGGCGGTGTC GCCAAGGAGT TCAACACCAT CGCGGTCGAC GATGGCATCG CGATGGGGCA CGGCGGAATG CTGTACAGCC TGCCGTCGCG CGAACTGATC GCCGACAGCG TCGAATACAT GGTCAACGCC CACTGCGCCG ACGCCATGGT TTGCATTTCG AACTGTGACA AGATCACCCC CGGCATGCTG ATGGCCGCGA TGCGGCTGAA CATCCCCGCG GTGTTCGTCT CCGGCGGCCC GATGGAAGCC GGCAAGGTGG TGCTGAATGG CAAGACACAC GCCGTCGACC TGATCGACGC CATGGTCGCG GCCGCCGACA GCAATATGAG CGATGCCGAT GTGCAGGTGA TGGAGCGCTC GGCGTGCCCG ACCTGCGGCT CGTGTTCGGG CATGTTCACC GCCAATTCGA TGAACTGCCT CGCCGAGGCG CTGGGTCTCG CGCTGCCCGG CAATGGCTCG GTGCTCGCCA CCCATGCCGA TCGCAAGCGG CTGTTCGTCG AGGCCGGTCA CACCATCGTC GATCTGGCGC GGCGTTACTA CGAAGGTGAC GACGAATCCG TGCTGCCGCG CAACGTCGCC AGCTTCAAAG CGTTCGAGAA CGCGATGACG CTCGACATCG CGATGGGTGG CTCGACCAAT ACGGTGCTGC ATCTGCTCGC CGCGGCGCGC GAGGCCGAAC TCGACTTCTC GATGAAGGAC ATCGACCGGC TGTCGCGCAA GGTGCCGTGC CTGAGCAAGA TCGCCCCGTC GGTGTCCGAC GTTCACATGG AGGACGTGCA TCGCGCCGGC GGCATCATGG CGATCCTCGG CGAGCTCGAT CGCGCCGGGC TGATCCACAA TTCCTGCCCG ACGGTGCATT CCGAGACGCT CGGTGCCGCA CTGGCGCGTT GGGACATCCG CCAGAGCAAC AGCGAAGCGG TCCGCACCTT CTACCGCGCC GCGCCGGGCG GCGTGCCGAC CCAGGTCGCG TTCAGCCAGG ACCGCCGCTA CGACGAGCTC GACCTCGACC GGCAGAAGGG CGTGATCCGC GACGCGGAGC ATGCCTTCAG CAAGGACGGC GGTCTCGCCG TGCTGTACGG CAACATCGCC GAAGACGGCT GCATCGTGAA GACCGCGGGC GTCGACGCCT CGATCCTGAC CTTCTCCGGT CCGGCGAAAG TGTTCGAGAG TCAGGACGAT GCGGTGTCGG CGATCCTCGG CAACAAGATT GTCGCCGGCG ACGTCATCGT CATCCGCTAC GAAGGACCGC GCGGCGGACC GGGCATGCAG GAGATGCTGT ATCCGACCAG CTATCTGAAG TCGAAAGGCC TCGGCAAGGC ATGCGCCTTG ATCACCGATG GCCGTTTTTC AGGCGGCACC TCGGGGCTTT CGATCGGTCA CGTTTCGCCG GAAGCGGCCG AAGGCGGACT GATCGGTCTG GTCCGCGATG GCGATCGCGT CGCGATCGAC ATCCCCAACC GCAGCATCAA CCTCGACGTT TCCGCCGACG AATTGGCGCG ACGCAGCGAA GAGGAGCAGG CGCGCGGCGA CAAGGCTTGG CAGCCGAAGG ACCGCAACCG CGTGGTCTCT GCTGCACTGC AGGCCTACGC TGCGCTGACC ACAAGCGCGG CGAACGGCGC AGTACGCGAC GTCAACCGCC GGCTGGGAAA AGGCAAGTAA
|
Protein sequence | MPVYRSRTTT HGRNMAGARG LWRATGMKDS DFGKPIIAVV NSFTQFVPGH VHLKDLGQLV AREIEAAGGV AKEFNTIAVD DGIAMGHGGM LYSLPSRELI ADSVEYMVNA HCADAMVCIS NCDKITPGML MAAMRLNIPA VFVSGGPMEA GKVVLNGKTH AVDLIDAMVA AADSNMSDAD VQVMERSACP TCGSCSGMFT ANSMNCLAEA LGLALPGNGS VLATHADRKR LFVEAGHTIV DLARRYYEGD DESVLPRNVA SFKAFENAMT LDIAMGGSTN TVLHLLAAAR EAELDFSMKD IDRLSRKVPC LSKIAPSVSD VHMEDVHRAG GIMAILGELD RAGLIHNSCP TVHSETLGAA LARWDIRQSN SEAVRTFYRA APGGVPTQVA FSQDRRYDEL DLDRQKGVIR DAEHAFSKDG GLAVLYGNIA EDGCIVKTAG VDASILTFSG PAKVFESQDD AVSAILGNKI VAGDVIVIRY EGPRGGPGMQ EMLYPTSYLK SKGLGKACAL ITDGRFSGGT SGLSIGHVSP EAAEGGLIGL VRDGDRVAID IPNRSINLDV SADELARRSE EEQARGDKAW QPKDRNRVVS AALQAYAALT TSAANGAVRD VNRRLGKGK
|
| |