Gene Information |
Locus tag | Rsph17029_3617 |
Symbol | |
ID | 4898189 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | - |
Start bp | 708458 |
End bp | 710257 |
Gene Length | 1800 bp |
Protein Length | 599 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640114225 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_001045479 |
Protein GI | 126464366 |
COG category | [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
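The length fields in the record follow from its coordinates. Below is a minimal sketch of that arithmetic, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(708458, 710257)
print(length)                  # 1800
print(protein_length(length))  # 599
```

Both results agree with the Gene Length (1800 bp) and Protein Length (599 aa) fields of the record.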
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.886843 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCAAC GCCCCCGGAT CACCCCCGAC CAGCTGCGCT CGCGGCTCTG GTTCAACAAT CCCGACGATC CGGAGATGAC GGCGCTCTAT CTCGAGCGCT ATCTGAACTA CGGCCTGACC CGGGCCGAGC TTCAGGGCGG CAAGCCGATC ATCGGCATCG CCCAGACCGG CAGCGACCTC TCGCCCTGCA ACCGTCACCA CATCGAGCTG GCCAAGCGGG TGCGCGACGG CATCACCGCG GCGGGCGGCA TCCCGATGGA GATCCCGGTC CATCCGATCC AGGAGACGGG CAAGCGCCCG ACGGCGAGCC TCGACCGCAA CCTCGCCTAT CTGGGCCTCG TCGAGGCGCT GCACGGCTAT CCGATCGACG GGGTGGTGCT GACCATCGGC TGCGACAAGA CCACACCCGC GCTGCTGATG GCGGCGGCGA CGGTGAACAT CCCCGCCATC GCCTATTCGG TGGGCCCGAT GCTGAACGGC TGGCACCGGG GCGAGCGCGC GGGCTCGGGC ACCGCGGTCT GGCGGGCGCG CGAGCTTCTG GCAAGGGGCG AGATCGACGA GGAGGGTTTC TACGAGCTCG TGGCCTCCTC GGCCCCCTCG GTCGGCTATT GCAACACGAT GGGCACGGCC TCGACCATGA ACAGTCTGGC CGAGGTGCTC GGCATGCAGC TTCCCGGCTC GGCCGCTATC CCCGCCCCCT ACCGCGAGCG CGGGCAGATG GGGCACGCGA CCGGCCGGCG GATCGTCGAG ATGGTGTGGG AGGATCTGCG CCCCTCCGAC ATCCTGACGC GGGAGGCCTT CGAGAATGCG ATCGTGGCCT GCTCGGCCCT CGGCGGTTCG ACCAACGCGC CCATTCACCT CAATGCCGTG GCGCGGCACG CCGGCGTGGC GCTCGACAAT GACGACTGGC AGAGGCTCGG CCATGCGGTG CCGCTCCTGG TCAATCTCCA GCCCGCGGGC ACCTATCTCG GCGAGGATTT CTACCGCGCG GGCGGCGTGC CCGCCGTGCT GGGCGAACTC CTGGCCGCGG ATCTCCTGCC CCATCCCGAG GCGCCCACCG TCTTCGGCAC GCCACTCTCC GCCGGCGCCA TGCGCAGCCT CGAGACCGAC GTGATCCGCC CGGTGGTCGA ACCGCTGAAG GGCGAGGCGG GCTTCATCAA TCTGTCGGGC AATCTCTTCG ACAGCGCGAT CATGAAGACG AGCGTGATCT CGCCCGATTT CCGCGCCCGC TACCTCTCCG ATCCTGCCGA CCCCGAGGCG TTCGAGGGCA CGGTCTTCGT CTTCGACGGC CCCGAGCATT TCCATGCGGT GATCGACGAC CCGGCGCTGG GGATGGGCGA GGATGCGGTG CTCGTGATGC GCGGCGCGGG GCCCCTGGGC TATCCGGGCG CGGCCGAGGT GGTGAACATG CGCCCGCCCG CCTACCTCCT CAAGCGCGGC ATCCCGGCGG TGCCCTGCAT CGGCGACGGC CGCCAGTCGG GGACCTCGGG CTCGCCCTCG ATCCTCAATG CCTCGCCCGA GGCCGCGGCG GGCGGAGGCC TCGCGCTTCT GCGCAACGGC GACCGGATCC GCGTCGACCT GCGTCGCGGA CGGGTCGATG TGCTGCTGCC GGACGAGGAG CTGGAGGCCC GCAGGACTGC CCTGGCCGAG GCCGGCGGCT ACGCCATGCC GCCCAGCCAG ACCCCCTGGC AGGCGATCTT CCGCGACCTG ACGGGACAGC TTGCAGAGGG CATGGTGCTT GACGGCGCGG ACGGGTTCCA CGATCTCGCC CGCAAGGCGC TTGCGCGCAA CAACCATTGA
|
Protein sequence | MTQRPRITPD QLRSRLWFNN PDDPEMTALY LERYLNYGLT RAELQGGKPI IGIAQTGSDL SPCNRHHIEL AKRVRDGITA AGGIPMEIPV HPIQETGKRP TASLDRNLAY LGLVEALHGY PIDGVVLTIG CDKTTPALLM AAATVNIPAI AYSVGPMLNG WHRGERAGSG TAVWRARELL ARGEIDEEGF YELVASSAPS VGYCNTMGTA STMNSLAEVL GMQLPGSAAI PAPYRERGQM GHATGRRIVE MVWEDLRPSD ILTREAFENA IVACSALGGS TNAPIHLNAV ARHAGVALDN DDWQRLGHAV PLLVNLQPAG TYLGEDFYRA GGVPAVLGEL LAADLLPHPE APTVFGTPLS AGAMRSLETD VIRPVVEPLK GEAGFINLSG NLFDSAIMKT SVISPDFRAR YLSDPADPEA FEGTVFVFDG PEHFHAVIDD PALGMGEDAV LVMRGAGPLG YPGAAEVVNM RPPAYLLKRG IPAVPCIGDG RQSGTSGSPS ILNASPEAAA GGGLALLRNG DRIRVDLRRG RVDVLLPDEE LEARRTALAE AGGYAMPPSQ TPWQAIFRDL TGQLAEGMVL DGADGFHDLA RKALARNNH
|
| |
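The sequences in the record are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute the GC fraction, and translate the gene sequence. It rests on a few assumptions: the gene sequence is already given in coding orientation (it begins with ATG and ends with TGA, although the gene lies on the minus strand), so no reverse complement is taken, and the amino acid assignments of translation table 11 are the same as those of the standard genetic code. The helper names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code; translation
# table 11 is assumed to use the same assignments for internal codons).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned, non-empty nucleotide sequence."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Any trailing bases that do not fill a whole codon are ignored.
    """
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# The first three blocks of the gene sequence from the record.
fragment = clean("ATGACCCAAC GCCCCCGGAT CACCCCCGAC")
print(translate(fragment))  # MTQRPRITPD
```

Passing the full Gene sequence field through `clean` and `translate` should reproduce the Protein sequence field, and `gc_fraction` on the cleaned gene sequence can be compared with the GC content field.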