Gene Information |
Locus tag | Dole_2036 |
Symbol | |
ID | 5694879 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | + |
Start bp | 2467348 |
End bp | 2469021 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 641264637 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_001529917 |
Protein GI | 158522047 |
COG category | [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
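The coordinate and length fields in the table above can be cross-checked against each other. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length (the listed gene sequence ends in TAG). The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2467348
end_bp = 2469021
gene_length_bp = 1674
protein_length_aa = 557

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS is read in codons of 3 bp; the final codon is assumed to be a stop
# codon, which is not translated into an amino acid.
codons, remainder = divmod(span_bp, 3)
translated_codons = codons - 1

print("span matches Gene Length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("codons minus stop match Protein Length:", translated_codons == protein_length_aa)
```

Under these assumptions all three checks agree with the record: 1674 bp is 558 codons, and removing the stop codon leaves 557 amino acids.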
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000199143 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGCG ATTCAATGAA AGAGGGACTG GCACGGGCGC CCCATCGCTC TCTGCTCAAG TCCATCGGCT ATACGGATGA GGAAATCGGA CGGCCCATTA TCGGCATCGT CAATTCGGCC AACGAGATCG TACCCGGCCA CGCGGACTTG AACAAAATCG CCCGGGCCGT CAAGGACGGG GTTTACATGG CCGGGGGCAC GCCGGTGGAG TTTTCCACCA TCGGCGTGTG CGACGGCATT GCCATGAACC ATATCGGCAT GAAATACTCT CTGGGCAGCC GGGAGCTGAT CGCCGACTCC GTGGAGATCA TGGCCACGGC CCACGCCTTT GACGCACTGG TGATGATTCC CAACTGCGAC AAGATCGTTC CGGGCATGCT CATGGCCGCG GCCCGGCTCA ACCTGCCCAC CATCTTCATC AGCGGCGGCC CCATGCTGGC GGGCCGGTAT CCCGGCAAGC CCGAAAAAAA AGTGGACCTG ATCACCGTGT TCGAAGCCGT GGGCGCCGTC AAATCCGGCA GAATGGCCCC TGAAGAACTT GCCATCATCG AAGACGCGGC CTGTCCCACC TGCGGATCGT GTTCCGGCAT GTTCACGGCC AACTCCATGA ACTGCCTGAC CGAGGCCATC GGCATGGGCC TGCCGGGCAA CGGCACGGTG CCGGCGGTGA TGTCCGAACG GGTGCGCATG GCCAAGCAGG CCGGCATGCG GATTCTCGAC CTTCTTAAAA ACGGCGTCAC CCCGGACAAA ATCATGACGG CCAAAGCGTT CCGCAACGCA CTGGCCGTGG ACATGGCCCT GGGATGTTCC ACCAACACCG TGTTGCACCT GCCGGCCATT GCCCACGAAG CCGGGGTCTC CATCAGCCTG GACCTGATCA ATGAGATCAG CGGCATCGCC CCCCACCTCT GCTCCCTGAG CCCGGCGGGG CCCAACCATA TCGAAGACCT GAACATGGCC GGTGGCATTC AGGCTGTTTT AAAAGAGCTT GCCCGGAAAA GCGGGCTGAT TGATCCTGAC TGCCTGACCG TCACCGGCAG AACCGTCGGC GAAAATATCG CGTCGGCCAG GGACGCGGAC GGCCAGGTGA TCCGCACCCT GGAAACCCCC CACCATGCCC AGGGCGGCCT GGCCGTCCTT TTCGGCAACC TTGCGCCGGA TGGATGCGTG GTCAAGCAGT CTGCGGTTGT CGATAAGATG CTGGTCCACG AAGGCCCTGC CCGGGTGTTT GATTCCGAGG AAGATGCCAC CACCGCCATC ATGGACGGCA GAATCAAGAA GGGAGACGTG CTGGTCATCC GCTACGAGGG CCCCAAGGGC GGTCCGGGCA TGCGGGAGAT GCTCACCCCC ACGTCGGCCC TGGCCGGCAT GGGACTGGAC AGCACCGTGG CCCTGATCAC CGACGGCCGG TTTTCCGGCG GCAGCCGGGG CGCGGCCATC GGTCATGTGT CGCCCGAGGC CATGGAGGGC GGTCCCATTG CCGTTGTAAA GGAAGGGGAC ACGATCACCA TCGACATTCC CAAAAAGAAG ATCGGTCTGA AACTGGACGC CGGCGAGATT CAGAACCGGC TATCCGGATG GAACCGGCCC GCTCCCAAAA TTACGCGGGG CTACATGGCG CGATACGCCG ACCAGGTGTC GTCGGCCAAT ACCGGCGCCA TCTTTAAAAA ATAG
|
Protein sequence | MKSDSMKEGL ARAPHRSLLK SIGYTDEEIG RPIIGIVNSA NEIVPGHADL NKIARAVKDG VYMAGGTPVE FSTIGVCDGI AMNHIGMKYS LGSRELIADS VEIMATAHAF DALVMIPNCD KIVPGMLMAA ARLNLPTIFI SGGPMLAGRY PGKPEKKVDL ITVFEAVGAV KSGRMAPEEL AIIEDAACPT CGSCSGMFTA NSMNCLTEAI GMGLPGNGTV PAVMSERVRM AKQAGMRILD LLKNGVTPDK IMTAKAFRNA LAVDMALGCS TNTVLHLPAI AHEAGVSISL DLINEISGIA PHLCSLSPAG PNHIEDLNMA GGIQAVLKEL ARKSGLIDPD CLTVTGRTVG ENIASARDAD GQVIRTLETP HHAQGGLAVL FGNLAPDGCV VKQSAVVDKM LVHEGPARVF DSEEDATTAI MDGRIKKGDV LVIRYEGPKG GPGMREMLTP TSALAGMGLD STVALITDGR FSGGSRGAAI GHVSPEAMEG GPIAVVKEGD TITIDIPKKK IGLKLDAGEI QNRLSGWNRP APKITRGYMA RYADQVSSAN TGAIFKK