Gene Dole_2036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_2036 
Symbol 
ID5694879 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp2467348 
End bp2469021 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content62% 
IMG OID641264637 
Productdihydroxy-acid dehydratase 
Protein accessionYP_001529917 
Protein GI158522047 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000199143 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGCG ATTCAATGAA AGAGGGACTG GCACGGGCGC CCCATCGCTC TCTGCTCAAG 
TCCATCGGCT ATACGGATGA GGAAATCGGA CGGCCCATTA TCGGCATCGT CAATTCGGCC
AACGAGATCG TACCCGGCCA CGCGGACTTG AACAAAATCG CCCGGGCCGT CAAGGACGGG
GTTTACATGG CCGGGGGCAC GCCGGTGGAG TTTTCCACCA TCGGCGTGTG CGACGGCATT
GCCATGAACC ATATCGGCAT GAAATACTCT CTGGGCAGCC GGGAGCTGAT CGCCGACTCC
GTGGAGATCA TGGCCACGGC CCACGCCTTT GACGCACTGG TGATGATTCC CAACTGCGAC
AAGATCGTTC CGGGCATGCT CATGGCCGCG GCCCGGCTCA ACCTGCCCAC CATCTTCATC
AGCGGCGGCC CCATGCTGGC GGGCCGGTAT CCCGGCAAGC CCGAAAAAAA AGTGGACCTG
ATCACCGTGT TCGAAGCCGT GGGCGCCGTC AAATCCGGCA GAATGGCCCC TGAAGAACTT
GCCATCATCG AAGACGCGGC CTGTCCCACC TGCGGATCGT GTTCCGGCAT GTTCACGGCC
AACTCCATGA ACTGCCTGAC CGAGGCCATC GGCATGGGCC TGCCGGGCAA CGGCACGGTG
CCGGCGGTGA TGTCCGAACG GGTGCGCATG GCCAAGCAGG CCGGCATGCG GATTCTCGAC
CTTCTTAAAA ACGGCGTCAC CCCGGACAAA ATCATGACGG CCAAAGCGTT CCGCAACGCA
CTGGCCGTGG ACATGGCCCT GGGATGTTCC ACCAACACCG TGTTGCACCT GCCGGCCATT
GCCCACGAAG CCGGGGTCTC CATCAGCCTG GACCTGATCA ATGAGATCAG CGGCATCGCC
CCCCACCTCT GCTCCCTGAG CCCGGCGGGG CCCAACCATA TCGAAGACCT GAACATGGCC
GGTGGCATTC AGGCTGTTTT AAAAGAGCTT GCCCGGAAAA GCGGGCTGAT TGATCCTGAC
TGCCTGACCG TCACCGGCAG AACCGTCGGC GAAAATATCG CGTCGGCCAG GGACGCGGAC
GGCCAGGTGA TCCGCACCCT GGAAACCCCC CACCATGCCC AGGGCGGCCT GGCCGTCCTT
TTCGGCAACC TTGCGCCGGA TGGATGCGTG GTCAAGCAGT CTGCGGTTGT CGATAAGATG
CTGGTCCACG AAGGCCCTGC CCGGGTGTTT GATTCCGAGG AAGATGCCAC CACCGCCATC
ATGGACGGCA GAATCAAGAA GGGAGACGTG CTGGTCATCC GCTACGAGGG CCCCAAGGGC
GGTCCGGGCA TGCGGGAGAT GCTCACCCCC ACGTCGGCCC TGGCCGGCAT GGGACTGGAC
AGCACCGTGG CCCTGATCAC CGACGGCCGG TTTTCCGGCG GCAGCCGGGG CGCGGCCATC
GGTCATGTGT CGCCCGAGGC CATGGAGGGC GGTCCCATTG CCGTTGTAAA GGAAGGGGAC
ACGATCACCA TCGACATTCC CAAAAAGAAG ATCGGTCTGA AACTGGACGC CGGCGAGATT
CAGAACCGGC TATCCGGATG GAACCGGCCC GCTCCCAAAA TTACGCGGGG CTACATGGCG
CGATACGCCG ACCAGGTGTC GTCGGCCAAT ACCGGCGCCA TCTTTAAAAA ATAG
 
Protein sequence
MKSDSMKEGL ARAPHRSLLK SIGYTDEEIG RPIIGIVNSA NEIVPGHADL NKIARAVKDG 
VYMAGGTPVE FSTIGVCDGI AMNHIGMKYS LGSRELIADS VEIMATAHAF DALVMIPNCD
KIVPGMLMAA ARLNLPTIFI SGGPMLAGRY PGKPEKKVDL ITVFEAVGAV KSGRMAPEEL
AIIEDAACPT CGSCSGMFTA NSMNCLTEAI GMGLPGNGTV PAVMSERVRM AKQAGMRILD
LLKNGVTPDK IMTAKAFRNA LAVDMALGCS TNTVLHLPAI AHEAGVSISL DLINEISGIA
PHLCSLSPAG PNHIEDLNMA GGIQAVLKEL ARKSGLIDPD CLTVTGRTVG ENIASARDAD
GQVIRTLETP HHAQGGLAVL FGNLAPDGCV VKQSAVVDKM LVHEGPARVF DSEEDATTAI
MDGRIKKGDV LVIRYEGPKG GPGMREMLTP TSALAGMGLD STVALITDGR FSGGSRGAAI
GHVSPEAMEG GPIAVVKEGD TITIDIPKKK IGLKLDAGEI QNRLSGWNRP APKITRGYMA
RYADQVSSAN TGAIFKK