Gene HY04AAS1_0217 details

Gene Information

Locus tag: HY04AAS1_0217
Symbol:
ID: 6743006
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Hydrogenobaculum sp. Y04AAS1
Kingdom: Bacteria
Replicon accession: NC_011126
Strand:
Start bp: 196839
End bp: 198503
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 40%
IMG OID: 642750007
Product: dihydroxy-acid dehydratase
Protein accession: YP_002120887
Protein GI: 195952597
COG category: [E] Amino acid transport and metabolism
              [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
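
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 196839
end_bp = 198503

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last one is a stop codon.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1665
print(protein_length)  # 554
```

Both results agree with the Gene Length (1665 bp) and Protein Length (554 aa) reported in the record.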


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAAGCG ATATAATACA AAAAGGCTAT GACAAAGCCC CGCATAGATC TTTATTGAGG 
GCTTGTGGTT TTAAAGATGA GGATTTTGGA AAACCCATCA TAGGTGTTGC TAATTCTTAT
ATAGATATTG TGCCTGGGCA TGTGCATCTT AGAGAATTTG TAGAACCTAT AAAAGAAGAG
ATTAGAAAAG CCGGTGCTAT ACCAGTAGAG TTTAACACCA TAGGTGTAGA TGACGGTATA
GCTATGGGAC ATTACGGTAT GCACTACTCT TTACCTAGTA GAGAACTTAT AGCAGATGCT
ATAGAAACCG TTGTAGAAGC TCATCAGCTT GATGGTCTTA TCTGCATACC AAACTGCGAT
AAAATAGTCC CCGGTATGCT AATGGGTGCT CTTAGAGTAA ACGTGCCTAC GGTGTTTATA
AGCGGCGGTC CCATGGCTGC TGGTAAAATA GGAGATAAAA AAGTAGATCT TATATCTGTT
TTTGAAGGGG TTGGCAAGCT AAACAAAGGT GAGATAACAG AAAAAGATCT TCTTGTTATA
GAGCAAAACG CTTGTCCTTC TTGCGGTTCT TGTTCTGGGC TTTTTACGGC AAACTCTATG
AACTGCCTTA CTGAAGTGCT TGGTCTTGGA CTTCCAGGAA ACGGTACCAC CTTAGCCATA
GACCCAAGAA GAGAAATGCT TGCAAGGCAA GCTGCTAGAC AAGTAGTAGA ACTTGTAAAG
ATAGATTTAA AACCAAGGGA TATAGTTACA AAAGCATCTT TTGACAATGC TTTTAGAGTT
GATATAGCTA TGGGTGGATC TTCCAACACA GTGCTTCATC TTTTGGCTAT TGCAAGAGAG
GCTGGTATAG AGTACAAAAT GGAAGATATA GATAAAATTT CAAGAGAAAC ACCTACGCTT
TGCAAAATAT CCCCTTCTTC AGATTATCAT ATGGATGACC TTGATAGAGC TGGTGGAATA
AGTGCTATCA TGAAAGAACT TCTTAGAAAT GGTTTATTTG ACGGTAAACA AAGAACGGTG
ACTACAAAAA CCATAGAAGA GATCGTAAAA GATGTAGAAA TAATGGATGA AAACGTCATA
AGACGTATAG ACAACGCTTA TTCAAAAGAC GGTGGTCTTG CCATACTATT TGGTAATTTG
GCACCCAAAG GAAGCGTGGT AAAAACTGCT GGTGTTGCCA AAGAGATGCT GCAGTTCAAA
GGGAAAGCCA TATGTTTTGA TTCCGAAGAA GAAGCCATAG AAGGTATAAG GGGTGGAAAG
GTAAAACCCG GCCATGTAGT GGTCATAAGA TACGAAGGTC CAAAAGGTGG TCCTGGTATG
AGGGAGATGC TAAGCCCCAC TTCCACCATA ATGGGTATGG GTCTTGGAAG TTCTGTGGCC
CTTATTACAG ATGGAAGGTT TTCAGGTGGT ACTAGAGGTG CTTGTATAGG GCATATATCT
CCAGAAGCAG CAGCTGGTGG CCCTATTGGT ATAGTTCAAG ACGGTGATGA AATACTTATA
GACATACCAA ATAGAAAGCT TGAGCTTTTG ATATCTCAAG AAGAGTTTGA TGCTCGTTTA
AAAGCTTTTA AACCAAAGGA AAAACTTATT AAAAGCTCAT GGCTCAAGCG TTATAGGAAA
TTTGTAAAAG ATGCTTCAGA GGGTGCTATT TTATCTGCGG ATTGA
 
Protein sequence
MRSDIIQKGY DKAPHRSLLR ACGFKDEDFG KPIIGVANSY IDIVPGHVHL REFVEPIKEE 
IRKAGAIPVE FNTIGVDDGI AMGHYGMHYS LPSRELIADA IETVVEAHQL DGLICIPNCD
KIVPGMLMGA LRVNVPTVFI SGGPMAAGKI GDKKVDLISV FEGVGKLNKG EITEKDLLVI
EQNACPSCGS CSGLFTANSM NCLTEVLGLG LPGNGTTLAI DPRREMLARQ AARQVVELVK
IDLKPRDIVT KASFDNAFRV DIAMGGSSNT VLHLLAIARE AGIEYKMEDI DKISRETPTL
CKISPSSDYH MDDLDRAGGI SAIMKELLRN GLFDGKQRTV TTKTIEEIVK DVEIMDENVI
RRIDNAYSKD GGLAILFGNL APKGSVVKTA GVAKEMLQFK GKAICFDSEE EAIEGIRGGK
VKPGHVVVIR YEGPKGGPGM REMLSPTSTI MGMGLGSSVA LITDGRFSGG TRGACIGHIS
PEAAAGGPIG IVQDGDEILI DIPNRKLELL ISQEEFDARL KAFKPKEKLI KSSWLKRYRK
FVKDASEGAI LSAD
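
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method: `translate` and `CODON_TABLE` are hypothetical names, and the table is built from the standard codon assignments, which translation table 11 shares for amino acids. Alternative start codons, which table 11 also allows, are not handled here; this gene starts with ATG, so that does not matter for this record.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
first_block = "ATGAGAAGCG ATATAATACA AAAAGGCTAT"
print(translate(first_block))  # MRSDIIQKGY
```

The output matches the first block of the protein sequence. Passing the full gene sequence, with its spaces and line breaks, should work the same way, because whitespace is stripped before translation.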