Gene HS_0342 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_0342 
SymbolilvD 
ID4239816 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp344205 
End bp346040 
Gene Length1836 bp 
Protein Length611 aa 
Translation table11 
GC content43% 
IMG OID638103883 
Productdihydroxy-acid dehydratase 
Protein accessionYP_718550 
Protein GI113460488 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTAAAC TACGTTCAGC GACCAGTACA CAAGGTCGCA ATATGGCAGG TGCACGTGCT 
TTATGGCGGG CAACAGGAAT GAAAGAAAAT GATTTCGGTA AACCGATTAT TGCGGTTGTG
AACTCATTTA CTCAATTTGT ACCTGGACAT GTTCACTTAA AAGATATGGG GCAACTAGTT
GCTACTGAAA TTGAAAAATT TGGTGGGGTA GCAAAAGAGT TTAATACCAT CGCTGTAGAT
GATGGAATTG CTATGGGGCA TGGAGGTATG CTCTATTCTT TACCGAGCCG AGATTTAATT
GCCGATAGTG TTGAATATAT GGTTAACGCT CACTGTGCTG ATGCGATGGT TTGTATTTCT
AACTGTGATA AGATCACACC GGGAATGTTA ATGGCTGCAT TGCGTTTGAA TATTCCAACA
GTCTTTGTTT CAGGTGGCCC AATGGAAGCG GGTAAAACCA AATTATCCGA TCAAATCATT
AAATTAGACT TAGTGGATGC TATGATTCAA GGTGCAAATC CGAATGTTTC AGATGATGTC
AGCGAACAAA TTGAGCGTTC TGCTTGTCCA ACTTGTGGCT CTTGTTCAGG TATGTTTACC
GCCAATTCAA TGAATTGTTT AACCGAAGCA CTGGGCTTGA GCTTACCGGG AAACGGCTCA
TGTTTGGCTA CTCATGCCGA CCGCAAACAA CTTTTCTTAG CGGCAGGAAA ACAGATTGTT
GAACTGTGCA AACGTTATTA TGAACAAGAT GATACATCTG TTTTACCTCG CTCAATTGCC
ACAAAAGAAG CCTTTGATAA CGCTATGAGT CTTGATATCG CTATGGGTGG TTCGACCAAT
ACTGTTTTGC ATTTATTAGC TGCTGCACAG GAAGCGGAAG TCAATTTCAC TATGGCGGAT
ATTGATCGCC TTTCTCGGGT AGTACCGTGC CTGAGCAAAG TTGCACCAAA TACCCAAAAG
TATCATATGG AAGATGTGCA TCGTGCTGGC GGTATTATGG CAATTTTAGG TGAATTAGAT
CGTGCCGGCT TGTTGAATAG CCAAACTCGT ACAATTTTGG GTATGAGCAT AGGCGAACAA
ATTGCAAAAT ATGACATCAA ACTCACTCAA GATAAAGCCA TACATAAATT TTTCCGTGCA
GGACCAGCAG GGATTCGCAC TACTCAAGCT TTCTTGCAAG ATTGTCGTTG GGATACGGTT
GATGATGATC GTGAAAATGG CTGTATTCGC AGTAAAGAAT TTGCGTACAG CCAAGATGGT
GGGTTAGCTA TGTTGTCAGG CAATATTGCT TTAGATGGCT GTATAGTCAA AACTGCTGGA
GTAGATGAAA GCATTTTAAA GTTCAGCGGT AAAGCCATTG TATTTGAAAG TCAAGAAGAT
GCTGTATCAG GCATTTTGGG GGGTAAAGTA CAAGCCGGAC ATGTTGTGGT GATTCGGTAT
GAAGGACCAA AAGGCGGACC TGGTATGCAA GAAATGCTTT ATCCAACCAG TTATCTCAAA
TCTATGGGCT TAGGTAAAGC TTGTGCCTTA CTTACAGATG GTCGTTTCTC CGGTGGTACA
TCGGGACTGT CTATCGGACA CTGCTCACCG GAGGCGGCGG CAGGCGGTTT AATTGGTGTA
GTGAAAGATG GTGATATTAT TGAGATTGAT ATTCCAAATC GTCGCATCGA ATTGATGGTA
TCCGAAGAAG AACTTGCTGA GCGTCGAGCA GAGCAAGATA AACTTGGCTG GAAACCAGCT
AATCGCCAGA GAGAAGTTTC CTTTGCCCTA AAAGTTTACG GATATTTCGC AACATCTGCG
GACAAGGGTG CAGTACGAGA TAAAACGAAG ATATAA
 
Protein sequence
MPKLRSATST QGRNMAGARA LWRATGMKEN DFGKPIIAVV NSFTQFVPGH VHLKDMGQLV 
ATEIEKFGGV AKEFNTIAVD DGIAMGHGGM LYSLPSRDLI ADSVEYMVNA HCADAMVCIS
NCDKITPGML MAALRLNIPT VFVSGGPMEA GKTKLSDQII KLDLVDAMIQ GANPNVSDDV
SEQIERSACP TCGSCSGMFT ANSMNCLTEA LGLSLPGNGS CLATHADRKQ LFLAAGKQIV
ELCKRYYEQD DTSVLPRSIA TKEAFDNAMS LDIAMGGSTN TVLHLLAAAQ EAEVNFTMAD
IDRLSRVVPC LSKVAPNTQK YHMEDVHRAG GIMAILGELD RAGLLNSQTR TILGMSIGEQ
IAKYDIKLTQ DKAIHKFFRA GPAGIRTTQA FLQDCRWDTV DDDRENGCIR SKEFAYSQDG
GLAMLSGNIA LDGCIVKTAG VDESILKFSG KAIVFESQED AVSGILGGKV QAGHVVVIRY
EGPKGGPGMQ EMLYPTSYLK SMGLGKACAL LTDGRFSGGT SGLSIGHCSP EAAAGGLIGV
VKDGDIIEID IPNRRIELMV SEEELAERRA EQDKLGWKPA NRQREVSFAL KVYGYFATSA
DKGAVRDKTK I