Gene HS_1584 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_1584 
Symbol 
ID4241111 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp1798287 
End bp1799783 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content42% 
IMG OID638105170 
Productmethylmalonate-semialdehyde dehydrogenase [acylating] 
Protein accessionYP_719789 
Protein GI113461720 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01722] methylmalonic acid semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000121824 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAACTTTA TCAATGGTAA ACAAGTAGCA AGTAAAGGTA CTAAAGCTTG GCCTATTTAT 
AATCCAGCCA CGGGTGAGCA AATTCGCCAA GTAATGATGA GTACTGCAGA AGAGGTAAAT
GAAGCGGTTG AAGTTGCACA AAAGGCATTT CCTGCTTGGG CTGCGACTTC GCCACTTCGT
CGTGCCCGTG TAATGTTTAA ATTTAAACAA TTGTTAGAAC GTGATTGGGA TGACTTAGCT
CGTTTAATTA CCGAAGAACA TGGTAAAGTT TTCTCTGATG CACAAGGTGA ACTAACTCGT
GGTTTAGAAG TTGTGGAGTT TGCTTGTGGC ATTCCACATT TAGTAAAAGG TGAATTTTCT
GAACAAGCCG GTCGTGGGAT TGATATTCAT TCAGCCATGC AACCTTTAGG AGTTTGTGTA
GGTATCACAC CATTTAATTT TCCTGCGATG GTTCCAATGT GGATGTTTCC GATTGCACTT
GCTTGCGGTA ATACATTTAT CTTAAAACCG GCGAAAGCCG ATCCATCTTT ATCTATTCGT
TTGGCTGAAT TATTAAAAGA GGCGGGGTTA CCGGATGGTG TGTTTAATGT AGTGCATGGT
GATCGTCAAG ACAATGAAAT TTTATTGCGT GATCCTCGAG TGAAAGCTGT AAGCTTTGTG
GGTTCAACTA CGGCTGCGGC ACATGTTCAT GCGATTGGTT CGGCACATGG TAAACGTGTT
CAAGCACTGG GTGCGGCGAA AAATCACGGT TTAGTGATGC CAGATGTCGA TATTGAAGCA
ACAGCTAACG CACTATTAGG TGCGGCATTC GGTGCAGCCG GTGAACGCTG TATGGCATTA
CCTTTAGCAG TAACCATTGA TGATGAAACG GCAGATAAAT TAGTCGCAGC ATTAAAACCG
AAAGTAGAGG CATTACGTTA TGGTCCCGGT CTCACTAAAG AAGGCGAAAA AGAAAATGAT
TTTGGTCCAT TAATTACTCG TCAACATCGT GATAACGTAA AACGTTATGT AGATATAGGT
GTGTCAGAGG GGGCAACTTT AGTGGTTGAT GGACGTGATA AATTCCCTGT TGGTTACGAG
AATGGTTTCT TTATTGGAGG GTGTTTATTT GACCATGTAA CTCCTGATAT GACTATTTAC
AAAGAAGAAA TTTTTGGTCC AGTATTAGGC ATTGTCCGTG CGAAAAATTT CCAAGAGGCT
ATGAAACTAA TTAATGAGCA TCAATTCGGT AATGGTGCAG CAATCTTTAC CAATAATGGT
GCGGCAGCTC GTGAATTTGC ATATCAAGTA CAAGCAGGTA TGGTGGGCGT GAATGTGCCA
ATTCCGGTGC CGATGGCATT CCATTGTTTC GGGGGATGGA AAGAATCCGC TTACGGTGCA
TTAAATGCTT ACGGTCCAGA CGGTGTTCGT TTTTATACCA AAATGAAAAC TGTAACAACA
CGTTGGCCAG AGAAAGATCT TTCATCACAA GCAGCATTCA GTATGCCAAC CCTATAA
 
Protein sequence
MNFINGKQVA SKGTKAWPIY NPATGEQIRQ VMMSTAEEVN EAVEVAQKAF PAWAATSPLR 
RARVMFKFKQ LLERDWDDLA RLITEEHGKV FSDAQGELTR GLEVVEFACG IPHLVKGEFS
EQAGRGIDIH SAMQPLGVCV GITPFNFPAM VPMWMFPIAL ACGNTFILKP AKADPSLSIR
LAELLKEAGL PDGVFNVVHG DRQDNEILLR DPRVKAVSFV GSTTAAAHVH AIGSAHGKRV
QALGAAKNHG LVMPDVDIEA TANALLGAAF GAAGERCMAL PLAVTIDDET ADKLVAALKP
KVEALRYGPG LTKEGEKEND FGPLITRQHR DNVKRYVDIG VSEGATLVVD GRDKFPVGYE
NGFFIGGCLF DHVTPDMTIY KEEIFGPVLG IVRAKNFQEA MKLINEHQFG NGAAIFTNNG
AAAREFAYQV QAGMVGVNVP IPVPMAFHCF GGWKESAYGA LNAYGPDGVR FYTKMKTVTT
RWPEKDLSSQ AAFSMPTL