Gene HS_0236 details


Gene Information

Locus tag: HS_0236
Symbol: galM
ID: 4239752
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haemophilus somnus 129PT
Kingdom: Bacteria
Replicon accession: NC_008309
Strand:
Start bp: 236574
End bp: 237617
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 38%
IMG OID: 638103773
Product: aldose 1-epimerase
Protein accession: YP_718444
Protein GI: 113460382
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2017] Galactose mutarotase and related enzymes
TIGRFAM ID: [TIGR02636] galactose mutarotase
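
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 236574
END_BP = 237617
GENE_LENGTH_BP = 1044
PROTEIN_LENGTH_AA = 347


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1044, matches Gene Length
print(protein_length(GENE_LENGTH_BP))  # 347, matches Protein Length
```

Both results agree with the reported Gene Length and Protein Length, so the record is internally consistent under these assumptions.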


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTAATTG ATTGTTTTGA GAAGGAACAC AAGCAAGGAA TGGCACCGGA TGGACAACCT 
TTTCGTATTT TTACGTTAAC TAATACAAAA GGCATGAAAG TGCAAGTGAT GGATTGGGGG
GCAACTTGGA TTTCTTGTCA AGTACCGGTA GGAAAAGAAG TACGAGAAGT TTTGCTTGGG
TGTCAGATTA ATGATTATCC GATACAACAG GTTTATTTAG GAGCGAGTAT TGGGCGTTAT
GCAAATCGTA TTGCAAATAG CCGATTTGAG TTGAATGGTA AACGCTATTT ACTTAACGCT
AATCAACATC AGCATCAACT TCATGGTGGA AAAGGGTTTC ATAATGAGCG TTGGTATTTA
GAAAAGTGCG GTGTAAATTC CATCACTTTT TCTCATTTTA GCCCTGATGG AGATCAGGGA
TTTCCCGGTA ATTTACATGC TTTTGTTACT TATTCTTTAA GTGAAACCAA CAACGTGAGA
ATTGAATATG AGGCAATTTG TGATCAAGAT TGCCCAATTA ATTTGACTAA CCATGCTTAT
TTTAACTTGA ATGATGCTAC TTTCGGTTGT GATATTCGAG GGCATTCTTT ACAACTTAAT
AGCGATTATT TTTTGCCGGT GGACAGTGTG GGTATCCCTA ATGCTAAGTT AAAAGCGGTT
GAGGGAACTA GTTTTGATTT TCGTGAGGAA AAACCAATCG GTTTAGATTT TTTACAAGAA
GAACAAAAAT TGGTAAAAGG TTACGACCAT TCTTTCTTGC TTAATCCGGA CATTGAAAAA
CCTTGTGCTA TTTTGACCGC ACTTGATCGT TCTTTGAGAA TGCAAGTGTT GACTTCTCAG
CCGGCTTTAC AGATTTATAC GGGCAATTTT CTATCAGCTA CGCCAACTCG TCAAAACGGG
CAGTATGCTG ATTATGCTGG TATTGCTTTG GAAACTCAAT GTTTGCCTGA TACACCGAAT
CATCCGGAAT GGTGGAAATA TGGTGGAATA ACAAAGGTGG GCGAAAAATA TTCTCATAAA
ACGGAATATC AATTTATCCG TTAG
 
Protein sequence
MLIDCFEKEH KQGMAPDGQP FRIFTLTNTK GMKVQVMDWG ATWISCQVPV GKEVREVLLG 
CQINDYPIQQ VYLGASIGRY ANRIANSRFE LNGKRYLLNA NQHQHQLHGG KGFHNERWYL
EKCGVNSITF SHFSPDGDQG FPGNLHAFVT YSLSETNNVR IEYEAICDQD CPINLTNHAY
FNLNDATFGC DIRGHSLQLN SDYFLPVDSV GIPNAKLKAV EGTSFDFREE KPIGLDFLQE
EQKLVKGYDH SFLLNPDIEK PCAILTALDR SLRMQVLTSQ PALQIYTGNF LSATPTRQNG
QYADYAGIAL ETQCLPDTPN HPEWWKYGGI TKVGEKYSHK TEYQFIR