Gene HS_0420 details

Gene Information

Locus tag: HS_0420
Symbol: guaB
ID: 4239896
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haemophilus somnus 129PT
Kingdom: Bacteria
Replicon accession: NC_008309
Strand:
Start bp: 448262
End bp: 449725
Gene Length: 1464 bp
Protein Length: 487 aa
Translation table: 11
GC content: 44%
IMG OID: 638103963
Product: inosine 5'-monophosphate dehydrogenase
Protein accession: YP_718630
Protein GI: 113460566
COG category: [F] Nucleotide transport and metabolism
              [T] Signal transduction mechanisms
COG ID: [COG0516] IMP dehydrogenase/GMP reductase
        [COG3448] CBS-domain-containing membrane protein
TIGRFAM ID: [TIGR01302] inosine-5'-monophosphate dehydrogenase
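
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 448262
end_bp = 449725

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the final codon is the stop
# codon, which is assumed not to count towards the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # matches the "Gene Length" field
print(protein_length, "aa")  # matches the "Protein Length" field
```

Under these assumptions the computed values agree with the 1464 bp and 487 aa reported in the record.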


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTACGCA TCAAACAAGA AGCCTTAACT TTTGACGACG TTCTATTAGT TCCGGCTCAT 
TCTACAGTAC TTCCAAATAC TGCCAACCTT TCAACTAACC TTACCAAAGA AATCCGTCTA
AATATTCCTA TGTTATCTGC TGCAATGGAT ACTGTTACCG AAGCTAAATT GGCTATTTCT
CTAGCTCAAG AGGGCGGTAT CGGATTTATT CATAAAAATA TGACCATTGA ACGTCAAGCG
GATCGTGTGC GTAAAGTGAA AAAATTCGAG AGCGGAATTG TATCTGAACC GGTTACCGTT
TCACCAACAA TGACGTTAAC TGAATTAGCG GAATTGGCTA AGAAAAACGG TTTTGCAGGT
TATCCGGTAG TTGATGAACA AAAAGGTTTA GTCGGGATCA TTACAGGTCG TGATACACGT
TTCGTTTCAG ATTTGAATAA AACCGTTGCG GATTTTATGA CCCCAAAAGA TCGTTTAGTT
ACTGTGAAAG AAGGGGCTAC TCGGGAAGAA ATTTTCCATT TAATGCACGA ACATCGTGTC
GAAAAAGTGC TGGTGGTAGA TGACAGCTTT AAACTAAAAG GAATGATTAC CTTAAAAGAC
TACCAAAAAG CCGAAAGTAA GCCGAATGCG TGTAAAGATG AATTTGGTCG TTTACGTGTT
GGTGCCGCAG TCGGTGCCGG TCCCGGTAAT GAAGAACGTA TTGAAGCCTT AGTAAATGCC
GGTGTGGACA TTTTATTGAT TGATTCATCA CACGGACATT CCGAAGGTGT TTTACAACGC
GTGCGTGAAA CGCGTGCTAA ATATCCAAAT TTACCTATTA TTGCAGGGAA TATTGCGACA
GCGGAAGGTG CGATTGCATT AGCTGATGCG GGGGCAAGTG CGGTTAAAGT TGGGATTGGA
CCGGGATCAA TTTGTACGAC TCGTATTGTA ACAGGTGTGG GGGTGCCACA AATTACGGCG
ATTGCTGATG CTGCTGAAGC ATTGCGTGAG CGTGGTATTC CAGTGATTGC CGATGGTGGT
ATCCGCTATT CGGGTGATAT CGCTAAAGCG ATTGCAGCAG GTGCATCTTG TGTTATGGTC
GGATCAATGT TTGCAGGAAC GGAAGAAGCA CCGGGTGAAA TCGAACTTTA TCAAGGGCGA
GCATTTAAAT CTTACCGAGG TATGGGTTCC CTTGGTGCAA TGTCAAAAGG ATCTTCCGAT
CGTTATTTTC AATCCGATAA TGCTGCGGAT AAACTAGTGC CTGAAGGGAT TGAAGGGCGT
ATTCCATACA AAGGCTTATT AAAAGAAATT ATCCACCAAC AAATGGGTGG ATTACGTTCT
TGCATGGGCT TAACCGGCTG TGCGACTATC GAAGAATTAC GCACTAAAGC CCAATTTGTA
CGTATCAGCG GAGCAGGCAT CAAAGAGAGT CACGTTCACG ATGTGACCAT TACCAAAGAA
GCACCGAATT ATCGCATGGG GTAA
 
Protein sequence
MLRIKQEALT FDDVLLVPAH STVLPNTANL STNLTKEIRL NIPMLSAAMD TVTEAKLAIS 
LAQEGGIGFI HKNMTIERQA DRVRKVKKFE SGIVSEPVTV SPTMTLTELA ELAKKNGFAG
YPVVDEQKGL VGIITGRDTR FVSDLNKTVA DFMTPKDRLV TVKEGATREE IFHLMHEHRV
EKVLVVDDSF KLKGMITLKD YQKAESKPNA CKDEFGRLRV GAAVGAGPGN EERIEALVNA
GVDILLIDSS HGHSEGVLQR VRETRAKYPN LPIIAGNIAT AEGAIALADA GASAVKVGIG
PGSICTTRIV TGVGVPQITA IADAAEALRE RGIPVIADGG IRYSGDIAKA IAAGASCVMV
GSMFAGTEEA PGEIELYQGR AFKSYRGMGS LGAMSKGSSD RYFQSDNAAD KLVPEGIEGR
IPYKGLLKEI IHQQMGGLRS CMGLTGCATI EELRTKAQFV RISGAGIKES HVHDVTITKE
APNYRMG
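
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is a minimal example, not the database's own method: the `translate` helper and the codon-table construction are hypothetical names introduced here. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons), and it translates only the first and last rows of the gene sequence shown above.

```python
from itertools import product

# Codon table in the conventional NCBI layout: bases ordered T, C, A, G,
# with the third codon position varying fastest. Translation table 11 uses
# the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace is removed first, so the 10-base blocks used in the
    record can be pasted in unchanged.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the record. Each
# full row holds 60 bases, so every row starts on a codon boundary.
first_row = "ATGTTACGCA TCAAACAAGA AGCCTTAACT TTTGACGACG TTCTATTAGT TCCGGCTCAT"
last_row = "GCACCGAATT ATCGCATGGG GTAA"

print(translate(first_row))  # MLRIKQEALTFDDVLLVPAH
print(translate(last_row))   # APNYRMG
```

The two outputs correspond to the first 20 and the last 7 residues of the protein sequence listed above; the final `TAA` codon is the stop and contributes no residue.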