Gene HS_0018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_0018 
Symbol 
ID4239526 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp19007 
End bp20050 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content25% 
IMG OID638103549 
Producthypothetical protein 
Protein accessionYP_718224 
Protein GI113460167 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATATA TCACTTCTAA ACATGTATTT TTTGACTATC TAAATGAAAA TGAATTTGTT 
ATTGGGATAG GTAGTAACCA AGAAATTACA AACAACAAAG ATTATTTTAA TAATTGTCTA
AATTTATGTT ATTTTTGTAT AAATCCTAAG AGCATTTCTG AAATATTATC TTTTATAAAA
GATAATAATA TAGACATTCT ATATTTCGAT AAAATGAAGA AAATGAAGTT TATTACAAAA
GAAATAATAG ACTTCAATGA TAGATACAGT AGAAATCATT TATACTATAA TGCATTAGGA
TATAAAATAT ATGATATACA AAATAAAATA TCTAAATCTC ATATTCTTAT TGTTGGTGCA
GGAGGCATAG GAAATATTTG CTCCTATTTA TTAGGAACAA TAGGAATTAA GAAGTTAAGT
ATCATTGATG ATGATATAGT TGAGGAATCT AATCTAAACA GACAGTTCTT ATTTCGAGAG
AAAGACATAA ACAAAAATAA AGTAGAAACA ATAAAAAGAG AGTTATTATC TATTCGGAAA
GATATTATTA TTGATATTTT CCCAGAGAAA TTAAATAAAT CTATTTTAGA TAAAATATCA
CAAATAGATT TAGTTATTTG TTCAGCAGAT GATGAGTATT GTATAGATAT GATTAATGAA
TTTTGCTGTT TTAATAAAAT TCCTCTAATT AACGTAGGTT ACCTCAATGA TATTTCTGTT
ATCGGACCAT TCTACATTCC AAAGTTAGAA TATAGCTGTT GTTTATGTTG TGATAAGTCT
ATATATTTAG AAAATGATGT TATAGATGAA AAAGTGAAGA AAATTAAATC AGTTACGAAA
GCACCATCTA CTATCATTAA TAATTTCTTT GCTGGTGCTA TGCTTGGTTC AGAACTTATT
AAATTCTTTG CGTGCGATTA CAAATCAATG CAAAGTATTA ATTCTGTAAT AGGAATTCAC
AATAAGAATT TTAAGTATGA AGAAATTAAG TTAGCTAGAA ATTATAATTG CAAATATTGT
GGAGTAAATA ATGAGACACT ATGA
 
Protein sequence
MKYITSKHVF FDYLNENEFV IGIGSNQEIT NNKDYFNNCL NLCYFCINPK SISEILSFIK 
DNNIDILYFD KMKKMKFITK EIIDFNDRYS RNHLYYNALG YKIYDIQNKI SKSHILIVGA
GGIGNICSYL LGTIGIKKLS IIDDDIVEES NLNRQFLFRE KDINKNKVET IKRELLSIRK
DIIIDIFPEK LNKSILDKIS QIDLVICSAD DEYCIDMINE FCCFNKIPLI NVGYLNDISV
IGPFYIPKLE YSCCLCCDKS IYLENDVIDE KVKKIKSVTK APSTIINNFF AGAMLGSELI
KFFACDYKSM QSINSVIGIH NKNFKYEEIK LARNYNCKYC GVNNETL