Gene HS_0894 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_0894 
Symbol 
ID4240386 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp981768 
End bp983234 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content39% 
IMG OID638104449 
Producthypothetical protein 
Protein accessionYP_719104 
Protein GI113461037 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGTTT ATGATTTAAA AACCCAAAAA ACCTTATTAG CTGTCAGCGT CTGTTTAGCG 
TTTTCCGCTC AAGCAGAAAC AAGTAAAAAT AAAGTTGAAC GAGCTAATCA ATTACCGGAG
GTTGTTGTTT ATGCAGAGCA AAACGCAGGA TTATCTTCCA GCCAAAAAGT AACCGCCAAA
GATATTAAAT CTTCCCCTAA TTCCAACGGC AATATTTCTG ATTTTTTGAA AACCAATTCT
CATGTGCGTT TTGAACGTAG TGATGAAAAC AGTTTTCAAC GAGGTGAAAT TAAACCTGCT
GACATTTCGA TTAATGGTGC GGAAGCCAAT CAAACCAGTT ATTTTGTAGA TAATGTCAAT
ATCAATAATG ATTTAGGCTT TGACTCGGCT ATTTTTGATG GAGCGATGCA AACTTTGCCT
ATGGCAAGTC ATGCACAAGC CTATTTCTTT GATGCGAATT TATTGTCTTC CGTAACCGTT
TACGACAGCG ATATTTCCGC CAGTTTAGGC GGTTTTGCCG GCGGTGCCGT AGTCGCCAAA
ACCAAGCAAT ACGATGGTAC CGATGGTGTG CAATTGCGTT ATCGTACCAG TCATTCTAAT
TGGGCGAAAT TTCATCTTGA GGAAAAAGAT CGAGAAAGAT TTAAACAAGC CTCGCCTAAC
GGTAGTAGTG CGGATTTTCA ACCTAAATAT AGCAAAGATT TCTTCAGCCT TTCTGCACAA
CATTCTTTAG GGGAAAATAT CGGTATGGTA GCAGGATTTA GTCGCCGTAC TTCAGATATT
CAGCAACGTC GTTTAGTGCT GGGAAAAGAT GACAAGTTGA GTTCGGATAA TCGCAGACAT
AAACGCCGTT CCGATAATGC GTTGTTGAAT TTTAACTGGC TTGCTAATGA GGATAATCGC
TTTGAATTGA GTTTGCGTTA TTCTAATTAT GTGGAAACTA AATTCTTTGC AGAGAATGTT
GATAGCAATG TGCAAGATTA TCATCAAGCC TATGGTGCCA CTTTAGCTTG GATTCGTTCG
TTAAAGAGCG GTGTGTTAAC TAATACATTG GCGTATGATC AATTTGCAGA TAAGCGTAAA
TCCGCCTCTA ATTATTTGAA GCAAATATTG GCATTTGATG AAAACTATGA TCCGATTAAT
TATGAACGTG GAGGAATGGG AGATAGTGCT TTAACGCAAC GCAATGTGCA TTTTTCCAGT
GAATTTGCTA TGGATCCGTT GACTTGGGGA CGTACTGAAC ATTCTATTTC CTTAGGGGGT
ATCTGGCAAT TTACGCACTA TCGTTTTCAA CGTGATCAAA ATGCTAAATC TGAAATTTTC
ATGCAGGACA GTATGGAAAG TCCTCTTTCT TCAAATTCCG TTTCCAAAGG AACAGTAAAA
ACCGATTATC ACAATATCGC CCTTTATGTC GAAGATTTAA TCAAGCTGGG GGGGGTAAAT
GGGAAGTTCG TCCGGGATTA CGTTTAG
 
Protein sequence
MSVYDLKTQK TLLAVSVCLA FSAQAETSKN KVERANQLPE VVVYAEQNAG LSSSQKVTAK 
DIKSSPNSNG NISDFLKTNS HVRFERSDEN SFQRGEIKPA DISINGAEAN QTSYFVDNVN
INNDLGFDSA IFDGAMQTLP MASHAQAYFF DANLLSSVTV YDSDISASLG GFAGGAVVAK
TKQYDGTDGV QLRYRTSHSN WAKFHLEEKD RERFKQASPN GSSADFQPKY SKDFFSLSAQ
HSLGENIGMV AGFSRRTSDI QQRRLVLGKD DKLSSDNRRH KRRSDNALLN FNWLANEDNR
FELSLRYSNY VETKFFAENV DSNVQDYHQA YGATLAWIRS LKSGVLTNTL AYDQFADKRK
SASNYLKQIL AFDENYDPIN YERGGMGDSA LTQRNVHFSS EFAMDPLTWG RTEHSISLGG
IWQFTHYRFQ RDQNAKSEIF MQDSMESPLS SNSVSKGTVK TDYHNIALYV EDLIKLGGVN
GKFVRDYV