Gene HS_1767 details


Gene Information

Locus tag: HS_1767
Symbol:
ID: 4241301
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haemophilus somnus 129PT
Kingdom: Bacteria
Replicon accession: NC_008309
Strand:
Start bp: 1989343
End bp: 1990845
Gene Length: 1503 bp
Protein Length: 500 aa
Translation table: 11
GC content: 42%
IMG OID: 638105360
Product: hypothetical protein
Protein accession: YP_719972
Protein GI: 113461903
COG category:
COG ID:
TIGRFAM ID:
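
The start, end, and length fields in this record can be cross-checked with a short calculation. The sketch below assumes 1-based inclusive coordinates and assumes that the final codon is a stop codon not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1989343, 1990845)
print(length_bp)                  # 1503
print(protein_length(length_bp))  # 500
```

Both values agree with the Gene Length and Protein Length fields listed for HS_1767.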


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.214239
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCTTTA ATGTGGATAG TTATCTCGAA TATTTCCTGA CGTTGCTCGG CTGGATCATT 
AATAATGGCT TATTTGGATT GTTGGTGAGT ACCGGGCTTT TTATTGCCCC GTTAATCGGA
ATGTTGATTA AAACTTGGCT TGAAGTGAAA AAACAAGGGG CTGATGAGGG AAATAAAGGA
GAGTTATTAA TTGATTGGTT AAGCATACAA TTTTTCCCTG CAATGCTGGT GATTGTGCTG
ACCCTTGCCC CAATGTTGCC GATTTCGCTG AATAATATGG CTTATAATGT GGAGCAATCA
AAACATTGCG GTTATAAAGT GCCGTTAGCA CCTGAAAAAA CAGGTTATGC CAGTATGGTG
AGTGAATTTG CCGGTAAACA GGCCAAAGTA CCGCTGTTGT GGGGATTAAT GCACGCCGTT
AATAAAGGGA TTTTACACGG TGCGGTTTCA ACCATTCCTT GCAAGCTTGA TTTACGTCAA
ATCCGCTTTG AGGTACAACA CGAAAAAATT AATAATCCGG CACTGTTGAC CGAAGTGCGT
CAATTTGTGC AACAATGTTA TTTGCCTGCT CGTCGTAAAG TGTTAGATAG CCAAGTGTCA
ATGAATGCGG CACAAGCTAG AGAAGTGAGC TGGCTTGGCG GGAAAATTTT AGTGAACAAT
AGCGAACTTT ATCCCCGTTA TCGAGCAATG CAACCGATGC AACGTTGGGC TTATGAGCCT
AACCGCGATC AGGGGCTACC CAATACGGGC AGAGGCGGTT TCCCTCATTG TGATGAATGG
TGGGCAGACA GCCAAGTGGG GCTAAAAGAT ATGCTCCTTT CAGATATGCG ACAAAATTTA
TCGGTGAAAT TAGGGGAAAT GTTTACTAAT GCGAATATTC AAGATGAAGC ATTACTTCGC
ACTTTATTAC GCCCTGAAAA TATCAATATC TCCAGAGGGA AAGTATATGA GGGGTACGGT
GGAAACTTAA ATCCTACGGG TCTTAATCGC GTCACATCCA CGGTATCAGG TCTTGGTGTT
GCCGCGGGGA GCTTAGTTGC TTATCCGGGA TTTGATGCAA TGCGAAATTC ATTACCTATG
ATCCAAGCAG TACTTATTAT GGCGGTCATT ATTTTAACAC CGATTGTTAT TGTGTTTAGT
GGGTATTCTT TAAAAGCGAT TGTGACGTTG ATGTTTGTAC AGTTTGCATT AGTAACTACG
TCATTTTGGT GGGAGTTGGC TAGATGGTTA GATTCGTCTC TTTATACTAT CATGTACCAT
TCACCTAGTC ATACAGATAC AGATTCGTTC TGGAGTTTCC TGCGAAATGA CACCGATAAT
ATGATTATGA GTATTGTATT AGGCGTAATG TTTTTGATTT TACCGGGTGT TTGGGTTACA
GCAATGTCTT GGGCTGGTTT TAATGTGGGA GCGTTAGCTG ATAATTTTGC ACAAAGCTCA
CGACAAGTTC AAGAAAGCGG TTCTGATGGA ACAAAGATTA TTGCGAAAAC AATACCAAAA
TAA
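
The gene sequence is displayed in blocks of 10 bases, so the spacing has to be removed before any calculation such as GC content. The following is a minimal sketch of that cleanup, shown on the first display line only; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the database may compute its GC content field with different rounding.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for 10-base display blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First display line of the gene sequence, copied with its block spacing.
first_line = "ATGACCTTTA ATGTGGATAG TTATCTCGAA TATTTCCTGA CGTTGCTCGG CTGGATCATT"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(round(gc_percent(seq), 1))   # GC percentage of this line only
```

Applying the same two functions to the full sequence text gives a value that can be compared with the GC content field.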
 
Protein sequence
MTFNVDSYLE YFLTLLGWII NNGLFGLLVS TGLFIAPLIG MLIKTWLEVK KQGADEGNKG 
ELLIDWLSIQ FFPAMLVIVL TLAPMLPISL NNMAYNVEQS KHCGYKVPLA PEKTGYASMV
SEFAGKQAKV PLLWGLMHAV NKGILHGAVS TIPCKLDLRQ IRFEVQHEKI NNPALLTEVR
QFVQQCYLPA RRKVLDSQVS MNAAQAREVS WLGGKILVNN SELYPRYRAM QPMQRWAYEP
NRDQGLPNTG RGGFPHCDEW WADSQVGLKD MLLSDMRQNL SVKLGEMFTN ANIQDEALLR
TLLRPENINI SRGKVYEGYG GNLNPTGLNR VTSTVSGLGV AAGSLVAYPG FDAMRNSLPM
IQAVLIMAVI ILTPIVIVFS GYSLKAIVTL MFVQFALVTT SFWWELARWL DSSLYTIMYH
SPSHTDTDSF WSFLRNDTDN MIMSIVLGVM FLILPGVWVT AMSWAGFNVG ALADNFAQSS
RQVQESGSDG TKIIAKTIPK
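
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below illustrates this on the first 30 bases, using the amino acid assignments shared by the standard code and translation table 11 (the two differ only in permitted start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative, and this is not necessarily how the database generated the protein record.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spacing
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


print(translate("ATGACCTTTA ATGTGGATAG TTATCTCGAA"))  # MTFNVDSYLE
```

The output matches the first 10-residue block of the protein sequence.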