Gene HS_1072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_1072 
Symbol 
ID4240571 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp1188767 
End bp1190194 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content31% 
IMG OID638104633 
Producthypothetical protein 
Protein accessionYP_719284 
Protein GI113461215 
COG category[S] Function unknown 
COG ID[COG2989] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAAAA TTAAAGTTAT CTTATTAATC TTATTATTAG GCACTACGAC TCATATTTAT 
GCACAACAAA AGTTTAATTC TATAGATGAG ATTAATAATA TTACTTTAAG TAAATCACAA
TTATTGTTTG AATTAGATTT ATTGGAACAA CAACTATCCG AGGAATATCA GCATAGATTA
TATGAACAAT TACATTCTTT GCTCAGAAAT ATAGATTTAC AATTTAAAAC TACAATTGGT
CGTATTTATG TTGAAAATGA CTATGCATTG CTTTGGGAAG ATAAGCAAGC TGAAAAAATG
TTTTTACGTG AATATGCCGC AATAGTTGCA AGCGGAGTCT CTGAAAGAGC GGCAAGATTA
TTGAATGATA TTTATAAAGC ATCTGAAATG GGAGGGTTAG TTTATGATAT GTTGCTGACC
GACGCATTTT TGGATTATAT GTATTATTCA AAAAATGCAA AAAATTTTGC ACAACAGTGG
TTCTACTCAG CAAATAGCTA CAAAGCTCAA TTGCCATCAA AACAAGATAT TCAACGATGG
CTATCATCAA TAAAACATAA TGAAAATTTG ATGTTCATCG AACAATTAGC TCAACATAAT
GAGCAATATG AGAAAATTAT AACTTACCTG AGTAAATTAA TACCCCAAGA TGACAAGTCA
ATATTATACA AATTAGCAAT TAATGCTCAG CGTTTAAGGA TTATTCCCGA TTTTAACAAC
GGTATTTTTG TTAATATTCC AAGTTATCAA TTAAATTATT ATAGAGATGG AAAATTAGTT
TTAAACTCTA AAGTCATTGT CGGTAAAAAA GCACGTAAAA CACCGGTAAT GTATAGTAAG
CTAAGTAATA TTGTTGTTAA TCCACCTTGG ATACCTACTC CTCGTTTAAT TAATGAAGAT
ATCGTGCCAA AGATTAAACT TGATCCGGAT TATGTTGCTC GTAATAGTTA TACCATAAGT
GATAGCAAGG GGCAGGTTAT AGATCCTTCA TCAATAGATT GGAATACTAT AGGCACTAAT
TTTCCCTATC GAATCCGCCA AGCTCCGGGA GGAAGTGCAT TAGGAAACTA TAAATTTAAT
ATGCCCAGTT CAGATGCAAT TTATTTACAC GATACACCCA ATAGAGGATT ATTTAGCAAA
AAAAATAGAG CATTAAGTTC AGGTTGTGTT CGTGTAGAAA AGTCAGATCA ACTGGCGACA
ATTTTATTAA CAGAAGCCGG TTGGACAGAA GAACGTAAGC AAAACGTGCT TAATAGTAAA
AAAAATACTT CGGAGAATAT TCGCTCCGAT AATCCTGTAT ATTTATATTA TGTTACTACT
TGGGTTGAAA ACGATGTCGT GAAAACATTA CCTGATATTT ATGAGTATGA TCAAGTACCT
CATTTAACTT ATATTAACTG GAATATTATT AAATGGTATC TAAATTAA
 
Protein sequence
MRKIKVILLI LLLGTTTHIY AQQKFNSIDE INNITLSKSQ LLFELDLLEQ QLSEEYQHRL 
YEQLHSLLRN IDLQFKTTIG RIYVENDYAL LWEDKQAEKM FLREYAAIVA SGVSERAARL
LNDIYKASEM GGLVYDMLLT DAFLDYMYYS KNAKNFAQQW FYSANSYKAQ LPSKQDIQRW
LSSIKHNENL MFIEQLAQHN EQYEKIITYL SKLIPQDDKS ILYKLAINAQ RLRIIPDFNN
GIFVNIPSYQ LNYYRDGKLV LNSKVIVGKK ARKTPVMYSK LSNIVVNPPW IPTPRLINED
IVPKIKLDPD YVARNSYTIS DSKGQVIDPS SIDWNTIGTN FPYRIRQAPG GSALGNYKFN
MPSSDAIYLH DTPNRGLFSK KNRALSSGCV RVEKSDQLAT ILLTEAGWTE ERKQNVLNSK
KNTSENIRSD NPVYLYYVTT WVENDVVKTL PDIYEYDQVP HLTYINWNII KWYLN