Gene HS_0226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_0226 
SymbolrbsR 
ID4239742 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp225762 
End bp226772 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content40% 
IMG OID638103763 
ProductLacI family transcription regulator 
Protein accessionYP_718434 
Protein GI113460372 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTACAA TGAAAGATAT TGCTCGAATT GCACAGGTAT CAACGTCCAC TGTTTCTCAT 
GTGATTAATA ATTCACGTTT TGTCAGTGAA GAAATTCGGC AGAAAGTCAT GGCGGTAGTG
GAGCAACTGA ATTATACGCC CTCAGCATTG GCTCGCAGTT TGAAAGTCAA AGAAACTAAA
ACAATCGGAA TGCTGGTAAC TACGAGTGAT AATCCTTTTT TTGCTGAGGT TGTGGCGAGT
GTTGAACGTT ATTGTCGCCA ACATCATTAT CATTTGATCT TATGCAATAC CAATGATGAC
AGTATTTGTC TGCAAGAAAA TCTGCAAAAT TTGATACGCA AGCAGGTCGA TGGATTGCTC
TTGATGTGTA CGGATAGTGC CTTTGAAACC AGTCCTTTAA ATTTAACTGT TCCTACTGTC
ATCATGGATT GGTGGCCGAC AGAGCTAAGT GCGGATAAAA TTTTTGAGGA TTCTGAGCAG
GGCGGATATT TGGCGACACG GACTCTTATT AAGCATCAAC ATCATGATAT TGCGATTATT
ACGGGAAATC TGAAAAAGCC TCTTGCTCGT AATCGTTTGG AAGGCTATAA GCAAGCGTTG
CAAGAATACA ACATTCCTAT TCGTGATAAA TGGATTATTG AAAGTCATTT TAACTTTGAA
GGTGGAGTTA AGGGAATGGA ACAGTTGTTA CAATTGAAAC AGCGTCCAAG TGCGGTGTTT
GCTTGTAGTG ATACCATTGC AGTGGGTGCT TATCAAGCTG TTTGGCGTCA TGGGTTATCT
GTGCCTGAAG ATATTTCTAT TATTGGTTAC GATAATATTC ATCTAGCACA ATATCTTTCT
CCTCCACTAA GCACCGTTCA TCAACCTAAA GATGAATTCG CACACTTGGC AGTAGATACA
TTGTTGCAAC GAATAAAAAA TCCTACCGAA AGTTACCGCA CGTTGACATT AAAACCTGAG
ATAGTATTGC GTCAATCTAT TTGTTCCTGT TTTTTGCCTA AGTCCGCATA G
 
Protein sequence
MATMKDIARI AQVSTSTVSH VINNSRFVSE EIRQKVMAVV EQLNYTPSAL ARSLKVKETK 
TIGMLVTTSD NPFFAEVVAS VERYCRQHHY HLILCNTNDD SICLQENLQN LIRKQVDGLL
LMCTDSAFET SPLNLTVPTV IMDWWPTELS ADKIFEDSEQ GGYLATRTLI KHQHHDIAII
TGNLKKPLAR NRLEGYKQAL QEYNIPIRDK WIIESHFNFE GGVKGMEQLL QLKQRPSAVF
ACSDTIAVGA YQAVWRHGLS VPEDISIIGY DNIHLAQYLS PPLSTVHQPK DEFAHLAVDT
LLQRIKNPTE SYRTLTLKPE IVLRQSICSC FLPKSA