Gene HS_0918 details

Gene Information

Locus tag: HS_0918
Symbol: kpsF
ID: 4240411
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haemophilus somnus 129PT
Kingdom: Bacteria
Replicon accession: NC_008309
Strand:
Start bp: 1012730
End bp: 1013695
Gene Length: 966 bp
Protein Length: 321 aa
Translation table: 11
GC content: 36%
IMG OID: 638104474
Product: polysialic acid capsule expression protein, KpsF/GutQ family protein
Protein accession: YP_719129
Protein GI: 113461061
COG category: [M] Cell wall/membrane/envelope biogenesis
              [T] Signal transduction mechanisms
COG ID: [COG0794] Predicted sugar phosphate isomerase involved in capsule formation
        [COG2905] Predicted signal-transduction protein containing cAMP-binding and CBS domains
TIGRFAM ID: [TIGR00393] KpsF/GutQ family protein
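
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the stop codon. Both assumptions are consistent with the values reported here, but neither is stated by the record itself. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1012730
END_BP = 1013695


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 966, matching "Gene Length"
print(protein_length(length_bp))  # 321, matching "Protein Length"
```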


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGCAC TTTTCAATGA GGAAATGACG ATGAATTACT TACAAATCGC TCGAAATTCG 
TTAGCTGCCG AGCAAAATGC TTTAGCCAAA CTTAGTCAAA ATTTAAATCA ACAGTTTAAT
CAGGTTGTTG AGCTTATTTT AAATTGTGAA GGACGATTAG TTGTTGGCGG AATCGGGAAA
TCCGGACTTA TTGGTAAAAA AATGGTTGCT ACATTTGCCT CAACAGGAAC ACCAAGTTTT
TTTCTACATC CAACAGAGGC TTTTCATGGT GATTTGGGCA TGTTAAAGCC TATTGACATT
GTGATGTTAA TCTCTTATAG CGGTGAAAGT GATGATGTTA ATAAATTGAT TCCCAGCTTA
AAAAATTTTG GCAATAAAAT CATTGCATTA ACAGGTAACC TAAATTCTAC TTTAGCAAAA
CATGCCGACT ATATCCTTGA TATCAGTGTT GAGCGTGAAG CTTGTCCCAA TAATCTGGCT
CCAACAACTT CGGCTTTAGT AACGTTAGCT TTAGGCGACG CTCTTGCGGT TTCTTTAATT
ACAGCTAGAA ACTTTCAACC AGCTGATTTT GCCAAATTTC ATCCAGGCGG TAGTCTTGGT
CGTCGTTTGT TATGTAGAGT GAAAGATCAA ATGCAAGTTC GTTTACCAAA AGTAACAGAA
AACACAAACT TTACTGATTG TTTAACCGTT ATGAATGAAG GTCGTATGGG GGTTGCTCTT
GTCATGGAAA ATGAAAATTT AAAAGGTATT ATTACCGATG GCGATATTCG CCGTGCATTA
AGTGCAAACG GAACTAATAC ACTTAACAAA ACAGCCAAAG ATCTTATGAC TTCCAATCCT
AAAACTATTA ACTATAATAC TTATCTGTCT GAAGCGGAAA ACTTTATGAA AGAGAAAAAA
ATTCATTCAT TAGTCGTTGT AGATGATCAG AATAAAGTGA TAGGTTTAGT TGAATTTTCG
AGTTAA
 
Protein sequence
MTALFNEEMT MNYLQIARNS LAAEQNALAK LSQNLNQQFN QVVELILNCE GRLVVGGIGK 
SGLIGKKMVA TFASTGTPSF FLHPTEAFHG DLGMLKPIDI VMLISYSGES DDVNKLIPSL
KNFGNKIIAL TGNLNSTLAK HADYILDISV EREACPNNLA PTTSALVTLA LGDALAVSLI
TARNFQPADF AKFHPGGSLG RRLLCRVKDQ MQVRLPKVTE NTNFTDCLTV MNEGRMGVAL
VMENENLKGI ITDGDIRRAL SANGTNTLNK TAKDLMTSNP KTINYNTYLS EAENFMKEKK
IHSLVVVDDQ NKVIGLVEFS S
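
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how that could be done, along with a GC-fraction calculation and a basic CDS sanity check. The helper names are hypothetical, and the check accepts only ATG as a start codon, which is a simplification (translation table 11 also permits alternative start codons).

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by removing spaces and line breaks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Simplified check: length divisible by 3, ATG start, standard stop codon."""
    stop_codons = ("TAA", "TAG", "TGA")
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in stop_codons


# First line of the gene sequence as printed in this record.
first_line = "ATGACCGCAC TTTTCAATGA GGAAATGACG ATGAATTACT TACAAATCGC TCGAAATTCG"
fragment = clean_sequence(first_line)
print(len(fragment))          # 60
print(gc_fraction(fragment))  # GC fraction of this fragment only
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` would give a value to compare against the GC content field reported in the record.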