Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_0918 |
Symbol | kpsF |
ID | 4240411 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | + |
Start bp | 1012730 |
End bp | 1013695 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 638104474 |
Product | polysialic acid capsule expression protein, KpsF/GutQ family protein |
Protein accession | YP_719129 |
Protein GI | 113461061 |
COG category | [M] Cell wall/membrane/envelope biogenesis [T] Signal transduction mechanisms |
COG ID | [COG0794] Predicted sugar phosphate isomerase involved in capsule formation [COG2905] Predicted signal-transduction protein containing cAMP-binding and CBS domains |
TIGRFAM ID | [TIGR00393] KpsF/GutQ family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGCAC TTTTCAATGA GGAAATGACG ATGAATTACT TACAAATCGC TCGAAATTCG TTAGCTGCCG AGCAAAATGC TTTAGCCAAA CTTAGTCAAA ATTTAAATCA ACAGTTTAAT CAGGTTGTTG AGCTTATTTT AAATTGTGAA GGACGATTAG TTGTTGGCGG AATCGGGAAA TCCGGACTTA TTGGTAAAAA AATGGTTGCT ACATTTGCCT CAACAGGAAC ACCAAGTTTT TTTCTACATC CAACAGAGGC TTTTCATGGT GATTTGGGCA TGTTAAAGCC TATTGACATT GTGATGTTAA TCTCTTATAG CGGTGAAAGT GATGATGTTA ATAAATTGAT TCCCAGCTTA AAAAATTTTG GCAATAAAAT CATTGCATTA ACAGGTAACC TAAATTCTAC TTTAGCAAAA CATGCCGACT ATATCCTTGA TATCAGTGTT GAGCGTGAAG CTTGTCCCAA TAATCTGGCT CCAACAACTT CGGCTTTAGT AACGTTAGCT TTAGGCGACG CTCTTGCGGT TTCTTTAATT ACAGCTAGAA ACTTTCAACC AGCTGATTTT GCCAAATTTC ATCCAGGCGG TAGTCTTGGT CGTCGTTTGT TATGTAGAGT GAAAGATCAA ATGCAAGTTC GTTTACCAAA AGTAACAGAA AACACAAACT TTACTGATTG TTTAACCGTT ATGAATGAAG GTCGTATGGG GGTTGCTCTT GTCATGGAAA ATGAAAATTT AAAAGGTATT ATTACCGATG GCGATATTCG CCGTGCATTA AGTGCAAACG GAACTAATAC ACTTAACAAA ACAGCCAAAG ATCTTATGAC TTCCAATCCT AAAACTATTA ACTATAATAC TTATCTGTCT GAAGCGGAAA ACTTTATGAA AGAGAAAAAA ATTCATTCAT TAGTCGTTGT AGATGATCAG AATAAAGTGA TAGGTTTAGT TGAATTTTCG AGTTAA
|
Protein sequence | MTALFNEEMT MNYLQIARNS LAAEQNALAK LSQNLNQQFN QVVELILNCE GRLVVGGIGK SGLIGKKMVA TFASTGTPSF FLHPTEAFHG DLGMLKPIDI VMLISYSGES DDVNKLIPSL KNFGNKIIAL TGNLNSTLAK HADYILDISV EREACPNNLA PTTSALVTLA LGDALAVSLI TARNFQPADF AKFHPGGSLG RRLLCRVKDQ MQVRLPKVTE NTNFTDCLTV MNEGRMGVAL VMENENLKGI ITDGDIRRAL SANGTNTLNK TAKDLMTSNP KTINYNTYLS EAENFMKEKK IHSLVVVDDQ NKVIGLVEFS S
|
| |