Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_1035 |
Symbol | hagA |
ID | 4240533 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | - |
Start bp | 1141355 |
End bp | 1142422 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 638104596 |
Product | hemagglutinin antigen |
Protein accession | YP_719247 |
Protein GI | 113461178 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2885] Outer membrane protein and related peptidoglycan-associated (lipo)proteins [COG3637] Opacity protein and related surface antigens |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000658717 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTTAATT TCTTGATAAT TAATATAGAG GAAATTCAAA TGAAAAAAAC AATTATTGCA TTATCTATCG CAGGTTTTGT TGCTTCTGTG CAAGCGGCAC CTCAAGCAAA TACGTTCTAT GCCGGTGCGA AAGCCGGTTG GGCATCTTTC CATGACGGTC TTAATCAATT TGAAGATTCT TCACAAAAGA AAGGAACAGT TCGTAATTCA GTAGCTTATG GTATTTTTGG TGGTTACCAA ATTACCGATC ATGTCGCTGT AGAGTTGGGT TCTGAGTATT TCGGTCAAGC AAAAGGTCGT AAAGATAAAG ACGAAGCTAA ACATACCGCT CAAGGGATGC AATTAGGCTT AAAAGCAAGC TACCCTGTAT TAGAAGGCTT GGATATTTAC GGTCGTGTAG GTGCGGCGTT AATTCGTTCT AATTATCTTA ATGTTGAAAA ATTCGAAAGT GGTAAAGATG TAAAAAACAC TTTAAAAGTT TCTCCGGTTT TCGCAGCGGG TGTTGAATAC AGCCTACCTT CTTTACCGGA ATTGGCATTA CGTTTGGAAT ATCAATGGGT TAAAGGCGTT GGTAAAGCAC TTAAGAAAAG CAGTGGTGAA CGCTTAGATT ATACACCAAG TATCGGTGCG GTAACACTTG GCTTATCTTA CCGTTTCGGT CAAAAACCAG TTATGGCACC TGAAGTAGTA AACAAAGTGT TCAGCTTAAA TTCAGATGTG AATTTTGCCT TCGCTAAAGA TACATTAAAA CCTGAAGCTC AACAAACATT GGACGGTGTT TATGGTGAAA TCGCACAATT AAAAACCGCA CAAGTTTCTG TTGCCGGTTA TACAGACCGT ATCGGTTCTG ATGCGTCAAA CTTAAAATTA TCACAACGTC GTGCGGATAC TGTAGCAAAT TATTTAGTCT CTAAAGGTGT TGCTCAAGAT GCCATTAGTG CGGTAGGTTA CGGTGAAGCA AATCCGGTGA CCGGTGCGAA ATGTGATGCT GTTAAAGGTC GTAAAGCACT AATCGCCTGT TTAGCAGAAG ATCGTCGTGT TGAAATCTCA GTGAAAGGTA GCAAATAA
|
Protein sequence | MLNFLIINIE EIQMKKTIIA LSIAGFVASV QAAPQANTFY AGAKAGWASF HDGLNQFEDS SQKKGTVRNS VAYGIFGGYQ ITDHVAVELG SEYFGQAKGR KDKDEAKHTA QGMQLGLKAS YPVLEGLDIY GRVGAALIRS NYLNVEKFES GKDVKNTLKV SPVFAAGVEY SLPSLPELAL RLEYQWVKGV GKALKKSSGE RLDYTPSIGA VTLGLSYRFG QKPVMAPEVV NKVFSLNSDV NFAFAKDTLK PEAQQTLDGV YGEIAQLKTA QVSVAGYTDR IGSDASNLKL SQRRADTVAN YLVSKGVAQD AISAVGYGEA NPVTGAKCDA VKGRKALIAC LAEDRRVEIS VKGSK
|
| |