Gene HS_1035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_1035 
SymbolhagA 
ID4240533 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp1141355 
End bp1142422 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content40% 
IMG OID638104596 
Producthemagglutinin antigen 
Protein accessionYP_719247 
Protein GI113461178 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2885] Outer membrane protein and related peptidoglycan-associated (lipo)proteins
[COG3637] Opacity protein and related surface antigens 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000658717 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTTAATT TCTTGATAAT TAATATAGAG GAAATTCAAA TGAAAAAAAC AATTATTGCA 
TTATCTATCG CAGGTTTTGT TGCTTCTGTG CAAGCGGCAC CTCAAGCAAA TACGTTCTAT
GCCGGTGCGA AAGCCGGTTG GGCATCTTTC CATGACGGTC TTAATCAATT TGAAGATTCT
TCACAAAAGA AAGGAACAGT TCGTAATTCA GTAGCTTATG GTATTTTTGG TGGTTACCAA
ATTACCGATC ATGTCGCTGT AGAGTTGGGT TCTGAGTATT TCGGTCAAGC AAAAGGTCGT
AAAGATAAAG ACGAAGCTAA ACATACCGCT CAAGGGATGC AATTAGGCTT AAAAGCAAGC
TACCCTGTAT TAGAAGGCTT GGATATTTAC GGTCGTGTAG GTGCGGCGTT AATTCGTTCT
AATTATCTTA ATGTTGAAAA ATTCGAAAGT GGTAAAGATG TAAAAAACAC TTTAAAAGTT
TCTCCGGTTT TCGCAGCGGG TGTTGAATAC AGCCTACCTT CTTTACCGGA ATTGGCATTA
CGTTTGGAAT ATCAATGGGT TAAAGGCGTT GGTAAAGCAC TTAAGAAAAG CAGTGGTGAA
CGCTTAGATT ATACACCAAG TATCGGTGCG GTAACACTTG GCTTATCTTA CCGTTTCGGT
CAAAAACCAG TTATGGCACC TGAAGTAGTA AACAAAGTGT TCAGCTTAAA TTCAGATGTG
AATTTTGCCT TCGCTAAAGA TACATTAAAA CCTGAAGCTC AACAAACATT GGACGGTGTT
TATGGTGAAA TCGCACAATT AAAAACCGCA CAAGTTTCTG TTGCCGGTTA TACAGACCGT
ATCGGTTCTG ATGCGTCAAA CTTAAAATTA TCACAACGTC GTGCGGATAC TGTAGCAAAT
TATTTAGTCT CTAAAGGTGT TGCTCAAGAT GCCATTAGTG CGGTAGGTTA CGGTGAAGCA
AATCCGGTGA CCGGTGCGAA ATGTGATGCT GTTAAAGGTC GTAAAGCACT AATCGCCTGT
TTAGCAGAAG ATCGTCGTGT TGAAATCTCA GTGAAAGGTA GCAAATAA
 
Protein sequence
MLNFLIINIE EIQMKKTIIA LSIAGFVASV QAAPQANTFY AGAKAGWASF HDGLNQFEDS 
SQKKGTVRNS VAYGIFGGYQ ITDHVAVELG SEYFGQAKGR KDKDEAKHTA QGMQLGLKAS
YPVLEGLDIY GRVGAALIRS NYLNVEKFES GKDVKNTLKV SPVFAAGVEY SLPSLPELAL
RLEYQWVKGV GKALKKSSGE RLDYTPSIGA VTLGLSYRFG QKPVMAPEVV NKVFSLNSDV
NFAFAKDTLK PEAQQTLDGV YGEIAQLKTA QVSVAGYTDR IGSDASNLKL SQRRADTVAN
YLVSKGVAQD AISAVGYGEA NPVTGAKCDA VKGRKALIAC LAEDRRVEIS VKGSK