Gene HS_1430 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_1430 
SymbolpilB 
ID4240946 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp1616691 
End bp1618106 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content37% 
IMG OID638105008 
Productpilin/fimbriae biogenesis protein 
Protein accessionYP_719642 
Protein GI113461573 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAATTC CTCAATCTGA AGATATGTCT CCGATAGTGA TGTCGCTCGA TGGAGATACT 
TATGAAATAA CACCGCACTT ATGGCAAAGA AATCAGCAAC AATCCCAAAT TCTGCTACGC
TATTTTGCCA TTCCTTTACA GGAAAATGAA CAAACGTTAT GGTTAGGCGT TGATAGTCTA
AATAATATTA GTGCCTGTGA AACATTTGCA TTTTTGTATG GAAAAATTGT AGAACCGGTT
TTACTTAATA ACCAATTACT TAAACAGTTA CTACAAAACT TATCGCCACA GCAAAATAAT
ATGTTCGTTG AAGAGCAACC TATAAATCAA TATGTAGCCG AGCATCGTAT AGAAGTAACT
GAAAAGTCTG ATGAACCCGT GATTCAACTT TTAGATCATA TTTTTGAAAA TGCGTTAAAG
CAACATGTAT CAGATATTCA TATTGAACCG CAAATGAATT GCTTGCAAAT ACGTTTCAGA
ATAGACGGTA TTTTACAATG CCAATCGCCT CTTCCGCTTT CGCTGAGTAA GCGTATTCTT
TCACGTTTAA AGTTACTGGC CAAATTGGAT ATAAGCGAAA CACGGTTACC TCAAGATGGA
CGATTTCATT TTAAAACAAC ATTTTCTGAC ATCCTTGATT TTCGTTTATC AACCCTTCCG
ACGAATATGG GTGAAAAAGC CGTGTTGCGT TTACAGCAAA ATAAACCTGT ACAGTTGAGT
TTTGCGGAAC TCGGCATGAC AGAAAACCAG CAAAAATCAT TCAAACAGGC ACTATCTCAA
CCCCAAGGAC TCATTTTAGT GACCGGTCCA ACAGGGAGTG GAAAAAGTAT CTCACTTTAC
ACCGCACTTC AATGGCTGAA TGATAAGCAC AAACATATTA TGACAGCGGA AGATCCGATA
GAAATTGAAT TGAACGGTAT CATTCAATGT CAAATAAATC CGCAGATCGG GTTAGATTTT
AGCCGGCTGT TAAGAACTTT TCTTCGTCAA GATCCTGACA TTATTATGTT GGGTGAAATT
CGGGATAACG AAAGTGCAAT AATGGCATTA AGAGCAGCAC AAACCGGACA TCTTGTTCTC
TCAACTTTAC ATACAAATGA TGCTCCGTCG GCAGTCTCAC GCTTGCTTCA ATTAGGGGTA
AAGCAACATG AAATTGACAA TAGTTTATTA TTGGTCATCG CTCAACGTTT GGTACGAAAA
AAATGCCCAC ATACAGAAAA TGAAAATTGC ACTTGTCATC AGAAATATCA AGGGCGAATT
GGGGTTTATC AATTTTTACA ACCGCATCTA ATAGATAATC ATATTTGCTA CCAAACGGAT
TATGCTCATT TGCGTGAAAG TGCAATGGAA AAAGTGCGGT TAGAAATAAC AGATTTAGCA
GAAGTGGATA GAGTTATTGG ACAAAGTAAT GAATAA
 
Protein sequence
MIIPQSEDMS PIVMSLDGDT YEITPHLWQR NQQQSQILLR YFAIPLQENE QTLWLGVDSL 
NNISACETFA FLYGKIVEPV LLNNQLLKQL LQNLSPQQNN MFVEEQPINQ YVAEHRIEVT
EKSDEPVIQL LDHIFENALK QHVSDIHIEP QMNCLQIRFR IDGILQCQSP LPLSLSKRIL
SRLKLLAKLD ISETRLPQDG RFHFKTTFSD ILDFRLSTLP TNMGEKAVLR LQQNKPVQLS
FAELGMTENQ QKSFKQALSQ PQGLILVTGP TGSGKSISLY TALQWLNDKH KHIMTAEDPI
EIELNGIIQC QINPQIGLDF SRLLRTFLRQ DPDIIMLGEI RDNESAIMAL RAAQTGHLVL
STLHTNDAPS AVSRLLQLGV KQHEIDNSLL LVIAQRLVRK KCPHTENENC TCHQKYQGRI
GVYQFLQPHL IDNHICYQTD YAHLRESAME KVRLEITDLA EVDRVIGQSN E