Gene HS_0589 details

Gene Information

Locus tag: HS_0589
Symbol:
ID: 4240073
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haemophilus somnus 129PT
Kingdom: Bacteria
Replicon accession: NC_008309
Strand:
Start bp: 627781
End bp: 628941
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 45%
IMG OID: 638104139
Product: adhesin
Protein accession: YP_718801
Protein GI: 113460734
COG category:
COG ID:
TIGRFAM ID:
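
The coordinates and lengths listed above are mutually consistent, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(627781, 628941)
print(length, protein_length(length))  # 1161 386
```

Both values match the Gene Length and Protein Length fields of this record.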


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAAATTT TTTATTTATC TTTAACTGCG GTAGTTGTTT TAACTGCTCA TTCGGCTATG 
GCTCAAAATA CTTCTCCGAA TATGAAGTAT GCAAAAGAAG CTATTTATAT TGGTAAAAGT
ATTACAGAAG TTCCTAACAA TATACCAAAT GCTAAAGCCG TTGCCGTTGG TGATAAAACA
AAAGTCGGCG AGTCCGCTGT TGCTGTAGGT TATGAGGCAG ATGCTCATCT CGAAGGGGCT
ACTGCAGTAG GGCGTGGAAC TAAAACTCAA GCTTATGCCG TAGCAATGGG CTACCAAGCA
AATGCAGGTG TACAGGCGGT TTCTATAGGT AAAGATTCTA AAGCAACTGG AGTGCAATCA
GTGGCATTAG GTGATCAGGC AAAAGCAACT GGTAAATTAT CCTCCGCATT TGGTCAAAAT
GCTCAAGCAG CTGGTCATTA TAGTAATGCC TTTGGATGGC ATTCTAATGC ACGGAATGAA
CTTGCTACCG CAATCGGCTA TAAGGCAAAT GCCACAGGAA AAAGATCTAT AGTCTTAGGT
CCAGATTCAA CAGCCTCAGG CGAAGGTTCA TTAAGTTTTG GTAGTGGTGT AAAAACAACT
GGAAATAATT CATTCTCAGT TGGTCGTAAT ATTACGAATA ACAGTACCAA AACAGTTGCT
ATAGGTAACA ACATTCAACA TACAAAAGAT AATTCTGTAT TTCTAGGTGA TTCTTCTGCT
TATACCGCAC CAAGCGAAAC TTCAGGCGGT ATCGGTAAAG TCGACGGCAA TTATGCTGGC
GTTGATGCAA AAGGCGTCGT TTCCGTGGGC AGCAAAGGCA ATGAACGCCG TATTCAAAAT
GTTGCCGCCG GTTTGCTTTC TCATCAATCA ACCGATGCTG TCAACGGGAG CCAATTACAC
GCCACTAACC AACGTCTTGA AGAAGTCAAC AAAGACGCCA AAGCCGGTAT CGCTGCCGCG
ATGGCTTTTA AAGACGTGCC TTTCGTCCCG GGTAAATGGT CTTATGCCGC CGGTGCCGCT
CATTATAGCA GCGAAAGTGC GGTCTCTTTA AACCTCGGCA GAACTTCCAA TGATGGTAAA
TGGGCTGTCT CCGGCGGTAT GTCCTCCGAC AGCCGTGGTC GCCTCGGTTT CCGTGTCGGC
GTCAGCGGCG TGTTTAACTA A
 
Protein sequence
MKIFYLSLTA VVVLTAHSAM AQNTSPNMKY AKEAIYIGKS ITEVPNNIPN AKAVAVGDKT 
KVGESAVAVG YEADAHLEGA TAVGRGTKTQ AYAVAMGYQA NAGVQAVSIG KDSKATGVQS
VALGDQAKAT GKLSSAFGQN AQAAGHYSNA FGWHSNARNE LATAIGYKAN ATGKRSIVLG
PDSTASGEGS LSFGSGVKTT GNNSFSVGRN ITNNSTKTVA IGNNIQHTKD NSVFLGDSSA
YTAPSETSGG IGKVDGNYAG VDAKGVVSVG SKGNERRIQN VAAGLLSHQS TDAVNGSQLH
ATNQRLEEVN KDAKAGIAAA MAFKDVPFVP GKWSYAAGAA HYSSESAVSL NLGRTSNDGK
WAVSGGMSSD SRGRLGFRVG VSGVFN
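
The protein sequence follows from the gene sequence under translation table 11 (the bacterial, archaeal and plant plastid code), in which an alternative start codon such as GTG is read as methionine at the first position. The sketch below is a minimal stand-alone illustration and not the database's own pipeline. The helper names `clean`, `translate` and `gc_percent` are hypothetical, and the input is assumed to be a complete, in-frame coding sequence or an in-frame prefix of one.

```python
# Codon table built from the compact base-order encoding used for the
# genetic codes. Table 11 shares its amino-acid assignments with the
# standard code and differs in the set of permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = clean(cds)
    if not cds or len(cds) % 3 != 0:
        raise ValueError("sequence length must be a non-zero multiple of 3")
    codons = [cds[i:i + 3] for i in range(0, len(cds), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]  # any valid start codon initiates with methionine
    for codon in codons[1:]:
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 nt of the gene sequence shown above.
print(translate("GTGAAAATTT TTTATTTATC TTTAACTGCG"))  # MKIFYLSLTA
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the GC content field.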