Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_0589 |
Symbol | |
ID | 4240073 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | - |
Start bp | 627781 |
End bp | 628941 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 638104139 |
Product | adhesin |
Protein accession | YP_718801 |
Protein GI | 113460734 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAAATTT TTTATTTATC TTTAACTGCG GTAGTTGTTT TAACTGCTCA TTCGGCTATG GCTCAAAATA CTTCTCCGAA TATGAAGTAT GCAAAAGAAG CTATTTATAT TGGTAAAAGT ATTACAGAAG TTCCTAACAA TATACCAAAT GCTAAAGCCG TTGCCGTTGG TGATAAAACA AAAGTCGGCG AGTCCGCTGT TGCTGTAGGT TATGAGGCAG ATGCTCATCT CGAAGGGGCT ACTGCAGTAG GGCGTGGAAC TAAAACTCAA GCTTATGCCG TAGCAATGGG CTACCAAGCA AATGCAGGTG TACAGGCGGT TTCTATAGGT AAAGATTCTA AAGCAACTGG AGTGCAATCA GTGGCATTAG GTGATCAGGC AAAAGCAACT GGTAAATTAT CCTCCGCATT TGGTCAAAAT GCTCAAGCAG CTGGTCATTA TAGTAATGCC TTTGGATGGC ATTCTAATGC ACGGAATGAA CTTGCTACCG CAATCGGCTA TAAGGCAAAT GCCACAGGAA AAAGATCTAT AGTCTTAGGT CCAGATTCAA CAGCCTCAGG CGAAGGTTCA TTAAGTTTTG GTAGTGGTGT AAAAACAACT GGAAATAATT CATTCTCAGT TGGTCGTAAT ATTACGAATA ACAGTACCAA AACAGTTGCT ATAGGTAACA ACATTCAACA TACAAAAGAT AATTCTGTAT TTCTAGGTGA TTCTTCTGCT TATACCGCAC CAAGCGAAAC TTCAGGCGGT ATCGGTAAAG TCGACGGCAA TTATGCTGGC GTTGATGCAA AAGGCGTCGT TTCCGTGGGC AGCAAAGGCA ATGAACGCCG TATTCAAAAT GTTGCCGCCG GTTTGCTTTC TCATCAATCA ACCGATGCTG TCAACGGGAG CCAATTACAC GCCACTAACC AACGTCTTGA AGAAGTCAAC AAAGACGCCA AAGCCGGTAT CGCTGCCGCG ATGGCTTTTA AAGACGTGCC TTTCGTCCCG GGTAAATGGT CTTATGCCGC CGGTGCCGCT CATTATAGCA GCGAAAGTGC GGTCTCTTTA AACCTCGGCA GAACTTCCAA TGATGGTAAA TGGGCTGTCT CCGGCGGTAT GTCCTCCGAC AGCCGTGGTC GCCTCGGTTT CCGTGTCGGC GTCAGCGGCG TGTTTAACTA A
|
Protein sequence | MKIFYLSLTA VVVLTAHSAM AQNTSPNMKY AKEAIYIGKS ITEVPNNIPN AKAVAVGDKT KVGESAVAVG YEADAHLEGA TAVGRGTKTQ AYAVAMGYQA NAGVQAVSIG KDSKATGVQS VALGDQAKAT GKLSSAFGQN AQAAGHYSNA FGWHSNARNE LATAIGYKAN ATGKRSIVLG PDSTASGEGS LSFGSGVKTT GNNSFSVGRN ITNNSTKTVA IGNNIQHTKD NSVFLGDSSA YTAPSETSGG IGKVDGNYAG VDAKGVVSVG SKGNERRIQN VAAGLLSHQS TDAVNGSQLH ATNQRLEEVN KDAKAGIAAA MAFKDVPFVP GKWSYAAGAA HYSSESAVSL NLGRTSNDGK WAVSGGMSSD SRGRLGFRVG VSGVFN
|
| |