Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_1421 |
Symbol | |
ID | 4240936 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | + |
Start bp | 1606354 |
End bp | 1607412 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 638104998 |
Product | hypothetical protein |
Protein accession | YP_719633 |
Protein GI | 113461564 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000365122 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGACC AAGTTCAAAA AAAATACGAA CTTCTTCAAA ATGATACTAT TCAACAGAAT GGTAAAACAC TCTACCGAAT TAAAGCATTA ATCTCTTTTG GTGATATAAA GGCGGGTAAG CTGGGCGGTT ATATTGAGAA AGAGGAAAAT CTAAGTCACG AGGGAAATGC TTGGGTGTCC GACAATGCTA AAGTGTTTGG CAATGCTAGA GTGTATGGCA ATGCTGAAGT GTTTGGCAAT GCTAGAGTGT ATGGCAAGGC TAGAGTGTAT GACAATGCTA GAGTGTATGA CGATGCTGAA GTGTTTGGCA TTGCTGAAGT GTATGGCATT GCTGAAGTGT GTGAAAATGC TATAGTGTAT GACAATGCTA GAGTGTATGG CAATGCTGAA GTGTTTGGCA ATGCTAGAGT GTATGGCAAG GCTAGAGTGT ATGACTATGC TATAGTGTGT GACACTGCTG AAGTGTTTGG CAATGCTAGA GTGTATGGCA AGGCTAGAGT GTATGACTAT GCTATAGTGT GTGACACTGC TGAAGTGTTT GGCAAGGCTA GAGTGTATGA CTATGCTATA GTGTGTGACA CTGCTGAAGT GTTTGGCAAT GCTAGAGTGT ATGGCAAGGC TAGAGTGTAT GACTATGCTA TAGTGTGTGA CACTGCTGAA GTGTTTGGCA AGGCTAGAGT GTATGGCAAG GCTAGAGTGT ATGACTATGC TATAGTGTGT GACACTGCTG AAGTGTTTGG CAATGCTAGA GTGTGTGGCA AGGCTAAAGT GTTTGGCAAT GCTAGAGTGT GTGACACTGC TTTGGTGTGT AGAAGTGACT TTATATGTAA AAATGCATTT ATCTCAAAAG AAAGTGATGT CTTTTCCGCA AGTTATGTAG GAAGGGAAAA CGGCGTATTA ACTGTGTATA AAACCGAAAA TGAATTGTAT GCAACACGAG GCTGTTTTGT TGGTCCGGTG GAGGAGTTTT TGCAACAATC CGCCAAGGTC CATGATGAGA AAACTCATCG GGAATATCAG CTGCTGATTG AGGTAGCGAG AAGTCGCATT CTTAAATAA
|
Protein sequence | MKDQVQKKYE LLQNDTIQQN GKTLYRIKAL ISFGDIKAGK LGGYIEKEEN LSHEGNAWVS DNAKVFGNAR VYGNAEVFGN ARVYGKARVY DNARVYDDAE VFGIAEVYGI AEVCENAIVY DNARVYGNAE VFGNARVYGK ARVYDYAIVC DTAEVFGNAR VYGKARVYDY AIVCDTAEVF GKARVYDYAI VCDTAEVFGN ARVYGKARVY DYAIVCDTAE VFGKARVYGK ARVYDYAIVC DTAEVFGNAR VCGKAKVFGN ARVCDTALVC RSDFICKNAF ISKESDVFSA SYVGRENGVL TVYKTENELY ATRGCFVGPV EEFLQQSAKV HDEKTHREYQ LLIEVARSRI LK
|
| |