Gene Information |
Locus tag | HS_1271 |
Symbol | |
ID | 4240782 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | + |
Start bp | 1456960 |
End bp | 1458546 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 638104844 |
Product | ABC transporter, periplasmic binding protein |
Protein accession | YP_719483 |
Protein GI | 113461414 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
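
The coordinate and length fields above are linked by simple arithmetic. The sketch below checks that relationship, assuming the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon. The record does not state either convention, but both are consistent with the listed values. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1456960
end_bp = 1458546

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the coding sequence ends with a single stop codon,
# which is not counted as an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the script reproduces the listed gene length of 1587 bp and protein length of 528 aa, with no leftover bases.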
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000259921 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGACAAATT ACATTGAAAA TGAATCTCGC CGTAATTTCA TGAAAGTATT GGCTGGTGTC GGTGCAGGAA TTGCCTTTAG CGGTACATTA GGTAGTTTTG TTTCTAAGGC GAAGGCCACT TCAACAACAG GTAGTAGCAT TGAAGCAGGA ATAGCTTATC CCCTATCAAC CGGTTTTGAT CCGTCCACTT CCAGTGGAGC CTCTTCCTTT GCCGCCAACA TGCACTTTTT TGAAGGATTA GTGGATTTAC ACCCTGCCAC CCGTGAACCT TATCTCGCCT TAGCTGCAAA AGAACCTGAA CAAGTTGATG ATATAACATG GCGTGTTACA TTACGTGACG GTGCGACTTT CCACGATGGT ACACCTGTTA CCACGGAGGA TATAGTATAT TCTTATCAGC GTATTTTAGA TCCAAAAAAT GCTTCACTAT TTATACAATT CATTCCGTTT ATTGATTCAG TCAACGCTTT AGATGATAAA GTGGTAGAAT TTAAGTTGAA GTACCCGTTT TCTCTTTTTA AATTGCGATT AGGTATTGTA AAAATTGTAC CTAAACATGT GATTGAGGCT GTAGGACAAA CGGTATTTGA TGCAAATCCG GTAGGATCCG GACCATATAA ATTTGTTTCA GCCGTAAAAG ACGATCGCAT CGTATTTGCC GCACACAATG CTTACAACGG CCCATATCCT GCTCGTGTAG AAAAAATGAC ATGGTTTTTA CTATCCGATG ATGCCGCTCG CACAGCGGCA CAAGAGTCCG GACGTACTCA AGCAATGGAA AGTGTTCCTT ATTTAGACGT ATCTCGCTTA AAACGTAAAA GTGCGGTTGA ATCCGTGCAA TCTTTCGGGT TGTTATTCTT AATGTTTAAT TGTAAGAAAG CCCCATTTAA CAATCCCAAA GTACGTCAAG CTTTACACTA TGCACTGGAT ACACAAAAAT TAATTGATAT TGTATTCTTA GGTAATGCAA AAGCCGCCAC TTCTTACACT CAAGATACGC ATCCTGATTA TGTCAAGGCT TCAACCCAGT ATGATTATAA TCCTGAAAAA GCCACCGCGC TTTTAAAAGA AGCAGGTCTG AATAAATTAG AGTTTCAATT GCTTTCTACA GATCATTCTT GGATAAAAGA ATGTGCCCCT TTAGTTTTAG AATCTTGGAA TAAAATTCCG GGTGTGAAAG TAACTTTACA GCATCTACAA TCTGGAGCAT TGTATGGTGG TTACGTAGAT AAAGGAAATT ATGAAGTCGT AATGGCACCG GGGGATCCAT CAGTATTTAG CAACGATTTG GATCTCTTAT TGAGCTGGTG GTATCGTGGT GATGTTTGGC CGAAAAAACG CTTCGGTTGG TCAGATACCC CTGAATATGC GAAATTGCAA TTATTACTTG ATGATGCAAT TAAAGCAAAA CAGCCATCAG ATGCAAAAGC GGCTTGGACT CAGGCTATTA ATCTTATTGC TGAACAAGTA CCGCTTTATC CAATTTTGCA CCGTAAATTA CCAACCGCTT GGAATAACAA ATCACTAGAT GGTTTCCAAC CAATTCCAAC TACCGGACTT TCATTTATTG GTGTAGGTCG CAAATAA
Protein sequence | MTNYIENESR RNFMKVLAGV GAGIAFSGTL GSFVSKAKAT STTGSSIEAG IAYPLSTGFD PSTSSGASSF AANMHFFEGL VDLHPATREP YLALAAKEPE QVDDITWRVT LRDGATFHDG TPVTTEDIVY SYQRILDPKN ASLFIQFIPF IDSVNALDDK VVEFKLKYPF SLFKLRLGIV KIVPKHVIEA VGQTVFDANP VGSGPYKFVS AVKDDRIVFA AHNAYNGPYP ARVEKMTWFL LSDDAARTAA QESGRTQAME SVPYLDVSRL KRKSAVESVQ SFGLLFLMFN CKKAPFNNPK VRQALHYALD TQKLIDIVFL GNAKAATSYT QDTHPDYVKA STQYDYNPEK ATALLKEAGL NKLEFQLLST DHSWIKECAP LVLESWNKIP GVKVTLQHLQ SGALYGGYVD KGNYEVVMAP GDPSVFSNDL DLLLSWWYRG DVWPKKRFGW SDTPEYAKLQ LLLDDAIKAK QPSDAKAAWT QAINLIAEQV PLYPILHRKL PTAWNNKSLD GFQPIPTTGL SFIGVGRK
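
Both sequences are printed in space-separated blocks of ten characters, so the whitespace has to be stripped before any processing. The sketch below shows one possible way to do that and to translate the gene sequence into protein. It relies on the fact that translation table 11 uses the same codon assignments as the standard genetic code (the two differ in permitted start codons). The helper names `clean` and `translate` are hypothetical, and only the first three and last three blocks of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Compact codon table: codons are enumerated in T, C, A, G order,
# with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon.

    Raises KeyError on any codon containing a character other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three and last three blocks of the gene sequence above.
head = "ATGACAAATT ACATTGAAAA TGAATCTCGC"
tail = "TCATTTATTG GTGTAGGTCG CAAATAA"

print(translate(head))
print(translate(tail))
```

The two fragments translate to `MTNYIENESR` and `SFIGVGRK`, which match the first ten and last eight residues of the protein sequence listed above.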