Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_0022 |
Symbol | |
ID | 4239530 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | + |
Start bp | 23475 |
End bp | 24458 |
Gene Length | 984 bp |
Protein Length | 327 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 638103553 |
Product | dicarboxylate-binding protein |
Protein accession | YP_718228 |
Protein GI | 113460171 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component |
TIGRFAM ID | [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00664942 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTCA AAAAATTAAC TAAAGTATTA GTTGTTGCAG GCTTAGTATT TAGTAGTTCG GCATTTGCTA AAACTGTAAT TAAAATCGGA CATTTTAATT CCGATATGCA CCCGTCAAAT ATTGTATTAA ACGAAGTTTT CAAGAAAACC GTTGAAGAAA AAACGCAAGG TCGCTATGAA ATTCGTATTT TCCCAAATAA TCAATTGGGC GGCGAAGATC AAATCGTAAA TGGCTTACGC AATGGTACGA TTGAAGGTGG AATTACCGGC TTATTATTAC AAAATGTGGA TCCTATTTTT GGTGTGTGGG AAATGCCGTA TTTATTTAAA GACAATGTTG AAGCGAAAAA GGTATTGGAA TCTCCGATTG CAAAAGAAAT TGGCGATAAA ATGGAACAAT ACGGTATTAA ATTATTAGCT TACGGCATGA ACGGTTTTCG TGTGATTTCG TCCAATAAAA AATTGGAAAA ATTCGATGAT TTCAAAGGAT TACGTTTAAG AGTGCCATTG AATTCCTTGT TTGTGGATTG GGCAAAAGCA ATGAATATTA ACCCGCAAAG CATGCCTTTA AGTGAAGTCT TTACTGCATT AGAGCAAAAA GTGATTGATG GTCAAGAAAA TCCATATATG TTGCTCAAAG ATTCCGGTTT GTATGAAGTA CAAAAATACA TTATCCAAAC TAACCATATT TTCTCACCGG GATTATTGCA ATTAAGCTTG AAAACTTGGA ATAAAATGTC AAAAGAAGAT CAAGAAATTT TCTTAGAAGC GGCTAAATTA TATCAAGAGA AAGAATGGGA ATTAGCGATG AAGATGGAAC AAGATGTTAA AGACTTCTTC CATAAAAACG GTAAAGAAGT AATTATTCCG TCCGAGCAGT TTAAAGCAGA TATGTTGAAA GCCTCTGAAA CACTTTATCA AAATTTTTAT CAAAAATATG ATTGGGCAAA AGGCGTGATT GAAAAAATTC AAGCCGCTAA ATAA
|
Protein sequence | MKLKKLTKVL VVAGLVFSSS AFAKTVIKIG HFNSDMHPSN IVLNEVFKKT VEEKTQGRYE IRIFPNNQLG GEDQIVNGLR NGTIEGGITG LLLQNVDPIF GVWEMPYLFK DNVEAKKVLE SPIAKEIGDK MEQYGIKLLA YGMNGFRVIS SNKKLEKFDD FKGLRLRVPL NSLFVDWAKA MNINPQSMPL SEVFTALEQK VIDGQENPYM LLKDSGLYEV QKYIIQTNHI FSPGLLQLSL KTWNKMSKED QEIFLEAAKL YQEKEWELAM KMEQDVKDFF HKNGKEVIIP SEQFKADMLK ASETLYQNFY QKYDWAKGVI EKIQAAK
|
| |