Gene Information |
Locus tag | HS_0051 |
Symbol | ego |
ID | 4239559 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | + |
Start bp | 56274 |
End bp | 57782 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 638103582 |
Product | ABC transporter, ATP-binding, sugar transport |
Protein accession | YP_718257 |
Protein GI | 113460200 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1129] ABC-type sugar transport system, ATPase component |
TIGRFAM ID | |
|
|
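The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length = gene_length(56274, 57782)
print(length, protein_length(length))
```

With the values in this record, the computed lengths agree with the listed Gene Length (1509 bp) and Protein Length (502 aa).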
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000186076 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGAGA CAGAATCTAA TCCTACACCT TTACTGGAAG CTTGTGATGT AAGTAAAGCT TTCTCTGGTG TTGAAGTACT AAAATCCATT AACTTTTCCC TTTATTCTGG GCAGGTACAT ACTCTACTGG GTGGAAACGG TGCTGGCAAA TCAACCTTGA TGAAAATTAT TGCCGGTATT CAGTTGCAAG ATACCGGCAC GTTAAAAATT CATGGCAAAG TCCGAACCAA TTTAACACCA AGCAAGGCAC ATCAATTAGG TATTTACCTT GTTCCGCAAG AACCTTTGTT ATTTTCTAAT TTGAGTGTCA AAGAGAATAT TTTATTCGGT TTGCCAAGAA CAATGGATTT ACAGTCAAAG CTCACAGCGT TGTTGACGGA ACTAAACTCA CATTTAAATT TGGATATGCA GGCTGGTTTA TTAGATGTTG CTGATCAGCA GTTAGTTGAA ATTATGCGAG GCTTAATGAG AGATGCTCAG ATTCTGATTT TAGATGAACC TACCGCCTCA TTAACACCAG TGGAAACCGA AAATTTATTC GGACAGATAA ATTCCTTACT CGTCAAAGGG GTAGGGATTG TTTTTATTTC GCATAAATTG TCTGAAGTTT GGCAAATTGC TGATTGGGTG AGTGTAATGC GAGATGGTTA CATTGCCTTA AGCGAGCCAA TTGACAAGAT CAATCACGAC GATGTTATTC AGGCGATTAC TCCTCGAGCT AAAAATGCAT CATTAACAGA AAGTCAGCAG TTGTGGTTGG ATTTACCAGG GCATAAGAGG GTGAAATCGA CCGAATTTCC ATTATTGACA GTAGAAAATT TCACTGGTGA AGGATTCCGA GATATCAACT TTGAAATTTA TCCTGGCGAG ATTTTAGGAT TAGCAGGCGT AGTCGGCTCA GGTAGAACCG AATTAGCTGA AACGCTGTAT GGTTTGCGTA AAGCCAAATC CGGTAAAGTG ATATTTTGTG ATGAGCTAAT TAATTCACTG TCCGTTAAGC AACGGCTTAA ACGAGGCTTA GTGTATTTGC CCGAAGATAG GCAAGCCTCT GGATTGCATT TAGATTCACC ATTAAGTTGG AATACCTGCG GATTAACCTA TCATGATATG GGCTTTTGGA TTGATGAGTC CAAGGAGAAA GCAATTTTAG AACGCTATTA CCATGCTATC GGAATAAAAT TTAATCATGT TGAGCAATCG GTACGTACCT TATCAGGTGG TAATCAGCAA AAAATTTTGA TCGCAAAATG TTTGGAAGCC GATCCTCGGT TGTTGATTGT TGATGAGCCA ACTCGTGGCG TTGATATTGG TGCGAGAGCA GATATTTATC AATTAATAAA AAGTATTGCC AAACAAAATG TAGCCGTGTT GTTCATTTCA TCTGACTTGG AGGAAATAGA ACAAATGTCT AGTCGAGTTT TGGTAATGCA CGACGGTAAG CTTAGTCAAT CGTTGAAAGG TGAAGATATC AATACTGATA ATATTATGCG ACTGGCATTC AGTAGTTAG
|
Protein sequence | MKETESNPTP LLEACDVSKA FSGVEVLKSI NFSLYSGQVH TLLGGNGAGK STLMKIIAGI QLQDTGTLKI HGKVRTNLTP SKAHQLGIYL VPQEPLLFSN LSVKENILFG LPRTMDLQSK LTALLTELNS HLNLDMQAGL LDVADQQLVE IMRGLMRDAQ ILILDEPTAS LTPVETENLF GQINSLLVKG VGIVFISHKL SEVWQIADWV SVMRDGYIAL SEPIDKINHD DVIQAITPRA KNASLTESQQ LWLDLPGHKR VKSTEFPLLT VENFTGEGFR DINFEIYPGE ILGLAGVVGS GRTELAETLY GLRKAKSGKV IFCDELINSL SVKQRLKRGL VYLPEDRQAS GLHLDSPLSW NTCGLTYHDM GFWIDESKEK AILERYYHAI GIKFNHVEQS VRTLSGGNQQ KILIAKCLEA DPRLLIVDEP TRGVDIGARA DIYQLIKSIA KQNVAVLFIS SDLEEIEQMS SRVLVMHDGK LSQSLKGEDI NTDNIMRLAF SS
|
| |
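The sequences above are printed in blocks of ten characters, so the spaces need to be stripped before any analysis. The following is a minimal sketch, not the database's own tooling, of how the Gene sequence field could be cleaned, its GC fraction computed, and the protein derived. It assumes the amino acid assignments of translation table 11, which match the standard genetic code (table 11 differs from the standard table only in its permitted start codons); all function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(raw: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the Gene sequence field above.
print(translate("ATGAAAGAGA CAGAATCTAA TCCTACACCT"))
```

Passing the full Gene sequence field to `translate` should yield the Protein sequence field once its spaces are removed, and `gc_fraction` can be compared against the listed GC content.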