Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_0437 |
Symbol | |
ID | 4239913 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | + |
Start bp | 464465 |
End bp | 465805 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 638103979 |
Product | PTS system, cellobiose-specific IIC component |
Protein accession | YP_718646 |
Protein GI | 113460582 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1455] Phosphotransferase system cellobiose-specific component IIC |
TIGRFAM ID | [TIGR00359] phosphotransferase system, cellobiose specific, IIC component [TIGR00410] PTS system, lactose/cellobiose family IIC component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTATCT CACATCGATT TATTGATTTT ATGAATACGA ACGTGGCGCC GGTTGCCCGT CGTATGGAAA ACCAACCGCA CATTTCTGCT ATTCGTGATG GCTTTATTGT AGTGCTACCA TTCTTGATTG TCGGTAGTTT TATTATGATT TTGCTCATTC CACCGTTTGA TGAAAATACC ACGAATGTGT TTGGTCAAGC ATGGTGGCGT TTTGCTAACT GGGCAAGCCC GTATGGTTGG AACTTCTTCC AAATGTCGTT TAATGCGATT TCTTTGTTTA CCTCAGCCAG TATCGCTTAC AACTTAGCGA AAGCGTATAA ACGTGAACCA CTGCCTGCGG CATTCTTGTC TGTAATGGCT TTCTTACTGG TGGCCGCACC GGTGAAAGAT GGTACAATGG ACATCAAATT CTTCGGCGGT ATTGGGTTAT TCTCAGCGAT CTTTATTGCT ATTTATACTG TTGAAATGAC TCGCTTGCTA GAGTTCTTAA AAATCAAAAT TCGGTTGCCG AAAGAAGTTC CACATGCAGT TGCGGAGTCA CTGAATATCG TCATCCCGAT TTTAGCAGTG TTAGTAACAA TTTATCCGTT CTCTATTTGG GTGGAAGAGT ATTCGAGTCG TAACATTCCA CAGTTGATTA TGGACGTTAT GGCACCGTTG ATTGCCGTAA GTGATTCCTT AGGTGCAATC TGCTTGTTTG TTATTGCTAC TCACTTACTT TGGTTTTTAG GAATTAATGG GTCACTAGTG TTAATGCAAC TTTGGACACC GTTTCTATTG CAAAACATGG CAGCAAATTT GGCGGCTTTT CAAGCTGGCG AACCTCTACC TTTTATCATC ACTAACTCAT TTTGGGATTT CTATATCGTG CACGGTGCAT CAGGCGGTGT TATTGCGTTA GCTTTTTTGC TGGTACGTAG TAAATCCGCA CATTTGCGTT CTATCGGTAA AATCGGTTTG GTGCCATCTT TCTTCTCTAT CGGAGAACCG ATTGTGTATG GTGTACCAAT GGTCGTAAAC CCACTTTTCT TTATCCCACT TATTTTTGCA CCATTAGCCA ATTCAATCAT TGCCTATCTA ATTTTGGATT TTGATTTAAT CCATCGCATT TACTTAATGG CACCGTGGAC AACCCCTGCT CCGATTGGTG CTTACTTGGT GTCCGCTGGG GATATTTGGG CACCGGTATT AAGTATTGCA CTGATTATTC TCGATATTTT AATCTACTAT CCATTCTTTA AGATGTACGA AAAAATCTGT GTTGAAAAAG AGCGGAACCA AGTATTAGAT GAGGCTGCAC AAAAGGAGTT GATTGAAAAA GCGAAAATGG CAGCACAGTA A
|
Protein sequence | MSISHRFIDF MNTNVAPVAR RMENQPHISA IRDGFIVVLP FLIVGSFIMI LLIPPFDENT TNVFGQAWWR FANWASPYGW NFFQMSFNAI SLFTSASIAY NLAKAYKREP LPAAFLSVMA FLLVAAPVKD GTMDIKFFGG IGLFSAIFIA IYTVEMTRLL EFLKIKIRLP KEVPHAVAES LNIVIPILAV LVTIYPFSIW VEEYSSRNIP QLIMDVMAPL IAVSDSLGAI CLFVIATHLL WFLGINGSLV LMQLWTPFLL QNMAANLAAF QAGEPLPFII TNSFWDFYIV HGASGGVIAL AFLLVRSKSA HLRSIGKIGL VPSFFSIGEP IVYGVPMVVN PLFFIPLIFA PLANSIIAYL ILDFDLIHRI YLMAPWTTPA PIGAYLVSAG DIWAPVLSIA LIILDILIYY PFFKMYEKIC VEKERNQVLD EAAQKELIEK AKMAAQ
|
| |