Gene Information |
Locus tag | HS_0997 |
Symbol | ipk |
ID | 4240490 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | - |
Start bp | 1100290 |
End bp | 1101231 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 638104553 |
Product | 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase |
Protein accession | YP_719208 |
Protein GI | 113461140 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase |
TIGRFAM ID | [TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase |
|
|
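The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and encodes no residue.
    return gene_len_bp // 3 - 1


# Values copied from the record for HS_0997.
length = gene_length_bp(1100290, 1101231)
print(length)                     # 942
print(protein_length_aa(length))  # 313
```

Both results match the Gene Length (942 bp) and Protein Length (313 aa) fields.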
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAACT ATCAATTTTC GACCGCACTT TTATCATCGC AAATACAAGG AAAAAAATTA CGTTTTCCTT GCCCTGCAAA GATAAATTTG TTTTTATATA TAACTTCTCA ACGCCCCGAC GGTTACCATG AACTGCAAAC CTTATTTCAA TTTTTAAATT TTGGCGATTG GTTAAGTATT GAGATACGCA CTGACGGAAA AATAATCTTA ACATCCGAAA TACCTCATCT AAAAAACGAG GATAATTTAA TCTATCGAGC CGCTAAATTA TTACAACAAA AAACAGGGTG TACATTAGGG GCAAATTTAC ATTTAGATAA AATTTTGCCA ATAGGCGGAG GAGTTGGCGG AGGATCATCA AATGCAGCCA CAGCACTCGT TGCACTCAAC TATTTATGGA ATACACAACT TTCACTCTCA ACACTTGCTG AAATAGGATT ACAGCTCGGT GCTGATGTGC CTGTTTTTGT ATATGGAAAA GCCGCTTTTG CCGAAGGTGT TGGTGAAAAA CTTACTTTTT GTCAACCGCC ACAAAAATGG TTTTTAGTAT TAAAACCCGA AACATCTATT TCTACAGCTA TTATATTTAA AGATCCTAAC TTACCTCGCA ATACCCCAAA ACGACCTTTA GCGGAATTAT TAATAACAAA ATATGAAAAC GATTGCGAAA AAGTTGTTTT AAATCATTAT TCAGAGGTTG AAGAAGCCCT TGGCTGGTTG TTACAATATG CACCGGCAAG ATTAACAGGA ACTGGAGCTT GTGTTTTCGC TGAATTTGCT AACGAACAGG CGGCACAATC TGCATTTCTT GATAAACCTG AAAAATACGT TGGTTTTGTT GCTCAGGGAA CAAATATTTC ACCATTACAT CAAATGATTG AATATTTATC GCAACAAAAA CAAACACTTT GTCTTCCTAA CAATACAAAT TCCAGAGGTT AA
|
Protein sequence | MKNYQFSTAL LSSQIQGKKL RFPCPAKINL FLYITSQRPD GYHELQTLFQ FLNFGDWLSI EIRTDGKIIL TSEIPHLKNE DNLIYRAAKL LQQKTGCTLG ANLHLDKILP IGGGVGGGSS NAATALVALN YLWNTQLSLS TLAEIGLQLG ADVPVFVYGK AAFAEGVGEK LTFCQPPQKW FLVLKPETSI STAIIFKDPN LPRNTPKRPL AELLITKYEN DCEKVVLNHY SEVEEALGWL LQYAPARLTG TGACVFAEFA NEQAAQSAFL DKPEKYVGFV AQGTNISPLH QMIEYLSQQK QTLCLPNNTN SRG
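The sequences above are displayed in space-separated blocks of 10 characters, so the spacing has to be removed before any downstream use. A minimal sketch of that cleanup, plus a GC-percentage helper comparable to the GC content field, follows. The function names are hypothetical, and how the database rounds its reported 36% is not stated in the record.

```python
def clean_sequence(display: str) -> str:
    """Remove the block spacing used for display and upper-case the result."""
    return "".join(display.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in seq)
    return 100.0 * gc / len(seq)


# First two display blocks of the gene sequence, copied from the record.
first_blocks = "ATGAAAAACT ATCAATTTTC"
seq = clean_sequence(first_blocks)
print(len(seq))         # 20
print(gc_percent(seq))  # 20.0
```

Pasting the full Gene sequence field in place of `first_blocks` gives a value that can be compared with the reported GC content.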