Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cmaq_1640 |
Symbol | |
ID | 5709341 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caldivirga maquilingensis IC-167 |
Kingdom | Archaea |
Replicon accession | NC_009954 |
Strand | - |
Start bp | 1715827 |
End bp | 1717455 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641276148 |
Product | carbohydrate kinase, YjeF related protein |
Protein accession | YP_001541453 |
Protein GI | 159042201 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related [TIGR00197] yjeF N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.0283025 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGCTTAA TTAACATGAG TAGTGAAGCA ATATCAGTAT TAGAAATGAG GGTTCTGGAT GCTAACACTG CGTACCTGGG TATTGACAGT AGAATACTAA TGGAGAATGC CGGTAGGGGT GTTGCTCAAA TAGTCTCCTC AAGGTGGCCG AACGCAGGTA AGGTGCTTGT TGTGGCTGGG TTAGGTAACA ATGGTGGGGA TGGTATTGTT GCCGGCAGAT ACCTTTACAA TTGGGGTAAG GACGTGGTTA TCATACTACT GGGTAGGGTA AGTGACATGA GGGAGGAGCC TGCCGCCACT AATCTTAAGA TATGCCTAAA CCTACTGGGT TGCAGGGTAA TGGAGGCTAG GGATGAGTTG GAGCTGCTCT CCTACCAGGA TTACTTCATT AAATGGGCCG ACGTAATTAT TGACGCAATC CTAGGCATTG GTGTTAAGGG TAGGATTAGG CAACCTGCCT CGGCTGCAAT AGACTTAATA AACATGTCTA AGGCACCTAA GGTTGCGGTT GACATACCCT CAGGTCTAGA TCCCGATACC GGTGATGTTG CGGATAAGGC TGTTAAGGCT GACTTAACCG TAACCATGCA TAAGGCTAAG CGTGGGCTTA TTGCCGATAA GGCTAAGCCG TATGTTGGTG AACTGGTGGT AGTCGACATA GGTATCCCGA AGGAGGCCGA GTTAATAGTA GGGCCCGGTG ACTTGAATTA CTTAACATAC GCCAGGAGGC TTGATTCCAA GAAGGGTGAT TTCGGTAGGA TAGCCATAAT AGGGGGTTCA AGGGATTACA CGGGCGCCAT AGCTTTAACG GCTTTAGCAT CATTAATAAC GGGGGCTGAC TTACCAATAG TCTACGCGCC TCATGATGTT GCACACGACA TTAGATCACA GACACCAAAC CTAATAGCAG TGCCACTTGA GGGTGAGGTT TTAAGTAAGG ATAATGTTGG ACCAGTACTT AGAGGTATTG AGAGGGCTAA TGTGGTAGCC ATAGGTCCTG GGCTAGGACT TGAGAAGACG ACAATGGAGG CAGTCTACAT TATACTTGAG ACCGCGGTTA AGTTGGGTAA GAGGATTGTT ATTGATGCTG ATGCGATAAA GGCCATTGGA ATCGGGAAGA AACTTAACCT ACTTAAGCCA GGCGTGGTGC TTACTCCACA TGCCGGGGAG TTAAGGGAGT TACTGGGTAT TGATGTACCT AAGCTTAATC CAATTGAAAC CGGGCAGTGG CTTAAGGAGC AGGTTTCGAA ATGCTGCCCA GGTAGTGTAA TACTCCTTAA GGGTAATACT GATGTTATTA GTGATGGTTC AAGGATTAAA TTAAACATGA GTGGTAATCC AGGTATGACA GTTGGTGGGA CGGGAGACGT ATTAACAGGG GTTTTAGCAA CAATGCTTCA TAGGGTTAAT GACCCCCTTG AGGCTGCAGC CATAGCAGCA TTCATAACAG GTGTTGCAGG TGACTTGGCT GCGTTAGAGT TAGGCTACCA CTTAACCCCA ATGGATGTGG TAAATAGGAT ACCTAAGGCG TTCTCAATAT TCATAAACAC TAAGGGGATA ATTGAGGAGG CTTTACATAA GCCCTTAAGA GAATACTTGA TTAAGCACCA GTTAATTAAA GGTGAATAA
|
Protein sequence | MGLINMSSEA ISVLEMRVLD ANTAYLGIDS RILMENAGRG VAQIVSSRWP NAGKVLVVAG LGNNGGDGIV AGRYLYNWGK DVVIILLGRV SDMREEPAAT NLKICLNLLG CRVMEARDEL ELLSYQDYFI KWADVIIDAI LGIGVKGRIR QPASAAIDLI NMSKAPKVAV DIPSGLDPDT GDVADKAVKA DLTVTMHKAK RGLIADKAKP YVGELVVVDI GIPKEAELIV GPGDLNYLTY ARRLDSKKGD FGRIAIIGGS RDYTGAIALT ALASLITGAD LPIVYAPHDV AHDIRSQTPN LIAVPLEGEV LSKDNVGPVL RGIERANVVA IGPGLGLEKT TMEAVYIILE TAVKLGKRIV IDADAIKAIG IGKKLNLLKP GVVLTPHAGE LRELLGIDVP KLNPIETGQW LKEQVSKCCP GSVILLKGNT DVISDGSRIK LNMSGNPGMT VGGTGDVLTG VLATMLHRVN DPLEAAAIAA FITGVAGDLA ALELGYHLTP MDVVNRIPKA FSIFINTKGI IEEALHKPLR EYLIKHQLIK GE
|
| |