Gene Information |
Locus tag | Cagg_3237 |
Symbol | |
ID | 7267384 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 3921658 |
End bp | 3923298 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643568058 |
Product | carbohydrate kinase, YjeF related protein |
Protein accession | YP_002464531 |
Protein GI | 219850098 |
COG category | [G] Carbohydrate transport and metabolism; [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein; [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related; [TIGR00197] yjeF N-terminal region |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.271613 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAACGG AGTTGCCATT TGTCGTGACC GCAGCACAAA TGCGCGCGGC TGAAGAGGCA GCAGTAGCTC GGGGCGACGA TTGGACAGTG TTGATGGAAC GAGCTGGTGT CGGTGTGGCG ACTGCGGCGT TGCACCATTT TGCCCCGTTA GGCGGTCGTG ATGTGTTGGT GTTGGTTGGT CCTGGGAACA ATGGTGGTGA TGCGTTAGTG GCAGCGCGTC ATTTGGCCGA TGCCGGCGCG CAGGTTCTGC TCTATTGCTG GCGACGGACA CAAGTCGATG CAAATTTGTC GGCGTGTCGC GCACGTCATC TGCGTGAAGT CCACGCGACC GATGATCCCG ATGGTAAGCT GCTGAGCGCT GCATTGCAGA CGGCGGTCTT AGTCATTGAC GGCCTGCTCG GTACCGGTGC GCGTCCACCA CAAGCCGATC TGGCGGCAAT TATTACTACA GTGAATGAGG TGCGTTCCCA ACGTACCGAT CTGCGCGTCC TGTCAATCGA TATACCAAGT GGCGTTGCTG CCGATGATGG TCGGGTTGCG ACCGTGGCGA TCAAGGCCGA TCTGACGGTT GCAACCGGTC TGCTCAAACG GGGGGTATTG CTCTGGCCGG GTCGTGGCTA TGCCGGGACA CTTGTCGTTG CACCGATTGG GTTGGGGGCA TTAGATGGAG CATTAACTAT GAGTACACGA TTGACCGCTG CCCAGGCTCG TTTGCTGTTG CCGGCACGCC CGGCTGATGC GCACAAAGGA GTGTTTGGCA AGGTATTGGT ACTGGCCGGT TCGATCAACT ATCCCGGCGC AGCGGTGTTA GCTTGTGCGG GTGCTCAGCG GGTTGGGGCC GGCTTAGTCA CCCTGGCAAC CGGGCGGAAT GTATTGGCGT TAGCTTCGTT GCCACCGGAA GTGACCTTGT TACCGGTGGC CGAAGGTGAT TGGGGAGCGA TTGGGCCGGC AGCGATCGAA GAACTAGCCG ATGATTTACC GCGCTATCAA GCACTCTTGA TTGGGCCGGG GCTGGGGCAA GCTGAAGCGA CCCGTTCACT GGTGCTGCGC CTATTTGGAT TAGATCAGGT ACGTAGCCGC ACGCGGGTTG GGTTTGTAGC GGTAGGTGAA ACCGAAGATC ATACACCGCC CCACACGGTA GAATTGCCTC CTACCGTCAT TGATGCCGAT GGCCTAAACT TGTTGGCGAG TGCTCATGGC TGGTTCGAGC GTTTACCGCC GGAACGGTGT GTACTGACCC CGCATCCCGG TGAAATGCGT CGGTTGCTCG GCGTAGCGGA ATTGCCGCCG GATGTCGTAG CGGTAGCGGC GGAGGCGGCC CAGCGCTGGC GGCAAACGGT GGTGCTGAAA GGTGCGACAA CGGTGATCGC CGCACCCGAT GGGCGTACTG TGATTCACGA CGGTGCCAAC CCGGCGTTGG CGACTGCCGG TGCGGGTGAT GTGTTGGCCG GTGCAATTGC CGGGCTGATC GCCCAAGGGT GTGGGCTATA TGACGCGGCG GTGCTAGGCG TCTATCTGCA TAGTGCTGCC GGTGCAAAGG CCCGGTTGAC GTTGGGTGAT GCCGGTGTTG TGGCCGGCGA TCTACTCCCG CTCTTGCCGC AAGCGATCCG TGATTTGCGA GCAAACGTAA GCACCGGGTG A
Protein sequence | METELPFVVT AAQMRAAEEA AVARGDDWTV LMERAGVGVA TAALHHFAPL GGRDVLVLVG PGNNGGDALV AARHLADAGA QVLLYCWRRT QVDANLSACR ARHLREVHAT DDPDGKLLSA ALQTAVLVID GLLGTGARPP QADLAAIITT VNEVRSQRTD LRVLSIDIPS GVAADDGRVA TVAIKADLTV ATGLLKRGVL LWPGRGYAGT LVVAPIGLGA LDGALTMSTR LTAAQARLLL PARPADAHKG VFGKVLVLAG SINYPGAAVL ACAGAQRVGA GLVTLATGRN VLALASLPPE VTLLPVAEGD WGAIGPAAIE ELADDLPRYQ ALLIGPGLGQ AEATRSLVLR LFGLDQVRSR TRVGFVAVGE TEDHTPPHTV ELPPTVIDAD GLNLLASAHG WFERLPPERC VLTPHPGEMR RLLGVAELPP DVVAVAAEAA QRWRQTVVLK GATTVIAAPD GRTVIHDGAN PALATAGAGD VLAGAIAGLI AQGCGLYDAA VLGVYLHSAA GAKARLTLGD AGVVAGDLLP LLPQAIRDLR ANVSTG
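
The length fields in this record follow from its coordinates: the gene length is the inclusive span from Start bp to End bp, and for an unspliced CDS the protein length is the codon count minus the stop codon. The Python sketch below shows one way to check this and to compute GC content from a sequence field printed in space-separated blocks. The function names are hypothetical and are not part of any database API, and the protein-length rule assumes a complete, unspliced CDS that ends in a single stop codon.

```python
def clean_sequence(field: str) -> str:
    """Strip the whitespace that splits a sequence field into 10-letter blocks."""
    return "".join(field.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Inclusive length in bp of a feature spanning start_bp..end_bp."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a complete, unspliced CDS (assumes one terminal stop codon)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(field: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence field."""
    seq = clean_sequence(field)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the record above.
    length = gene_length(3921658, 3923298)
    print(length)                  # 1641
    print(protein_length(length))  # 546
```

Passing the full Gene sequence field to `gc_percent` gives a value to compare against the rounded GC content listed in the record.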