Gene Cagg_3237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3237 
Symbol 
ID7267384 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3921658 
End bp3923298 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content60% 
IMG OID643568058 
Productcarbohydrate kinase, YjeF related protein 
Protein accessionYP_002464531 
Protein GI219850098 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein
[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.271613 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAACGG AGTTGCCATT TGTCGTGACC GCAGCACAAA TGCGCGCGGC TGAAGAGGCA 
GCAGTAGCTC GGGGCGACGA TTGGACAGTG TTGATGGAAC GAGCTGGTGT CGGTGTGGCG
ACTGCGGCGT TGCACCATTT TGCCCCGTTA GGCGGTCGTG ATGTGTTGGT GTTGGTTGGT
CCTGGGAACA ATGGTGGTGA TGCGTTAGTG GCAGCGCGTC ATTTGGCCGA TGCCGGCGCG
CAGGTTCTGC TCTATTGCTG GCGACGGACA CAAGTCGATG CAAATTTGTC GGCGTGTCGC
GCACGTCATC TGCGTGAAGT CCACGCGACC GATGATCCCG ATGGTAAGCT GCTGAGCGCT
GCATTGCAGA CGGCGGTCTT AGTCATTGAC GGCCTGCTCG GTACCGGTGC GCGTCCACCA
CAAGCCGATC TGGCGGCAAT TATTACTACA GTGAATGAGG TGCGTTCCCA ACGTACCGAT
CTGCGCGTCC TGTCAATCGA TATACCAAGT GGCGTTGCTG CCGATGATGG TCGGGTTGCG
ACCGTGGCGA TCAAGGCCGA TCTGACGGTT GCAACCGGTC TGCTCAAACG GGGGGTATTG
CTCTGGCCGG GTCGTGGCTA TGCCGGGACA CTTGTCGTTG CACCGATTGG GTTGGGGGCA
TTAGATGGAG CATTAACTAT GAGTACACGA TTGACCGCTG CCCAGGCTCG TTTGCTGTTG
CCGGCACGCC CGGCTGATGC GCACAAAGGA GTGTTTGGCA AGGTATTGGT ACTGGCCGGT
TCGATCAACT ATCCCGGCGC AGCGGTGTTA GCTTGTGCGG GTGCTCAGCG GGTTGGGGCC
GGCTTAGTCA CCCTGGCAAC CGGGCGGAAT GTATTGGCGT TAGCTTCGTT GCCACCGGAA
GTGACCTTGT TACCGGTGGC CGAAGGTGAT TGGGGAGCGA TTGGGCCGGC AGCGATCGAA
GAACTAGCCG ATGATTTACC GCGCTATCAA GCACTCTTGA TTGGGCCGGG GCTGGGGCAA
GCTGAAGCGA CCCGTTCACT GGTGCTGCGC CTATTTGGAT TAGATCAGGT ACGTAGCCGC
ACGCGGGTTG GGTTTGTAGC GGTAGGTGAA ACCGAAGATC ATACACCGCC CCACACGGTA
GAATTGCCTC CTACCGTCAT TGATGCCGAT GGCCTAAACT TGTTGGCGAG TGCTCATGGC
TGGTTCGAGC GTTTACCGCC GGAACGGTGT GTACTGACCC CGCATCCCGG TGAAATGCGT
CGGTTGCTCG GCGTAGCGGA ATTGCCGCCG GATGTCGTAG CGGTAGCGGC GGAGGCGGCC
CAGCGCTGGC GGCAAACGGT GGTGCTGAAA GGTGCGACAA CGGTGATCGC CGCACCCGAT
GGGCGTACTG TGATTCACGA CGGTGCCAAC CCGGCGTTGG CGACTGCCGG TGCGGGTGAT
GTGTTGGCCG GTGCAATTGC CGGGCTGATC GCCCAAGGGT GTGGGCTATA TGACGCGGCG
GTGCTAGGCG TCTATCTGCA TAGTGCTGCC GGTGCAAAGG CCCGGTTGAC GTTGGGTGAT
GCCGGTGTTG TGGCCGGCGA TCTACTCCCG CTCTTGCCGC AAGCGATCCG TGATTTGCGA
GCAAACGTAA GCACCGGGTG A
 
Protein sequence
METELPFVVT AAQMRAAEEA AVARGDDWTV LMERAGVGVA TAALHHFAPL GGRDVLVLVG 
PGNNGGDALV AARHLADAGA QVLLYCWRRT QVDANLSACR ARHLREVHAT DDPDGKLLSA
ALQTAVLVID GLLGTGARPP QADLAAIITT VNEVRSQRTD LRVLSIDIPS GVAADDGRVA
TVAIKADLTV ATGLLKRGVL LWPGRGYAGT LVVAPIGLGA LDGALTMSTR LTAAQARLLL
PARPADAHKG VFGKVLVLAG SINYPGAAVL ACAGAQRVGA GLVTLATGRN VLALASLPPE
VTLLPVAEGD WGAIGPAAIE ELADDLPRYQ ALLIGPGLGQ AEATRSLVLR LFGLDQVRSR
TRVGFVAVGE TEDHTPPHTV ELPPTVIDAD GLNLLASAHG WFERLPPERC VLTPHPGEMR
RLLGVAELPP DVVAVAAEAA QRWRQTVVLK GATTVIAAPD GRTVIHDGAN PALATAGAGD
VLAGAIAGLI AQGCGLYDAA VLGVYLHSAA GAKARLTLGD AGVVAGDLLP LLPQAIRDLR
ANVSTG