Gene Clim_1204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_1204 
Symbol 
ID6353721 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp1298908 
End bp1300479 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content55% 
IMG OID642668820 
Productcarbohydrate kinase, YjeF related protein 
Protein accessionYP_001943250 
Protein GI189346721 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein
[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.128075 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTCCAG TAGTGACCGC GGCAGAAATG ATGAGAGCAG ACAGTGCCGC TATAGACGAT 
CTCCAGGTCG GAGCGATACG GCTTATGGAA CTTGCCGGAG CCCGTACATC GGATCTCGTC
AGGGAACTTA TCGAAAAAGA AAATATCTCC GGCAGCAGTT TTCTTGTAGT CTGCGGCAAG
GGCAACAATG GCGGCGACGG TCTGGTACTT GCCCGTCATC TGCTGAACCA CGGCGCGGAA
GTCGATATCC TGCTGCTCTA CCCGGAAACG GATTTATCCC CGATCAACCG CAATACTCTT
GACACTCTCT GTGGCTACCA AGCCCTGAGC GGACGTCTGC GGATCTTTCA TGGTCACGCT
GAGGCCCTGC CTTTCGTCAG GGATACACAC TATGAAGTTC TGATCGATGC GATACTCGGT
ACCGGGCTGA AACTCCGCCG ACAACTCTCA TCACCGCTAT GCGAAGGTAT TGATCTGCTG
AACGGCATCC ATGACCGTGC CGGGTCGCCA CTCATAGCAC TCGATATTCC TTCAGGTCTC
GATGCGACAT CCGGCGTTGC CGCCGAAAGG TGCGTTCTTG CCGATATGAC GGTTTCCATG
GCTTTTCTGA AAACTGGATT TTTTTTCAAT GACGGTCCAC TCCACTGCGG AGAGCTCCGT
ATCGCCGACA TATCGATCCC GGAATTTCTG ATCGCCCCTT CAGCCTGCCG TTTAACCGAT
AAAGAGTATG CCGCGGAACA TTTTATTCTG AGAGAGCCTG AGGGTGCAAA GCACCAGAGC
GGAAAGGTGC TTATTGTCGC CGGATCGCAA TCCGATAACG CTTCTATGCT TGGAGCGGCA
ATGCTTGCCG TTAAAGCCGC ACTCAAAACC GGTGCCGGAT ATGTGTGTGC GGCCATCCCC
ATTGCCGCTG CGGGCGTGCT GCATTCCTAT GCGCCGGAAG CAGTCGTCAT TGCCCAGGAG
ATGGACGCCA TCCTCGAAAA AGCCGGATGG GCAGATGCTG TGCTGATCGG ATGCGGACTT
GGAAGGGATC CGAAAACTGT AGATTTCATT CGCGAGCTGC TCCGGAAACC CGAAATAACA
GGCTGTAAAC TCGTGCTTGA CGCCGATGCG CTCTTTGCCC TTTCCGGTGT TGCCCTGCCT
GCATCCGGAA TCGATTTCGC CAATACCATA CTGACGCCGC ATTACGGAGA ATTCAGCCGG
CTTTGCGGCC ATACGGCAGA CGAGATTGCT CTGAACGCGC TTGTGCTCGC GACTGATTTT
GCACGGCTAA ACAGGGTCAA TCTGCTGCTC AAGGGCCATC CAACTCTGAT TGTCGGTGGC
GAGGAGGGGC TTATGCTTAA CGACTCGGGC ACTGAAGCGC TCTCTACCGC CGGCTCCGGA
GATATCCTGG CCGGGATGAT TGCAGCGATT GCTGCAAAAG GAGCCGAAAT ACTCGACGCA
GGAGCGGCGG CGGCCTGGTT TCACGGCAGG GCTGGAGATC TGGCAAATGA CATTTCCAGC
CTGGTATCGG CAAACGACAT CCTCAATGCC ATACCTGAAG CTGTGCAGGA AATTTTTTCC
CTGGAGGAGT AA
 
Protein sequence
MLPVVTAAEM MRADSAAIDD LQVGAIRLME LAGARTSDLV RELIEKENIS GSSFLVVCGK 
GNNGGDGLVL ARHLLNHGAE VDILLLYPET DLSPINRNTL DTLCGYQALS GRLRIFHGHA
EALPFVRDTH YEVLIDAILG TGLKLRRQLS SPLCEGIDLL NGIHDRAGSP LIALDIPSGL
DATSGVAAER CVLADMTVSM AFLKTGFFFN DGPLHCGELR IADISIPEFL IAPSACRLTD
KEYAAEHFIL REPEGAKHQS GKVLIVAGSQ SDNASMLGAA MLAVKAALKT GAGYVCAAIP
IAAAGVLHSY APEAVVIAQE MDAILEKAGW ADAVLIGCGL GRDPKTVDFI RELLRKPEIT
GCKLVLDADA LFALSGVALP ASGIDFANTI LTPHYGEFSR LCGHTADEIA LNALVLATDF
ARLNRVNLLL KGHPTLIVGG EEGLMLNDSG TEALSTAGSG DILAGMIAAI AAKGAEILDA
GAAAAWFHGR AGDLANDISS LVSANDILNA IPEAVQEIFS LEE