Gene Cpha266_1264 details

Gene Information

Locus tag: Cpha266_1264
Symbol:
ID: 4570398
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 1437957
End bp: 1439540
Gene Length: 1584 bp
Protein Length: 527 aa
Translation table: 11
GC content: 51%
IMG OID: 639765855
Product: carbohydrate kinase, YjeF related protein
Protein accession: YP_911721
Protein GI: 119357077
COG category: [G] Carbohydrate transport and metabolism; [S] Function unknown
COG ID: [COG0062] Uncharacterized conserved protein; [COG0063] Predicted sugar kinase
TIGRFAM ID: [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related; [TIGR00197] yjeF N-terminal region
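
The start, end, gene length, and protein length fields can be cross-checked with simple arithmetic. The sketch below is illustrative only: Python is used because the record contains no code of its own, the function names are hypothetical, and it assumes 1-based inclusive coordinates and a single terminal stop codon on an unspliced CDS.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1437957
END_BP = 1439540


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1584
print(protein_length(length_bp))  # 527
```

Both results agree with the Gene Length (1584 bp) and Protein Length (527 aa) fields.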


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGCCTG TCGTAACGGC TATTGAGATG TCGAAGGCCG ATAAAGCGGC CATTGAGGAA 
CTCCGTATAG GTGAGAGTCG TCTCATGGAG CTTGCAGGCA GCGAAGCGGC CGATATCATT
CTGAAGGCTC TCGAAAAAAA CGATAATCCC GAAGGTTGCT CGTTTCTTGT GGTGTGCGGA
AAAGGAAATA ACGGCGGTGA CGGTTTTGTT GTTGCCCGAC ACCTTCTTAA CCGCGGCGCT
ACGGTAGACG TTGTGCTGCT TTGCCCCCCG GAAACCCTTA AGCCTGTCAA CAGGGAGGGG
TATCTGATCC TTGAGGCTTA CAGGCATCAT AATGAGCCTC TTCGGATTTT TCATGGCATT
GAAGAAGCAA TTGACAGCAT AACTGAAACC GGTTATTCTG CGCTGATCGA CGGCATTCTC
GGTACAGGTC TTCGAATCAC TCAGGCTGGC GAAGCACTGC CTGAACCGAT CGCCTCTGCG
ATTACGCTGC TCAACACCCT TCGCCATAAC TCGGACGCTC TCATAGCAGC TCTTGATGTT
CCTTCAGGTC TGGATGCCAC GACAGGGCTC TCTGCTTCGC CTGCTGTGAT TGCAGACCTC
ACCGTAACCA TGGCATTTCT GAAAACCGGT TTTTTTTTCA ATGAGGGCCC CCTTCATTGC
GGCGATCTTC ATACTGCCGA AATATCAATA CCCCGTTTTA TTGCTGAACC GGTATCCACG
CTTTTGACGG ACGGGGAGTT CGCTGCCGAA CAGTTCATCA TGCGAAATCC CGCCGCGGCA
AAACATCAAA ACGGAAAAGT TCTGATTATT GCCGGGTCAA TATCTTCAAC ATCTTCCATG
ATCGGGGCTG CAATGCTTGC TGTAAAAGCA GCATTAAAAA CAGGTGCCGG CTACGTTTGC
GTTTCACTGC CGCTGACGCA TGCCGCTGCA ATGCACGCAT TTGCTCCCGG AGCTGTAGTT
ATCGGACGGG ATCTTGACGT TATAGCAGAA AAAGCCCGAT GGGCTGACGC TGTGCTGATA
GGATGCGGAC TTGGCAGGGA TAGTGCATCC GTGAGCTTTA TTGCCGATCT GCTTCAACGA
AAGGAGATTG CCGCCAATAA ACTTGTCATT GACGCCGACG CGCTCTATGC GCTCGCTTTA
CCGGATCTTT CGTCGTTATC GTTTGGGTTT TCCGATGCTA TCCTGACACC GCATTACGGA
GAGATGAGTC GACTGAGCGG CTTCTCGGTG GAAAGCATTG CCTGCGATCC TCTTGATACG
GCAAGAACGT ATGCTGAAAA ACATCGGGTA AATCTGCTTC TGAAAGGATA TCCAACTGTA
ATTGCAGCGC CTTCCGATCC GGTGCTCCTG AATACTACAG GCACAGATGC TCTGGGAACG
GCCGGTTCGG GAGATATTCT TTCGGGAATG ATTGCCGCCC TTGCCGCCAA AGGAGCAACA
ACCTTCAATG CCGGCGCTGC CGCTGCCTGG TTTCATGGAA GGGCCGGCGA TCTTGCCGGA
ACCATATCAA GCATTGTTTC CGCTGAAGAT ATTCTCGAAG CGATCCCGTC TGCCATTCAG
GAAATTTTTC ATATAGAAGA ATAA
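
The GC content reported for this gene can be recomputed from the sequence text. A minimal sketch, assuming the sequence is pasted in the space-separated 10-base groups used in this record; `clean_sequence` and `gc_percent` are illustrative names, not part of any database tool. Only the first line of the sequence is included, so the printed percentage describes that fragment and not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence only; paste every line to check the full gene.
fragment = clean_sequence(
    "ATGCTGCCTG TCGTAACGGC TATTGAGATG TCGAAGGCCG ATAAAGCGGC CATTGAGGAA"
)
print(len(fragment))  # 60
print(round(gc_percent(fragment), 1))
```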
 
Protein sequence
MLPVVTAIEM SKADKAAIEE LRIGESRLME LAGSEAADII LKALEKNDNP EGCSFLVVCG 
KGNNGGDGFV VARHLLNRGA TVDVVLLCPP ETLKPVNREG YLILEAYRHH NEPLRIFHGI
EEAIDSITET GYSALIDGIL GTGLRITQAG EALPEPIASA ITLLNTLRHN SDALIAALDV
PSGLDATTGL SASPAVIADL TVTMAFLKTG FFFNEGPLHC GDLHTAEISI PRFIAEPVST
LLTDGEFAAE QFIMRNPAAA KHQNGKVLII AGSISSTSSM IGAAMLAVKA ALKTGAGYVC
VSLPLTHAAA MHAFAPGAVV IGRDLDVIAE KARWADAVLI GCGLGRDSAS VSFIADLLQR
KEIAANKLVI DADALYALAL PDLSSLSFGF SDAILTPHYG EMSRLSGFSV ESIACDPLDT
ARTYAEKHRV NLLLKGYPTV IAAPSDPVLL NTTGTDALGT AGSGDILSGM IAALAAKGAT
TFNAGAAAAW FHGRAGDLAG TISSIVSAED ILEAIPSAIQ EIFHIEE
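
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below assumes the NCBI translation table 11 codon assignments, written in the usual TCAG base order; the names `CODON_TABLE` and `translate` are illustrative. It does not handle the alternative start codons that table 11 permits, which is not an issue here because the gene begins with ATG. Only short fragments from the start and end of the gene are translated.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a nucleotide string, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 24 bases of the gene sequence in this record.
print(translate("ATGCTGCCTG TCGTAACGGC TATTGAGATG"))  # MLPVVTAIEM
print(translate("GAAATTTTTC ATATAGAAGA ATAA"))        # EIFHIEE
```

The two outputs match the first ten and last seven residues of the protein sequence listed above.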