Gene Information |
Locus tag | Cpha266_1264 |
Symbol | |
ID | 4570398 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 1437957 |
End bp | 1439540 |
Gene Length | 1584 bp |
Protein Length | 527 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 639765855 |
Product | carbohydrate kinase, YjeF related protein |
Protein accession | YP_911721 |
Protein GI | 119357077 |
COG category | [G] Carbohydrate transport and metabolism; [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein; [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related; [TIGR00197] yjeF N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGCCTG TCGTAACGGC TATTGAGATG TCGAAGGCCG ATAAAGCGGC CATTGAGGAA CTCCGTATAG GTGAGAGTCG TCTCATGGAG CTTGCAGGCA GCGAAGCGGC CGATATCATT CTGAAGGCTC TCGAAAAAAA CGATAATCCC GAAGGTTGCT CGTTTCTTGT GGTGTGCGGA AAAGGAAATA ACGGCGGTGA CGGTTTTGTT GTTGCCCGAC ACCTTCTTAA CCGCGGCGCT ACGGTAGACG TTGTGCTGCT TTGCCCCCCG GAAACCCTTA AGCCTGTCAA CAGGGAGGGG TATCTGATCC TTGAGGCTTA CAGGCATCAT AATGAGCCTC TTCGGATTTT TCATGGCATT GAAGAAGCAA TTGACAGCAT AACTGAAACC GGTTATTCTG CGCTGATCGA CGGCATTCTC GGTACAGGTC TTCGAATCAC TCAGGCTGGC GAAGCACTGC CTGAACCGAT CGCCTCTGCG ATTACGCTGC TCAACACCCT TCGCCATAAC TCGGACGCTC TCATAGCAGC TCTTGATGTT CCTTCAGGTC TGGATGCCAC GACAGGGCTC TCTGCTTCGC CTGCTGTGAT TGCAGACCTC ACCGTAACCA TGGCATTTCT GAAAACCGGT TTTTTTTTCA ATGAGGGCCC CCTTCATTGC GGCGATCTTC ATACTGCCGA AATATCAATA CCCCGTTTTA TTGCTGAACC GGTATCCACG CTTTTGACGG ACGGGGAGTT CGCTGCCGAA CAGTTCATCA TGCGAAATCC CGCCGCGGCA AAACATCAAA ACGGAAAAGT TCTGATTATT GCCGGGTCAA TATCTTCAAC ATCTTCCATG ATCGGGGCTG CAATGCTTGC TGTAAAAGCA GCATTAAAAA CAGGTGCCGG CTACGTTTGC GTTTCACTGC CGCTGACGCA TGCCGCTGCA ATGCACGCAT TTGCTCCCGG AGCTGTAGTT ATCGGACGGG ATCTTGACGT TATAGCAGAA AAAGCCCGAT GGGCTGACGC TGTGCTGATA GGATGCGGAC TTGGCAGGGA TAGTGCATCC GTGAGCTTTA TTGCCGATCT GCTTCAACGA AAGGAGATTG CCGCCAATAA ACTTGTCATT GACGCCGACG CGCTCTATGC GCTCGCTTTA CCGGATCTTT CGTCGTTATC GTTTGGGTTT TCCGATGCTA TCCTGACACC GCATTACGGA GAGATGAGTC GACTGAGCGG CTTCTCGGTG GAAAGCATTG CCTGCGATCC TCTTGATACG GCAAGAACGT ATGCTGAAAA ACATCGGGTA AATCTGCTTC TGAAAGGATA TCCAACTGTA ATTGCAGCGC CTTCCGATCC GGTGCTCCTG AATACTACAG GCACAGATGC TCTGGGAACG GCCGGTTCGG GAGATATTCT TTCGGGAATG ATTGCCGCCC TTGCCGCCAA AGGAGCAACA ACCTTCAATG CCGGCGCTGC CGCTGCCTGG TTTCATGGAA GGGCCGGCGA TCTTGCCGGA ACCATATCAA GCATTGTTTC CGCTGAAGAT ATTCTCGAAG CGATCCCGTC TGCCATTCAG GAAATTTTTC ATATAGAAGA ATAA
|
Protein sequence | MLPVVTAIEM SKADKAAIEE LRIGESRLME LAGSEAADII LKALEKNDNP EGCSFLVVCG KGNNGGDGFV VARHLLNRGA TVDVVLLCPP ETLKPVNREG YLILEAYRHH NEPLRIFHGI EEAIDSITET GYSALIDGIL GTGLRITQAG EALPEPIASA ITLLNTLRHN SDALIAALDV PSGLDATTGL SASPAVIADL TVTMAFLKTG FFFNEGPLHC GDLHTAEISI PRFIAEPVST LLTDGEFAAE QFIMRNPAAA KHQNGKVLII AGSISSTSSM IGAAMLAVKA ALKTGAGYVC VSLPLTHAAA MHAFAPGAVV IGRDLDVIAE KARWADAVLI GCGLGRDSAS VSFIADLLQR KEIAANKLVI DADALYALAL PDLSSLSFGF SDAILTPHYG EMSRLSGFSV ESIACDPLDT ARTYAEKHRV NLLLKGYPTV IAAPSDPVLL NTTGTDALGT AGSGDILSGM IAALAAKGAT TFNAGAAAAW FHGRAGDLAG TISSIVSAED ILEAIPSAIQ EIFHIEE
|
| |
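The coordinate and length fields in the record above can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, not part of the original record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The function names `gene_length`, `expected_protein_length`, and `join_blocks` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS whose final codon is a stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def join_blocks(seq: str) -> str:
    """Remove the spaces that split a displayed sequence into 10-character blocks."""
    return "".join(seq.split())


# Values copied from the record: Start bp, End bp.
length = gene_length(1437957, 1439540)
protein = expected_protein_length(length)
print(length, protein)  # prints: 1584 527
```

Under these assumptions the computed values agree with the Gene Length (1584 bp) and Protein Length (527 aa) fields. `join_blocks` can be applied to the Gene sequence or Protein sequence field before any further processing, since both are displayed in space-separated blocks.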