Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_2044 |
Symbol | |
ID | 5539522 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 2620209 |
End bp | 2621210 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640894179 |
Product | dihydroxyacetone kinase subunit DhaK |
Protein accession | YP_001432150 |
Protein GI | 156742021 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2376] Dihydroxyacetone kinase |
TIGRFAM ID | [TIGR02363] dihydroxyacetone kinase, DhaK subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.799824 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAAC TGATCAATAA GCCGGAAGAT GTCGTCGTCG AGGCGCTGAA AGGGATCGAG TACGCCCATC CCGATCTGGT GAAGGTTCAC TACGAACCAA ATTTCATCTA CCGTGCCGAT GCGCCGGTGC AGGGTAAGGT GGCGATTGTC TCCGGCGGCG GCTCCGGTCA TGAACCCATG CACGGCGGAT TCGTCGGTAT GGGCATGCTC GATGCTGCGT GCCCGGGAGC AGTGTTTACC AGCCCCACTC CCGACCAGAT GCTGGAAGCG ACGAAGATGG TTCATGGCGG CGCAGGCGTG CTCCATATCG TCAAGAATTA CACCGGCGAC ATTCTCAACT TCGATATGGC TGCCGATCTG GCGCGTGCTG AGGGGATCGA GATTGAGTCG GTCGTGACGA ACGACGATGT GGCAGTGCAG GATTCGTTGT ATACTGCCGG TCGGCGTGGT GTAGGGGTGA CGGTGCTGGC AGAGAAGATC TGCGGCGCTG CCGCCGAGGA AGGGCGTTCG CTCACGGATG TGGCGGATGT CTGCCGCAAG GTGAATGGGT GGGGGCGGAG CATGGGCATG GCGCTGACCA GTTGCACCGT GCCGCACGCC GGCAAGCCCA CCTTTGATCT GCCGGAAGAC GAGATGGAGA TCGGCATTGG CATTCATGGT GAGCCGGGAC GCACGCGCAT GAAACTCAAA TCGGCTGATG AGATCACCGA AATGCTGATG GAGCCGATCC TGAACGATCT GCCGTTCCAG GCTGGTGATA ACGTCCTGCT GTTCGTCAAC AGCATGGGTG GTACGCCGCT GATCGAACTG TATATCATCT ATCGCAAGGC GTATGAAATC GCCACAAAAT CGGGACTCAA GGTCGTCCGC AATCTGATCG GTCCGTATAT CACATCGTTG GAAATGGCAG GCTGTTCGAT TACGCTGCTG AAGATGGACG ATGATCTGAT CCGTCTCTGG GATGCGCCGG TCAGAACGCC TGCTCTGCGT TGGGGAGTGT AA
|
Protein sequence | MKKLINKPED VVVEALKGIE YAHPDLVKVH YEPNFIYRAD APVQGKVAIV SGGGSGHEPM HGGFVGMGML DAACPGAVFT SPTPDQMLEA TKMVHGGAGV LHIVKNYTGD ILNFDMAADL ARAEGIEIES VVTNDDVAVQ DSLYTAGRRG VGVTVLAEKI CGAAAEEGRS LTDVADVCRK VNGWGRSMGM ALTSCTVPHA GKPTFDLPED EMEIGIGIHG EPGRTRMKLK SADEITEMLM EPILNDLPFQ AGDNVLLFVN SMGGTPLIEL YIIYRKAYEI ATKSGLKVVR NLIGPYITSL EMAGCSITLL KMDDDLIRLW DAPVRTPALR WGV
|
| |