Gene Rcas_2044 details


Gene Information

Locus tag: Rcas_2044
Symbol:
ID: 5539522
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 2620209
End bp: 2621210
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 58%
IMG OID: 640894179
Product: dihydroxyacetone kinase subunit DhaK
Protein accession: YP_001432150
Protein GI: 156742021
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2376] Dihydroxyacetone kinase
TIGRFAM ID: [TIGR02363] dihydroxyacetone kinase, DhaK subunit
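
The start and end coordinates, gene length, and protein length listed above are mutually consistent. A minimal Python sketch of that check follows. It assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(2620209, 2621210)
print(length_bp, protein_length(length_bp))  # prints: 1002 333
```

The stop codon accounts for the difference between 1002 / 3 = 334 codons and the 333 residues reported.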


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.799824
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAAC TGATCAATAA GCCGGAAGAT GTCGTCGTCG AGGCGCTGAA AGGGATCGAG 
TACGCCCATC CCGATCTGGT GAAGGTTCAC TACGAACCAA ATTTCATCTA CCGTGCCGAT
GCGCCGGTGC AGGGTAAGGT GGCGATTGTC TCCGGCGGCG GCTCCGGTCA TGAACCCATG
CACGGCGGAT TCGTCGGTAT GGGCATGCTC GATGCTGCGT GCCCGGGAGC AGTGTTTACC
AGCCCCACTC CCGACCAGAT GCTGGAAGCG ACGAAGATGG TTCATGGCGG CGCAGGCGTG
CTCCATATCG TCAAGAATTA CACCGGCGAC ATTCTCAACT TCGATATGGC TGCCGATCTG
GCGCGTGCTG AGGGGATCGA GATTGAGTCG GTCGTGACGA ACGACGATGT GGCAGTGCAG
GATTCGTTGT ATACTGCCGG TCGGCGTGGT GTAGGGGTGA CGGTGCTGGC AGAGAAGATC
TGCGGCGCTG CCGCCGAGGA AGGGCGTTCG CTCACGGATG TGGCGGATGT CTGCCGCAAG
GTGAATGGGT GGGGGCGGAG CATGGGCATG GCGCTGACCA GTTGCACCGT GCCGCACGCC
GGCAAGCCCA CCTTTGATCT GCCGGAAGAC GAGATGGAGA TCGGCATTGG CATTCATGGT
GAGCCGGGAC GCACGCGCAT GAAACTCAAA TCGGCTGATG AGATCACCGA AATGCTGATG
GAGCCGATCC TGAACGATCT GCCGTTCCAG GCTGGTGATA ACGTCCTGCT GTTCGTCAAC
AGCATGGGTG GTACGCCGCT GATCGAACTG TATATCATCT ATCGCAAGGC GTATGAAATC
GCCACAAAAT CGGGACTCAA GGTCGTCCGC AATCTGATCG GTCCGTATAT CACATCGTTG
GAAATGGCAG GCTGTTCGAT TACGCTGCTG AAGATGGACG ATGATCTGAT CCGTCTCTGG
GATGCGCCGG TCAGAACGCC TGCTCTGCGT TGGGGAGTGT AA
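
The gene sequence is printed in blocks of 10 bases, 60 bases per row. A minimal Python sketch for joining such rows into one continuous string and computing its GC fraction is given here. The helper names are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(block_text: str) -> str:
    """Join a block-formatted sequence into one uppercase string, dropping all whitespace."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied as printed (spaces included).
first_row = "ATGAAGAAAC TGATCAATAA GCCGGAAGAT GTCGTCGTCG AGGCGCTGAA AGGGATCGAG"
seq = clean_sequence(first_row)
print(len(seq), round(gc_fraction(seq), 2))
```

Applied to all rows of the gene sequence, the cleaned string should have the 1002 bp listed under Gene Length, and its GC fraction can be compared with the reported GC content.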
 
Protein sequence
MKKLINKPED VVVEALKGIE YAHPDLVKVH YEPNFIYRAD APVQGKVAIV SGGGSGHEPM 
HGGFVGMGML DAACPGAVFT SPTPDQMLEA TKMVHGGAGV LHIVKNYTGD ILNFDMAADL
ARAEGIEIES VVTNDDVAVQ DSLYTAGRRG VGVTVLAEKI CGAAAEEGRS LTDVADVCRK
VNGWGRSMGM ALTSCTVPHA GKPTFDLPED EMEIGIGIHG EPGRTRMKLK SADEITEMLM
EPILNDLPFQ AGDNVLLFVN SMGGTPLIEL YIIYRKAYEI ATKSGLKVVR NLIGPYITSL
EMAGCSITLL KMDDDLIRLW DAPVRTPALR WGV