Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | DET0427 |
Symbol | |
ID | 3230249 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dehalococcoides ethenogenes 195 |
Kingdom | Bacteria |
Replicon accession | NC_002936 |
Strand | + |
Start bp | 399337 |
End bp | 400875 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637119993 |
Product | carbohydrate kinase family protein |
Protein accession | YP_181171 |
Protein GI | 57234780 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related [TIGR00197] yjeF N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00841365 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATATAG TCTCAACTAC CCAGATGCGG CAAATAGAAG AGGCCTGCGT GAAACAGGGT ATCACCACCG AAACCCTGAT GGACAATGCC GGCAGGGCGG TTGCTGTATT GGCCCGCCAC CTGCTGGAAA AGCAAAACGG CTGCCGGGTG GTGATACTTG CGGGTGCGGG CAATAACGGG GGTGACGGGC TGGTTGCCGG GCGTTATCTG GCGGCCTGGG ATAAAAAGGT GAGCATATTC GAACCCTTTG CGGGTGCTTC AAAAAATAAA ACTGCTTCGG GTGGTTTAAA AACCCCTGAA AACATATTTA CGGATTTAAC CGGACTTAAG GATCATCTTA GGGAAGCAGA TTTGGTAATA GATGCTTTGC TGGGTACAGG CGTAAACCGC CCCTTGGAAG GGGTGTACAG GCAAGCCCTG CAGATGACGG CTGATGTCAA AAATACCAGA CCCGAAATGC AAATACTGGC CTTAGATTTA TCCTCCGGCC TGAATGCCGA TACCGGGCAG GCGGACGAAG CCTGTTTGAA AGCTGATTTT ACCCTGTCAC TGGGTATTGC CAAGCAGGGG CTTTTTACCC ACCTCGGGCT GGAATTAAGC GGGGCGGTTT CGGTGGCGGA TATCGGCATA CCGCTGGGAC TTACAGCTGA TATCCATACC CGTCTGATTG AAAAAGACTG GGCAAAGGGT GTTTTGCCTG TCCGTTCCCC TCATGCCAAC AAGGGCAGTT TCGGCCGGGT GATGATTGTG GCGGGGAGTG ACCCTTACAT CGGGGCGGCC ATGCTTGCCG GAAGTGCCGC CATGCGTATT GGTGCGGGCT TGGTTACCCT GGCGCTGCCT CAAAGTTTAA CCGGGGCGGT AGCCGCCAAA ATACCCGAAG CTACCTATCT GCCGTTGCCC GAAGTATCTT GCGGCACTGC GGATAGCTTT GCTTCACGCC TTATCCTGAG TGAACTGGTC AAATATGACG TGCTTCTGAT AGGGCCGGGC CTGGGGCAAA GTCCATATGC CGCCAGGCTG GTTACCGAAG TGCTGTCTAA CCTGCCTGAG GAGCTTAAAG TGGTAATAGA CGCTGATGCT TTAAATATAC TTGCCGCTAT ACCCCGGTGG TGGCTGGAAT ATAGTTTTGA TGCTATACTC ACTCCGCATC CGGGTGAAAT GGCCCGCCTA GCCAAAACCA CAGCTGAGGC TGTCCAGTCT GACCGCTTCG GCATTTGCCG TGAATCTGCC CGCAAATGGG GTAAGACCAT TATTCTTAAA GGTGCCGGAA CAATAGTTTC ATCACCAGAG GGTGAAACCC TGTGCAACCC GGCGGCTAAC CCGGTACTGG CTTCGGCCGG AACGGGTGAC GTACTGGCCG GAATAATAAG CGGTCTTTTG GGGCAGGGTT TGAACCTGTT TGAGGCGGCG GGTTTAGGCG TTTATCTGCA CTCGCTAGCT GGGGCAACTC TGCGGAGTGA AATGGGTGAT GCCGGTGTAC TGGCTTCGGA TCTGCTGTTA AAATTGCCTG CGGTTATAAA AGAATTGAAA CAGAGCTGA
|
Protein sequence | MYIVSTTQMR QIEEACVKQG ITTETLMDNA GRAVAVLARH LLEKQNGCRV VILAGAGNNG GDGLVAGRYL AAWDKKVSIF EPFAGASKNK TASGGLKTPE NIFTDLTGLK DHLREADLVI DALLGTGVNR PLEGVYRQAL QMTADVKNTR PEMQILALDL SSGLNADTGQ ADEACLKADF TLSLGIAKQG LFTHLGLELS GAVSVADIGI PLGLTADIHT RLIEKDWAKG VLPVRSPHAN KGSFGRVMIV AGSDPYIGAA MLAGSAAMRI GAGLVTLALP QSLTGAVAAK IPEATYLPLP EVSCGTADSF ASRLILSELV KYDVLLIGPG LGQSPYAARL VTEVLSNLPE ELKVVIDADA LNILAAIPRW WLEYSFDAIL TPHPGEMARL AKTTAEAVQS DRFGICRESA RKWGKTIILK GAGTIVSSPE GETLCNPAAN PVLASAGTGD VLAGIISGLL GQGLNLFEAA GLGVYLHSLA GATLRSEMGD AGVLASDLLL KLPAVIKELK QS
|
| |