Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dret_1702 |
Symbol | |
ID | 8419533 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | + |
Start bp | 1958602 |
End bp | 1960152 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 645038276 |
Product | carbohydrate kinase, YjeF related protein |
Protein accession | YP_003198564 |
Protein GI | 258405822 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related [TIGR00197] yjeF N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.214184 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.051973 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACATTC CCCTGCCAAC GCCGGACGAA ATGGCCCTTT GGGACCGAAC CAGCATCGAC ACTTACGGCA TTGACGCCAA AATGCTCATG GAAAACGCGA GCCGCGAGGC TCTGCATGTC CTGCTCGAGG AATTCGGATC CTTACAGGGG AAATCCGCTG TGGTCTTTGC CGGTTCCGGG AACAATGCCG GCGACGCTTT TGCCCTGGCC CGGCATATGA CCGACATGGG CGCGAAGGTG ATGGTCCTGC ACAAAAAGGC GCAACAGGAA TACACCCGGG AAACCGCCTA CCACCTCCAG TTGGCCTTGA ATACCGAGGT TCCCCTGGTG CGGTTGCCGG AATACAACCT CGATTTCTTG ACCCATCCCG ACATCGTGGT CGACGGCCTG CTCGGCACTG GGTTCCGGGG CCTGCTCAAC AAGCAATACA AAGACTGGAT CAACCATATC AATCGCATGG GGAAAAACAG CTTTGTCTTT GCCCTGGACA TCCCCTCCGG GATCAATGGC TGGACCGGCC TGCCGAGCCC GACCGCGGTC AAAGCCGACG CCACCGCCAC ATTTGCCTTT GCCAAAGTCG GCTTATTCAT GGCTGAAAGC CGCCCCTATG TCGGCAACCT CCATGTCCGC TCCATCGGCT TGCCCGCCCG GGTTTCAGAA GACGCCCCGC CCAGTCATTT CGGCCTCGAC CAGCACATTC TCTCCTTGTA CCCCGCTCCG CAGGACGACC TGCACAAGGG AACGGCCGGA CATGTTCTCA TCGTCGGCGG CTCTGAAGGA CTGACCGGCG CCCCCCATCT GGCCGCCCTC GGCGCCCTTC GCGGTGGCGC AGGTTTGGTG ACCATCGCTA TTCCGGGGGC GCTGGCTTCC GAAGTCAAAA ACGGCGCAGC GGACATCATG ACCCTGCCCC TTGGTGAGGG AGGCAAATGG AGCGGATCCC TCATTGAGGC TTTGAGTCCC CATTTCGAGC GCTTTGACAG CGTGGTCATC GGCCCGGGGC TGGGCCGCGA CACTGGGAGC CGGAATTTCC TCCGTGCCTA CCTCCAGAGC GAACACCCCC CGACGGTCAT CGATGCCGAC GCGCTGTATT GGCTGGCCGA GGATCCACAA TGTGTCCAAC ATCTGGATCA GGAATGCATC CTGACGCCCC ACCCCGGGGA AATGGCACGG CTGTGCCGGA AATCCAACAA TGAAGTCCAG CAAAACCGAC CAGCTATCCT GCGCCAGGCC GTGCAGGATT TCCAGTGCAC TATGGTCTTC AAAGGGGCCA ATACATTGAT CACTGCGCCC AACCGGCCCA TGTACGTCTC CCCCATCGCC TGCGCCAACC TGGCCATCGG CGGCGCTGGC GACATCCTGG CCGGTCTGAT AGGCAGCTTA CGCAACAGCT CCCTTTCCCC CTTGCAGGCC ACCTGCCTCG GCGTGTATTG GCATGGTTTC GCCGGCGAAC TCCTCCGGGA GAGATACCCC TACAGAGGCA ACCGAGCCAC GGAGTTGGCC GACATCCTGC CCCTCGTTTT TAAGGAGACA CACGATGCCC ACAGTTGCTG A
|
Protein sequence | MYIPLPTPDE MALWDRTSID TYGIDAKMLM ENASREALHV LLEEFGSLQG KSAVVFAGSG NNAGDAFALA RHMTDMGAKV MVLHKKAQQE YTRETAYHLQ LALNTEVPLV RLPEYNLDFL THPDIVVDGL LGTGFRGLLN KQYKDWINHI NRMGKNSFVF ALDIPSGING WTGLPSPTAV KADATATFAF AKVGLFMAES RPYVGNLHVR SIGLPARVSE DAPPSHFGLD QHILSLYPAP QDDLHKGTAG HVLIVGGSEG LTGAPHLAAL GALRGGAGLV TIAIPGALAS EVKNGAADIM TLPLGEGGKW SGSLIEALSP HFERFDSVVI GPGLGRDTGS RNFLRAYLQS EHPPTVIDAD ALYWLAEDPQ CVQHLDQECI LTPHPGEMAR LCRKSNNEVQ QNRPAILRQA VQDFQCTMVF KGANTLITAP NRPMYVSPIA CANLAIGGAG DILAGLIGSL RNSSLSPLQA TCLGVYWHGF AGELLRERYP YRGNRATELA DILPLVFKET HDAHSC
|
| |