Gene Dret_1702 details


Gene Information

Locus tag: Dret_1702
Symbol:
ID: 8419533
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 1958602
End bp: 1960152
Gene Length: 1551 bp
Protein Length: 516 aa
Translation table: 11
GC content: 61%
IMG OID: 645038276
Product: carbohydrate kinase, YjeF related protein
Protein accession: YP_003198564
Protein GI: 258405822
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0063] Predicted sugar kinase
TIGRFAM ID: [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
            [TIGR00197] yjeF N-terminal region
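
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, not part of the original record: the function name `cds_lengths` is hypothetical, and it assumes 1-based inclusive coordinates, an unspliced CDS (as the record states), and a terminal stop codon included in the gene span (the gene sequence on this page ends in TGA).

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and that the final codon
    is a stop codon, which does not contribute an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(1958602, 1960152)
print(gene_len, protein_len)  # 1551 516
```

The result matches the Gene Length (1551 bp) and Protein Length (516 aa) fields.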


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.214184
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.051973
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTACATTC CCCTGCCAAC GCCGGACGAA ATGGCCCTTT GGGACCGAAC CAGCATCGAC 
ACTTACGGCA TTGACGCCAA AATGCTCATG GAAAACGCGA GCCGCGAGGC TCTGCATGTC
CTGCTCGAGG AATTCGGATC CTTACAGGGG AAATCCGCTG TGGTCTTTGC CGGTTCCGGG
AACAATGCCG GCGACGCTTT TGCCCTGGCC CGGCATATGA CCGACATGGG CGCGAAGGTG
ATGGTCCTGC ACAAAAAGGC GCAACAGGAA TACACCCGGG AAACCGCCTA CCACCTCCAG
TTGGCCTTGA ATACCGAGGT TCCCCTGGTG CGGTTGCCGG AATACAACCT CGATTTCTTG
ACCCATCCCG ACATCGTGGT CGACGGCCTG CTCGGCACTG GGTTCCGGGG CCTGCTCAAC
AAGCAATACA AAGACTGGAT CAACCATATC AATCGCATGG GGAAAAACAG CTTTGTCTTT
GCCCTGGACA TCCCCTCCGG GATCAATGGC TGGACCGGCC TGCCGAGCCC GACCGCGGTC
AAAGCCGACG CCACCGCCAC ATTTGCCTTT GCCAAAGTCG GCTTATTCAT GGCTGAAAGC
CGCCCCTATG TCGGCAACCT CCATGTCCGC TCCATCGGCT TGCCCGCCCG GGTTTCAGAA
GACGCCCCGC CCAGTCATTT CGGCCTCGAC CAGCACATTC TCTCCTTGTA CCCCGCTCCG
CAGGACGACC TGCACAAGGG AACGGCCGGA CATGTTCTCA TCGTCGGCGG CTCTGAAGGA
CTGACCGGCG CCCCCCATCT GGCCGCCCTC GGCGCCCTTC GCGGTGGCGC AGGTTTGGTG
ACCATCGCTA TTCCGGGGGC GCTGGCTTCC GAAGTCAAAA ACGGCGCAGC GGACATCATG
ACCCTGCCCC TTGGTGAGGG AGGCAAATGG AGCGGATCCC TCATTGAGGC TTTGAGTCCC
CATTTCGAGC GCTTTGACAG CGTGGTCATC GGCCCGGGGC TGGGCCGCGA CACTGGGAGC
CGGAATTTCC TCCGTGCCTA CCTCCAGAGC GAACACCCCC CGACGGTCAT CGATGCCGAC
GCGCTGTATT GGCTGGCCGA GGATCCACAA TGTGTCCAAC ATCTGGATCA GGAATGCATC
CTGACGCCCC ACCCCGGGGA AATGGCACGG CTGTGCCGGA AATCCAACAA TGAAGTCCAG
CAAAACCGAC CAGCTATCCT GCGCCAGGCC GTGCAGGATT TCCAGTGCAC TATGGTCTTC
AAAGGGGCCA ATACATTGAT CACTGCGCCC AACCGGCCCA TGTACGTCTC CCCCATCGCC
TGCGCCAACC TGGCCATCGG CGGCGCTGGC GACATCCTGG CCGGTCTGAT AGGCAGCTTA
CGCAACAGCT CCCTTTCCCC CTTGCAGGCC ACCTGCCTCG GCGTGTATTG GCATGGTTTC
GCCGGCGAAC TCCTCCGGGA GAGATACCCC TACAGAGGCA ACCGAGCCAC GGAGTTGGCC
GACATCCTGC CCCTCGTTTT TAAGGAGACA CACGATGCCC ACAGTTGCTG A
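
The GC content field can be recomputed from a sequence block laid out like the one above (groups of ten bases separated by spaces). This is a minimal sketch under that layout assumption; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used here for brevity.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a formatted sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence block, copied from the record above.
first_line = "ATGTACATTC CCCTGCCAAC GCCGGACGAA ATGGCCCTTT GGGACCGAAC CAGCATCGAC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 3))
```

Passing the full 1551 bp block through the same two functions would give the value to compare against the reported 61%.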
 
Protein sequence
MYIPLPTPDE MALWDRTSID TYGIDAKMLM ENASREALHV LLEEFGSLQG KSAVVFAGSG 
NNAGDAFALA RHMTDMGAKV MVLHKKAQQE YTRETAYHLQ LALNTEVPLV RLPEYNLDFL
THPDIVVDGL LGTGFRGLLN KQYKDWINHI NRMGKNSFVF ALDIPSGING WTGLPSPTAV
KADATATFAF AKVGLFMAES RPYVGNLHVR SIGLPARVSE DAPPSHFGLD QHILSLYPAP
QDDLHKGTAG HVLIVGGSEG LTGAPHLAAL GALRGGAGLV TIAIPGALAS EVKNGAADIM
TLPLGEGGKW SGSLIEALSP HFERFDSVVI GPGLGRDTGS RNFLRAYLQS EHPPTVIDAD
ALYWLAEDPQ CVQHLDQECI LTPHPGEMAR LCRKSNNEVQ QNRPAILRQA VQDFQCTMVF
KGANTLITAP NRPMYVSPIA CANLAIGGAG DILAGLIGSL RNSSLSPLQA TCLGVYWHGF
AGELLRERYP YRGNRATELA DILPLVFKET HDAHSC
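
The protein sequence follows from the gene sequence by codon translation. Below is a minimal sketch, assuming the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code; the names `CODON_TABLE` and `translate_cds` are hypothetical. It does not handle the alternative start codons that table 11 permits, which is not a problem here because this gene begins with ATG.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G; third base varies fastest),
# paired with the amino-acid string for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(block: str) -> str:
    """Translate a formatted CDS block; a trailing stop codon is dropped."""
    seq = "".join(block.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate_cds("ATGTACATTC CCCTGCCAAC GCCGGACGAA"))  # MYIPLPTPDE
print(translate_cds("CACAGTTGCTGA"))  # HSC
```

Both fragments agree with the start (MYIPLPTPDE) and end (HSC) of the protein sequence listed above.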