Gene Dret_2006 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_2006 
Symbol 
ID8419851 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2303845 
End bp2304834 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content62% 
IMG OID645038594 
Productthiamine-monophosphate kinase 
Protein accessionYP_003198868 
Protein GI258406126 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0611] Thiamine monophosphate kinase 
TIGRFAM ID[TIGR01379] thiamine-monophosphate kinase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCTTC GTTCCGAAGA CAGTTTTCTT GCCTTGATCG ACCGGTATTT TCCCAATACT 
CACGCCCATA TGCCATTGGG GCGCGGCGAT GATTGCGCGG TGCTGCGCGC GCCGGACTGG
ATGTGCCTGA CCGCGGATAT GTTCGTGGAG GACGTCCACT TCCGGCGGAC CTACTTCAGT
CCCGAAGACA TCGGATACAA GGCCCTGGCA GTGAATCTGT CTGATATCGC CGGCATGGGG
GCCCGGCCGC TCGGCTTTGC CCTCAATTTG ATGGCCACAG GGCGGGAAAG CGATGAATTC
TGCGAGGGGT TGGTCTCAGG TATGGCCGAT CTGGCGCGGG AGCATGATCT GCCCCTGGTC
GGTGGGGATC TCAGCCGAGG ACCGGCTTTG GCGGTGGCTA TCACCATGTG GGGCAAGTCG
CAGAAACGGT TTTTGCTGCG CCGCAATTGT CAGCCTGGCG ATCTGCTTTT TTGTCTCGGC
GATGTCGGTC TGGCCCGCTG CGGGCTGTCC GTGCTCGAGC GCGATGATGA GTCGTTGCGC
TGGCGTTTTC CGGAAGCCGT CGAGGCGCAT TTGCGGCCCC AGATCCGGTT GGAGCAGGCC
CAGACTTTGG GCGAATTCGA GCAGGTGCGC GGGCTGATGG ACGTTTCGGA CGGCCTGATG
CAGGACTTGC CCCGCTTTGT CGGGCCGGGC TTCGGCGTCG AGGTCTTCAT GAGCGAAAGT
GAAGTCCACC CGGAGGTGGT CGAATTCGCA CGGGAGTTTG CGGGATTGCC GGGAGTGGAG
CAGGCCCTTC TCGGCGGCGA GGACTACGCC CTGCTGGGTG CGGCGGCCCC GGGGGCGAGC
CATTTTCTGG AGCGGGAGTT CCCGGAGATC CTGTGGTTGG GGAAAGTGGT CGAACGCTCC
GGGATTTATC TCGACGGCGC CCGCCTGGAT CTCAAGGGTT TTGACCATTT CGGCGCCGAT
TTCCCGGAAC ACAGTGAAGA CGGAGAGTAA
 
Protein sequence
MTLRSEDSFL ALIDRYFPNT HAHMPLGRGD DCAVLRAPDW MCLTADMFVE DVHFRRTYFS 
PEDIGYKALA VNLSDIAGMG ARPLGFALNL MATGRESDEF CEGLVSGMAD LAREHDLPLV
GGDLSRGPAL AVAITMWGKS QKRFLLRRNC QPGDLLFCLG DVGLARCGLS VLERDDESLR
WRFPEAVEAH LRPQIRLEQA QTLGEFEQVR GLMDVSDGLM QDLPRFVGPG FGVEVFMSES
EVHPEVVEFA REFAGLPGVE QALLGGEDYA LLGAAAPGAS HFLEREFPEI LWLGKVVERS
GIYLDGARLD LKGFDHFGAD FPEHSEDGE