Gene Dret_0153 details


Gene Information

Locus tag: Dret_0153
Symbol:
ID: 8417957
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 197120
End bp: 198196
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 60%
IMG OID: 645036718
Product: glucose-1-phosphate thymidylyltransferase
Protein accession: YP_003197033
Protein GI: 258404291
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1209] dTDP-glucose pyrophosphorylase
TIGRFAM ID: [TIGR01207] glucose-1-phosphate thymidylyltransferase, short form
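
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 197120
end_bp = 198196

# Assumption: 1-based, inclusive coordinates, so both end points count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codon_count, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)
```

Under these assumptions the computed values agree with the listed gene length (1077 bp) and protein length (358 aa).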


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.34957
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAAACT GCGCGGGACG CCAGTCTCAC CTCTCACGGT GCGCCTTGGT CCTGCAGGCA 
ATCAGCCGCT CCCACGAGAA AACACCGGAG CGCAGCCTCC TCTTGCCCTT CCCTGCCCTT
CAAGCTAACA ACGCCCACGT CCTCACTGCG TCACTCGCTT CGGTATTGGA GAACGGAGTT
TCCATGAAAG GCATCATTCT CGCCGGGGGA TCCGGCACAC GGCTGTATCC GTTGACCTGG
GGCGTGAGCA AACAGCTTTT GCCCATCTAC GATAAACCCA TGATCTATTA TCCCCTTTCC
GTGCTTATGC TCGCCGGTAT CAGGGAGATT CTGATCATTT CCACACCACA GGACATCCCC
CGTTTCGAAC GGCTTCTGGG CAGCGGCGAA CAAATCGGGC TTCGCTTGAC GTATAAGACC
CAGCCGGAGC CGGAAGGGTT GCCCCAGGCG TTTGTCCTCG GGCGGGAGTT TATCGGCGAC
GACTCGGTCT GTCTCGTCCT GGGAGACAAT CTGCTTTACG GCGAAGGGCT CTCGCGGATC
CTGCAGCGGT GTGCCGCCCT GGAACAAGGG GGGATCGTTT TCGGCTATCC AGTCCGGGAT
CCGCGGCAGT ACGGCGTGGT GGAATTCGAC GCCCATGGGC GGGCCACGCG CATCGTCGAA
AAACCGGAGA AGCCGCGGTC AAAATATGCG GTCCCCGGGA TCTATTTCTA TGACAATACT
GTGACCGAGA TCGCCGCACA GTTGCGTCCC TCCTCACGCG GCGAACTGGA GATCACGGAC
ATCAACACCG CCTATCTCCA GGCTGGCACA CTCCACGTCG AAGTCCTGGG CCGCGGGTAC
GCCTGGCTTG ACGCCGGGAC CCATGAATCC CTGCACCAGG CCGCGAGCTT CGTCCAGGCT
ATCCAGGAGC GCCAGGGATT CAAACTCGGC TGTATCGAGG AAATCGCCCT GCGCAAAGGA
TACATCACGC CGGATCAGGT CCGTGAACTC GCCGCTCCCA TGGCCAAAAA CGATTACGGC
GCCTACTTGC TCCAGCTTGT CGAGGAACTG CACACCTACG GACAACCGGC CTCCTGA
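
The GC content field can be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: `gc_percent` is a hypothetical helper that ignores the spaces and line breaks used to group the sequence into blocks of ten bases, and it assumes the sequence contains only A, C, G and T.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence.

    Whitespace (the 10-base grouping and line breaks) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in bases)
    return 100.0 * gc_count / len(bases)


# Usage: paste the full gene sequence (all lines) into the string.
example = "ATGCGAAACT GCGCGGGACG"  # first two blocks of the gene sequence
print(round(gc_percent(example), 1))
```

Applied to the complete sequence, the rounded result should be comparable to the GC content value listed in the Gene Information section.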
 
Protein sequence
MRNCAGRQSH LSRCALVLQA ISRSHEKTPE RSLLLPFPAL QANNAHVLTA SLASVLENGV 
SMKGIILAGG SGTRLYPLTW GVSKQLLPIY DKPMIYYPLS VLMLAGIREI LIISTPQDIP
RFERLLGSGE QIGLRLTYKT QPEPEGLPQA FVLGREFIGD DSVCLVLGDN LLYGEGLSRI
LQRCAALEQG GIVFGYPVRD PRQYGVVEFD AHGRATRIVE KPEKPRSKYA VPGIYFYDNT
VTEIAAQLRP SSRGELEITD INTAYLQAGT LHVEVLGRGY AWLDAGTHES LHQAASFVQA
IQERQGFKLG CIEEIALRKG YITPDQVREL AAPMAKNDYG AYLLQLVEEL HTYGQPAS
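
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below is an illustration rather than the tool that produced this record: `translate` and `CODON_TABLE` are hypothetical names, and the table uses the standard amino acid assignments, which translation table 11 shares with the standard genetic code (the two differ only in their permitted start codons).

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout of genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence from its first base up to the first stop codon."""
    bases = "".join(seq.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# Usage: the first line of the gene sequence yields the first 20 residues.
first_line = "ATGCGAAACT GCGCGGGACG CCAGTCTCAC CTCTCACGGT GCGCCTTGGT CCTGCAGGCA"
print(translate(first_line))
```

Translating the first line of the gene sequence this way reproduces the first two blocks of the protein sequence above.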