Gene Dret_1572 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1572 
Symbol 
ID8419402 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1820072 
End bp1821490 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content57% 
IMG OID645038145 
ProductUTP--glucose-1-phosphate uridylyltransferase 
Protein accessionYP_003198434 
Protein GI258405692 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4284] UDP-glucose pyrophosphorylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGACA CCGCCAATTT TCAGCCCTTT GCGGCAAAGA TGCATAATGC CGGACTGCGT 
CCCGAACTGA TCGATTTGTT TCGGGAGTAT TACCAACAAT TGGCTTCGGG ACACACCGGC
AAACTCTCAG AGGCCGCGAT CCGTCCACTG GATAAAGCCA ATGATCTCTT GCACCAGGAC
GATCTTACTG CTGAAGCGGG AACAAATGGT CGTCATACCC TGGATCAGAC AGTCATGATC
AAGCTCAACG GCGGCCTGGG CACGAGTATG GGAATGCCGT ATGCCAAGTC GTTACTGGAA
GTCAAACAGG GCAATAATTT TCTCGACGTC ATTGTCATGC AGTGCAACGG GTGTGACGGC
CAGTTGCAGT ATTCGATTCC GCTGGCGCTG ATGGACAGCT TTGCCACGCA TCAGGAGACC
AACGACTATC TGCAGCAGCA GGGGATCCGC CTCGGTCAGG ACGTCTTCAC GTTTTTGCAG
CACAAATTCC CCAAGATCCG CCAGGACACT CTCGAGCCCG CGACCTATCC CGAGGATCCG
GAATTAGAGT GGAACCCTCC CGGCCACGGC GACATTTACG CTGCCCTGGA AACCTCGGGT
CTGCTCAACC AGCTCTTGAG CGACGGGTAC CGCTACGCCT TTGTCTCCAA TTCGGACAAT
CTCGGCGCGG TCGTGGATTC CCGTTTGCTG GGCGCGTTTG CCGACAGTGG CACTCCGTTC
ATGATCGAGG TCTGCCGGCG CACCGGGGCG GACACCAAAG GCGGGCATTT GGCCCGGCAC
AAGGACGGGC GCCTCATCCT GCGGGAAATC GCCCAATGTC CGGATGAGGA ACTCGACGCC
TTTCAGGATG TCGAGCGCTT CAAATATTTC AATACCAATA ATATCTGGAT CGATCTCCAG
CAGCTGCGGG ATTTTATCGA TGCTCACGGT TTTCCCCAAC TCCCGATCAT CGTCAATCCC
AAAACCGTCA ATCCCCGAGA CGAGAATTCG ACCCCTGTCT TCCAGATCGA GACGGCGATG
GGGGCCGCGG TGGCCGCGTT TCCGGGAGCC CTGGCCGTCC AGGTCAACCG GGATCGATTC
ATCCCGGTCA AAAAGACGAA TGACCTGCTG GCAGTGTGGT CGGATTGCTA TATCTTTACC
CCTGATTGCC GCATCGTCCC CAATCCCGAG CGGCGCTTGG GGACCATCGT CATCAAGCTC
GATTCGACGT ATTTTAAGAA GATCGAGCAG TTGACCGAGC GGTTCCAGCA CGGCGCGCCT
TCGTTGGTGG CCTGTTCGTC CCTGACCATA CACGGCGATT TCGCCTTTGG CCCCAATGTT
GTGTGTCGCG ACGATGTCGT TTTAGAAAAC CGCGGCACGG AGCAAGTCGT TATCCCCGAA
GGTACGGTCC TGGAGGGTAA ACAGGTCTGG GGGGCATAA
 
Protein sequence
MSDTANFQPF AAKMHNAGLR PELIDLFREY YQQLASGHTG KLSEAAIRPL DKANDLLHQD 
DLTAEAGTNG RHTLDQTVMI KLNGGLGTSM GMPYAKSLLE VKQGNNFLDV IVMQCNGCDG
QLQYSIPLAL MDSFATHQET NDYLQQQGIR LGQDVFTFLQ HKFPKIRQDT LEPATYPEDP
ELEWNPPGHG DIYAALETSG LLNQLLSDGY RYAFVSNSDN LGAVVDSRLL GAFADSGTPF
MIEVCRRTGA DTKGGHLARH KDGRLILREI AQCPDEELDA FQDVERFKYF NTNNIWIDLQ
QLRDFIDAHG FPQLPIIVNP KTVNPRDENS TPVFQIETAM GAAVAAFPGA LAVQVNRDRF
IPVKKTNDLL AVWSDCYIFT PDCRIVPNPE RRLGTIVIKL DSTYFKKIEQ LTERFQHGAP
SLVACSSLTI HGDFAFGPNV VCRDDVVLEN RGTEQVVIPE GTVLEGKQVW GA