Gene Dret_1086 details


Gene Information

Locus tag: Dret_1086
Symbol:
ID: 8418911
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 1276218
End bp: 1277213
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 56%
IMG OID: 645037658
Product: ADP-L-glycero-D-manno-heptose-6-epimerase
Protein accession: YP_003197952
Protein GI: 258405210
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID: [TIGR02197] ADP-L-glycero-D-manno-heptose-6-epimerase
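
The Start bp, End bp, Gene Length, and Protein Length fields above are mutually consistent, and the relationship can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (which matches the listed 996 bp) and that the final codon of the unspliced CDS is a stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(1276218, 1277213)
print(length)                  # 996
print(protein_length(length))  # 331
```

Both values agree with the Gene Length and Protein Length fields listed for Dret_1086.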


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.163368
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.0103099
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATAGTTG TTACTGGTGG TGCTGGTTTT ATCGGGAGTG CCTTTGTCTG GCAACTCAAC 
CAAATGGGAA TCAGGGATAT TCTGGTGGTC GACAACCTGG CCTGCAGTGA AAAGTGGAAA
AATCTGGTCA ATCTTGATTA TCAGGATTAC CTGCACCGGG ACGCCTATCT GGAGAAGGTC
CGCATGGACC GGTTGCCTGC GCCACGGGCC ATTATCCACA TGGGGGCCTG TTCGTCGACC
ACGGAGCGGG ACGCCGATTT CCTCATGGAA AACAATTACC GCTACAGCAA GATTCTCGCC
GAGTACGCCA TGAAGCATGG CGCCCGGTTT ATTTACGCTT CGAGCGCGGC CACCTATGGC
GACGGCCAGT TGGGGTTTGA TGACGATCTG CGTCTGGCGC CGCAACTCAA GCCGTTGAAC
ATGTATGGGT ATTCCAAACA GCTCTTTGAT CTCTGGGTTA TGCGCAATGG CCTGCTCGAC
CGACTCTGTG GCCTGAAATT TTTCAATGTT TTCGGTCCGA ACGAATACCA TAAGCAGGAT
ATGCGCAGTG TGGTCTGCAA GGCCTTCAAT CAGGTGCACG ACCACGGCCG CATCCGCCTT
TTCAAATCCT ACCACCCGGA ATACGAGCAC GGCGAACAGC ACCGCGATTT CGTCTATGTC
AAAGATTGTG TGGCGGTCAT GGCTTGGCTT TTGGACCATC CCGGGGTCAA TGGCATCTAC
AATGTCGGCA CCGGCACGTC CCGGACCTGG AACGACCTCG CCCGTGCCGT GTTCATGGCC
ATGGGGCAGT CCGAACGCAT AGAATACATG GAGATGCCGG AGTCGTTGCA GGCCAAATAC
CAGTACTATA CTCAGGCCCG CATGGAGCGC TTGGCCCGAG CCGGGTGCCC GGTGCGCATG
CGTTCCCTCG AAGACGGCGT TGCTGATTAC ATCGCGCATC TGCAGGCCTC TGATCCCTAC
CTCGAGCCTG AGACCCGGGC CGGAGCCGCC GAATAA
 
Protein sequence
MIVVTGGAGF IGSAFVWQLN QMGIRDILVV DNLACSEKWK NLVNLDYQDY LHRDAYLEKV 
RMDRLPAPRA IIHMGACSST TERDADFLME NNYRYSKILA EYAMKHGARF IYASSAATYG
DGQLGFDDDL RLAPQLKPLN MYGYSKQLFD LWVMRNGLLD RLCGLKFFNV FGPNEYHKQD
MRSVVCKAFN QVHDHGRIRL FKSYHPEYEH GEQHRDFVYV KDCVAVMAWL LDHPGVNGIY
NVGTGTSRTW NDLARAVFMA MGQSERIEYM EMPESLQAKY QYYTQARMER LARAGCPVRM
RSLEDGVADY IAHLQASDPY LEPETRAGAA E
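
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how such a listing might be processed, the code below strips the whitespace, computes GC content, and translates the DNA codon by codon. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code. Table 11 also allows alternative start codons, which this sketch does not handle; that is not an issue here because the gene starts with ATG. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a DNA sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence listed above.
first_line = "ATGATAGTTG TTACTGGTGG TGCTGGTTTT"
print(translate(first_line))  # MIVVTGGAGF
```

The printed peptide matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and Protein sequence entries to be checked in the same way.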