Gene Dret_0395 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0395 
Symbol 
ID8418200 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp482863 
End bp484083 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content58% 
IMG OID645036958 
ProductUDP-N-acetylglucosamine 2-epimerase 
Protein accessionYP_003197272 
Protein GI258404530 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0381] UDP-N-acetylglucosamine 2-epimerase 
TIGRFAM ID[TIGR00236] UDP-N-acetylglucosamine 2-epimerase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0132333 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.602884 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATTA TGTCTGTGGT TGGTGCTCGG CCCAACTTTA TGAAGATCGC TCCATTTATT 
CGGGCTATTG AGGACTATAA CAATGCCTTC AGGGAGGATT CCCAGCCTGT CCCTCCTTCC
CGCCCCCCCT CTGGCTCCCT CCATCACTCT GCCACCTCGC CTCTCCAGCA TATCCTCGTG
CACACCGGGC AGCACTATGA CGACAAAATG TCCTTGGCCT TTTTTGAAGC CCTGGATATC
CCCCAAGCGG ACATCAATTT GGAAATCGGC TCCGGCTCCC ATGCTGAACA GGTGGGGCAG
ACCATGATAG CCTTTGAAAA GGTGCTTCGG GAACATAGGC CGGACTGGGT CGTGGTGGTC
GGGGACGTGA ACGCCACCTG CGCGTGCTCA ATCACGGCCA AAAAGGAACA TGTCAAATTG
GCGCATATCG AGGCGGGGCT CAGATCCCGG GACATGACCA TGCCGGAAGA GGTGAACCGG
CTGGTGACGG ACCGCTTGTC CGATCTGTTG TTGACACCGG ATCAGCTCTC AAGCGCCAAT
CTGCGCGCCG AAGGCGCGGC TGCAACCGCG ATCCACTTTG TTGGCAATAT TATGATCGAT
ACCCTGGAGC ACCAGCGGGC CAAGGCCGAA CGCCTGGAAA TCGCCACTAT TGTCCAGGAG
AACGCTATTG CCGGGCAAAA GCGTCCGGAG GCGTCGCCGC TGCGTCAAGA CGCGTTCGCA
TTGATGACCA TGCACCGACC TTCGAATGTG GATGACGCAG GGACACTCAC CGCTATTCTG
GATTTTCTGC TGGAGGAAGT GGCTGCGGAA ATGCCGCTTG TCTGGCCGAT CCATCCGCGG
ACGCAACAGC GATTGCAAGA CTGCGGGCTG TGGGAAAAGG TGTTGGATTG TCCAGGGGTG
CTTCTTTTGC ACCCGCTTGG CTACCATGAA CTGTTGCGTT TGAATATGGC TGCCCGGGTG
GTGCTCACCG ATAGCGGGGG GCTACAAGAA GAATGCTGTG TTCTGGGGAC GCCCTGTTTG
ACTTTGCGGT GGAATACTGA GCGGCCGGTG ACCTTGCAAG AACATGGGGG CGCCAGCATA
TTGGTGGGCA ATGATGTACA CAAAATCCGG GACGCCTACC GAGAAATGCT GGTAGCCGAG
CGGGCTCCGG CCCGACCGGA ACTCTGGGAC GGTCGCACCG CGCAGCGTAT TGTCCAGGCA
TTGGTGGCGT TCGAAGGATA G
 
Protein sequence
MKIMSVVGAR PNFMKIAPFI RAIEDYNNAF REDSQPVPPS RPPSGSLHHS ATSPLQHILV 
HTGQHYDDKM SLAFFEALDI PQADINLEIG SGSHAEQVGQ TMIAFEKVLR EHRPDWVVVV
GDVNATCACS ITAKKEHVKL AHIEAGLRSR DMTMPEEVNR LVTDRLSDLL LTPDQLSSAN
LRAEGAAATA IHFVGNIMID TLEHQRAKAE RLEIATIVQE NAIAGQKRPE ASPLRQDAFA
LMTMHRPSNV DDAGTLTAIL DFLLEEVAAE MPLVWPIHPR TQQRLQDCGL WEKVLDCPGV
LLLHPLGYHE LLRLNMAARV VLTDSGGLQE ECCVLGTPCL TLRWNTERPV TLQEHGGASI
LVGNDVHKIR DAYREMLVAE RAPARPELWD GRTAQRIVQA LVAFEG