Gene Htur_3983 details


Gene Information

Locus tag: Htur_3983
Symbol:
ID: 8744611
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013744
Strand:
Start bp: 238912
End bp: 240138
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 67%
IMG OID: 646514558
Product: N-acylglucosamine 2-epimerase
Protein accession: YP_003405505
Protein GI: 284167227
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2942] N-acyl-D-glucosamine 2-epimerase
TIGRFAM ID:
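
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 238912
end_bp = 240138
protein_length_aa = 408

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last codon is assumed
# to be the stop codon, which does not contribute an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp)           # 1227
print(remainder)                # 0
print(expected_protein_length)  # 408
```

Under these assumptions the computed values agree with the reported gene length of 1227 bp and protein length of 408 aa.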


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACACGGG ATCGGACTGC GAGGCACCGA GCGAGGCTGC TGGCGACGCT GCGACTGCAG 
TATCCGGACG CGCTCGCCGA GAGCGGCTTT CGGTTGCTCC ATCCGGAGAC GGGCGAGCCG
TACACGGACG ATCGCCGCCA TCTCGTCGCT ACCTGCCGTT CGATCGCGAA TTTCTCGGTC
GGTGCCGTCG TCGACGGTCC CGAGTGGTGT CTCGATGCGG CCGAACACGG ACTCGAGTTC
CTGCGGGAGT CACACCGCGC TGCCGACGGC GGCTACCACC TCGTCGTCGA CGACGGCGGT
GAGCCGGTCG ATCGGACGCG GTCGGCGTAC GGACACGCCT TCGTGCTGCT CGCGTACGCC
CGCGCTGCGG CTGCCGGTCT CGACGGCGCC GTCGAGGACC TCCGAGCGAC GCACGAACTG
CTCGAGGAAC GCTTCCGCGA CGATGCCGGG TTATTCCGCA GTGACTGTGA TCCCGATTGG
ACGGAGCGGG AAGCCTACCG GGGCCAAAAC GCCAACATGC ACGCCTGCGA GGCGTATATC
GCGGCCTACG AAGCGACGGG CGAGAGCCGG TACTACGACC GTGCGCGCCA CATCGCCGAG
ACGATCACCG ACGACCTCGC GGCAGCGACG GACGGACTCC TGTGGGAACA CTACACCGAC
GAGTGGATCC ACGAGTTCAC CTACAACGAA GACGAGCCGC GCCACCAGTT CCGTCCGCCG
GGCTACCAGC CCGGACACCA CGTCGAGTGG GCGAAGTTTC TCGGGTTGCT CGATCGCTAC
GGGTGCGAAG ACGAGAGTAT AGTCCCGATG GAGGGGCAGT ACGCTCGCGC GAGAGAACTA
TTCGATGCCG CCGTCGATCT CGGCTGGGCC GACGACGGAT TCGTCTACAC CGTCGACCGC
GACGGTGAAC CGATCGTTTC CGACCGCTAC GGCTGGGCCC TCGCAGAGGC CCTCGGTGCC
GCGGCGGTGC TCGCCGAACG CGCGGATTTC CACGGCGACG AGGAGGAGCG TGACCGCTTC
CGCGACTGGG AACGGCTGCT CGCCGACTGC GCCTCCGCGT ATCGCGGTCC CGCCGGACTC
TGGTACGAGA AGCGGCTGTC GCCGGAAGAC GGCGGCGATC CGATCGGGCC GGAGCCGCCG
GGCGTCGAAT CCGACTATCA CCCCGCGAGT GCGTACTACG AGCGGTGGCG ATCTGCGCGG
AAAGTAAGTG ACCGTCAACG ACCGTGA
 
Protein sequence
MTRDRTARHR ARLLATLRLQ YPDALAESGF RLLHPETGEP YTDDRRHLVA TCRSIANFSV 
GAVVDGPEWC LDAAEHGLEF LRESHRAADG GYHLVVDDGG EPVDRTRSAY GHAFVLLAYA
RAAAAGLDGA VEDLRATHEL LEERFRDDAG LFRSDCDPDW TEREAYRGQN ANMHACEAYI
AAYEATGESR YYDRARHIAE TITDDLAAAT DGLLWEHYTD EWIHEFTYNE DEPRHQFRPP
GYQPGHHVEW AKFLGLLDRY GCEDESIVPM EGQYARAREL FDAAVDLGWA DDGFVYTVDR
DGEPIVSDRY GWALAEALGA AAVLAERADF HGDEEERDRF RDWERLLADC ASAYRGPAGL
WYEKRLSPED GGDPIGPEPP GVESDYHPAS AYYERWRSAR KVSDRQRP
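
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few rows. The sketch below is an illustration, not the database's own procedure: it builds a codon table for translation table 11 (which assigns the same amino acids as the standard code and differs only in permitted start codons) and translates the first and last rows of the gene sequence. Each full row holds 60 nucleotides, so every row starts in frame. The helper name `translate` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons, listed in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-mers
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First and last rows of the gene sequence, copied from the record.
first_row = "ATGACACGGG ATCGGACTGC GAGGCACCGA GCGAGGCTGC TGGCGACGCT GCGACTGCAG"
last_row = "AAAGTAAGTG ACCGTCAACG ACCGTGA"

print(translate(first_row))  # MTRDRTARHRARLLATLRLQ
print(translate(last_row))   # KVSDRQRP
```

Both results match the start and end of the listed protein sequence, and the last row ends in the stop codon TGA.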