Gene Htur_4659 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_4659 
Symbol 
ID8745408 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013745 
Strand
Start bp241014 
End bp242273 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content60% 
IMG OID646515168 
ProductN-acylglucosamine 2-epimerase 
Protein accessionYP_003406115 
Protein GI284172733 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2942] N-acyl-D-glucosamine 2-epimerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGACA TAGCTACGCT CAAGGAAGAG TATCTCTCGT CGCTCAAGCA AACGCTGACG 
GACAACGTCC TCGATTTCTG GTTCCCCCGC AGCATCGACG TTGAACACGG AGGGTTCATC
ACCAGTTACG ACGAGCACGG CGAGTTCGCC GGCAACGACA ACAAACAGGT CGTCACACAG
GCGCGCATGG TCTGGTTGGC AGCGCGGCTC GTCCGCGAAG GGTATGGCGA CGAGTACCGT
GACATCGCCG ATCATGGGTT CGCATTTCTG GTCGACGAGC TGTGGGACGA ACCTAACGGC
GGCTTCGTGT GGGAAGTCCG GCGCGACGGA ACCACGGTCA AACCAAACAA ACACCTGTAC
GGACAGTCAT TCGGTCTCTA CGCGCTCTCC GAGTACTACC GCGCGACCGG GGACGACAAG
GCAGCCGACT ACGCCCACGA GCTAGTCGAC TTGATGGACG AACACGCCAA AGACGGCGAG
CACGGGGGGT ACATCGAGTA CTTCACGCCT GACTGGGAAC CGATCACGGA GGGACAGACA
TACCTCGAAA ACATCGAACC GGACTGGTCG CCTAAGGAAT CAGGCGACAG CGTCCTCGAT
CCGACGCTGA AGCTGATGAA TACGCATCTC CATCTCATGG AGGCGTTCAC GACCTACTAC
GAGGCGTTCG ACACTAGTCG CGGACGGGAG CGCCTCCACG AACTACTAAC CATTCTCACT
AACACGGTTT ACCGGAAGAA TCGCGGCTTC TGTACGGACA AGTATGATCC CGACTGGTCG
CAGAAGCTCG ACGAGGAGTT TCGGGTCGTC TCGTACGGGC ACGATCTGGA GACCGTCTGG
CTCGCAATGG AAGCCGCTGA CACGCTCGGC CACTCACAGG ACCTGTACCG GGAGTTCTTC
AAGACGCTGT GGGATTACTC GCTGGAATAC GGGTACGACG AGGAGCGCGG CGGGTTCTAC
TTCTATGGCG GCTTCGACGA ACCCGCAAGC TTCCGCGTCA AAGCCTGGTG GGTGCAGGCC
GAGTGTATGA CCAGCGCTTT GCGAACCTAC GAGTGTACCG GCGACGACCG GTATCTCGAC
GTCTTCGCCG ACACGTGGGA GTTCCTCGAC GACCATCAGA TCGACCGCGA ACACGGCGAG
TGGCACTCCG GCATCAACGA CGATCTCGAA CCCGTCGGTC GCAAGGGCGC GGTCTACAAG
GCGGCATACC ACAACGGTCG AGCGCTACTC GAGTGTATCG CAGCCCTCGA ACGGCTGTAG
 
Protein sequence
MADIATLKEE YLSSLKQTLT DNVLDFWFPR SIDVEHGGFI TSYDEHGEFA GNDNKQVVTQ 
ARMVWLAARL VREGYGDEYR DIADHGFAFL VDELWDEPNG GFVWEVRRDG TTVKPNKHLY
GQSFGLYALS EYYRATGDDK AADYAHELVD LMDEHAKDGE HGGYIEYFTP DWEPITEGQT
YLENIEPDWS PKESGDSVLD PTLKLMNTHL HLMEAFTTYY EAFDTSRGRE RLHELLTILT
NTVYRKNRGF CTDKYDPDWS QKLDEEFRVV SYGHDLETVW LAMEAADTLG HSQDLYREFF
KTLWDYSLEY GYDEERGGFY FYGGFDEPAS FRVKAWWVQA ECMTSALRTY ECTGDDRYLD
VFADTWEFLD DHQIDREHGE WHSGINDDLE PVGRKGAVYK AAYHNGRALL ECIAALERL