Gene Htur_2039 details


Gene Information

Locus tag: Htur_2039
Symbol:
ID: 8742638
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013743
Strand:
Start bp: 2109652
End bp: 2111238
Gene Length: 1587 bp
Protein Length: 528 aa
Translation table: 11
GC content: 71%
IMG OID: 646512621
Product: protein of unknown function DUF58
Protein accession: YP_003403596
Protein GI: 284165317
COG category: [R] General function prediction only
COG ID: [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain)
TIGRFAM ID: [TIGR01451] conserved repeat domain
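
The coordinate and length fields above can be cross-checked against each other. The following is a minimal Python sketch, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The helper names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_len_bp // 3 - 1


# Values copied from the record: Start bp, End bp.
length_bp = gene_length(2109652, 2111238)
print(length_bp)                  # 1587, matching "Gene Length"
print(protein_length(length_bp))  # 528, matching "Protein Length"
```

Under these assumptions both derived values agree with the listed gene and protein lengths.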


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGGACA GAGACGACGA CGGACGATCG GCGGACAGCC GGAGCGAGCG AGCCGACGGC 
GGCGAGGTCG CCGGAACCGA GACGGAGACT CGCTCCGAGT CCGGCCCCGA CGTCGGTACC
GGTAGTCGAA CGAACGCCGA GACTGAGCCC GACGCTGCCG AGGCTGAAGC CGGCGCCGCT
GCCGAGGTTA CGTCTAACAC AGACACCGAG ACGGACGACA GTACTGACGA CGATACCGAC
CCCGATCCCG ACGGTGCCGG CGAGCCCGCA GACGGCGATT CCGGGACCGA AGACGCCCCG
GAATCGGTCG TCGTCGACTC GAGCGTCCGG GAGACGAACC GCTGGGCAGG CGTCGCCGCC
GTCGCCTCGG TGTTCGGCGG CGCCGGCGTT ATCGTCACGT CGCCGGCGCT GCTGCTGGCG
GCGGTCGTCG GCATCGCGTA CGCGGCCTAC GCCCGGGCCG GTCGGCCGCC GACCCCGACG
CTCTCGATCA GCCGCGAACT CGAGGACGAC GACCTCGAGC CGGGTGAGCC CGTTCGCGTG
ACGGTACGCG TTCGCAACGA CGGGGACGAA CTCCTGCCGG ATCTCCGGCT CGTCGACGGC
GTGCCGTCGG AACTCGTCGT CACCGACGGC TCCCCGAGAC GCGGCACCGC GCTCCGGCCG
GGCGAGACGG AGACGTTCTC CTACGCGGTG ACCGCTCGCC AGGGCACCCA CGCGTTCGAA
CCGATAGCCG TCGTCGCGCG AGGGTTCACC GGCGACGCCG AGCGCGTCCA GCGGATCCGC
GTCGACACCG AACTCCGCTG TCCGCCCGCC GAGGCGGAGA CCGACCTCCC GCTGCGGTCG
CTGACGCTGC CTTTGACCGG CCGCGTCGAG ACCGACGTCG GCGGCGAGGG CCTCGAGTTC
CACTCGACGA GGGAGTACCG ACGCGGCGAT CCGCTCTCGC GGATCGACTG GAACCGCCGG
GCTCGCGGTC AGGAGCTGAC GACCGTCACG TTCCGCGAGG AGCGGTCGGC GACGGTGATG
CTCGCCGTCG ACACGCGCAC GGACGCGTAC CGTCGACCCG ACGAGACGGG TCGTCATGCG
GTCGATCGCA GCATCGAGGC GGCCGTCGCC GTGCTCGACG CGCTGGATGC CGGCGGCAAC
AACGTCGGCC TCGCCAGCTT CGGACCGCAC ACAGAGTGGC TGGCACCGGG CTCGGGGCCC
GACCATCGAG CGTCGGCCCG CCGACTGCTC GCGACCGATT CCGCGTTCGA ACTGTCCCCG
CCGGAGTCGT CGGCGTCGAT GCTGTACGTC CAGCGCAAGC GATTCCGCTC GCGGATCCCC
GCTGATTCGC AGGTGATCCT GTTCACGCCG CTGTGCGACG ACGACATCGT GCGTACCGCG
CAGTTGCTCG AGGCCGACGG CCATCTCGTC ACCGTTATTA GTCCCGATCC GACTGGACGC
GACGCGCCCG GCGAACGGCT CGGGGTGGTC GAACGAGCGG CTCGCATCTC GACGCTCAGG
GGGGCGGGCA TCAGGGTCGT CGATTGGCCG GCGGACGACT CGCTGGCGGC GACGCTCGAG
CGCAGTCGAC GGCGGTGGTC GGCGTGA
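
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any computation. The sketch below shows one way to strip the layout and compute a GC percentage that can be compared with the "GC content" field. It assumes that field is the plain fraction of G and C bases over the gene; the function names are illustrative only.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Small demonstration on a toy input: 4 of the 6 bases are G or C.
print(round(gc_percent("ATGC GC"), 2))
```

Pasting the full gene sequence into `gc_percent` gives a value to compare against the rounded figure listed in the record.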
 
Protein sequence
MTDRDDDGRS ADSRSERADG GEVAGTETET RSESGPDVGT GSRTNAETEP DAAEAEAGAA 
AEVTSNTDTE TDDSTDDDTD PDPDGAGEPA DGDSGTEDAP ESVVVDSSVR ETNRWAGVAA
VASVFGGAGV IVTSPALLLA AVVGIAYAAY ARAGRPPTPT LSISRELEDD DLEPGEPVRV
TVRVRNDGDE LLPDLRLVDG VPSELVVTDG SPRRGTALRP GETETFSYAV TARQGTHAFE
PIAVVARGFT GDAERVQRIR VDTELRCPPA EAETDLPLRS LTLPLTGRVE TDVGGEGLEF
HSTREYRRGD PLSRIDWNRR ARGQELTTVT FREERSATVM LAVDTRTDAY RRPDETGRHA
VDRSIEAAVA VLDALDAGGN NVGLASFGPH TEWLAPGSGP DHRASARRLL ATDSAFELSP
PESSASMLYV QRKRFRSRIP ADSQVILFTP LCDDDIVRTA QLLEADGHLV TVISPDPTGR
DAPGERLGVV ERAARISTLR GAGIRVVDWP ADDSLAATLE RSRRRWSA
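
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal standard-library version, not the database's own method. It relies on translation table 11 assigning the same amino acids to codons as the standard code (the tables differ in permitted start codons), and it assumes the sequence is given in reading frame on the coding strand. `CODON_TABLE` and `translate` are names introduced here for illustration.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    # Only complete codons are translated; a trailing partial codon is ignored.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, as printed in the record.
print(translate("ATGACGGACA GAGACGACGA CGGACGATCG"))  # MTDRDDDGRS
```

The output matches the first ten residues of the listed protein sequence; the same call on the full gene sequence can be compared against the complete 528-residue entry.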