Gene Htur_1209 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_1209 
Symbol 
ID8741798 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp1264033 
End bp1265502 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content72% 
IMG OID646511788 
Productprotein of unknown function DUF58 
Protein accessionYP_003402773 
Protein GI284164494 
COG category[R] General function prediction only 
COG ID[COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACCCA CGCGCCGACT GTGGGCCGTC GCGAGCCTCG CGGCCTTTCT CGCGGGCGTC 
GCAGTCGTTA CTGCCCGCCC GCTCCTCCTC GGCGGTGCCG GGCTGGTCGG CTCGTGGATC
GTCGCGCGCC AGTATCGGTT CTACCGCGCG CTCGAGGAGA CGGTCGACGC GCTGGCCGTC
GAGCAGTCGG CCGTCCGCGC CGGCGTTCGA ACGGGCGATA CCGTTCCGGT GACTCTCTCG
GCGAGGCTGG CCTCACCGTC GCCGCTCGCC GTCGCGATCG AGGCCGGGCT CCCGACGACC
GCCGTGGCCG ACGAATCCTT CTCGCTGTCC CTCGATCCGT CGACGTCCGC GACCACCCGA
ACGGTCGACG TCTCGTGGCC GGTCGCGGGC CGCCACCGGT TTGACGAACC GACTGTGACC
GCGACGGACG GATTCCTCCG CGAGACGGTG TCGCTCGGAA CGACCTCGAC GGTCACCGTC
GAGCCCCGCG GTCCGCGAAC CATCCACGTC GGCGAGGGCG GCGATCGGAT CACGATGGCC
TACGGCGAAC ACGAGGCCGG TCGTCTCGGG TCGGGGATCG AACCCGCGGA ACTCCGCGAG
TACATGCCCG GCGACACGGC CGACCGGATC GACTGGAAGG CCACGGCCAG GCTGGCGACG
CCTCACGTCC GCGAGTACGA GGCCGAGACC GACCGACGGA CGCTACTGGT CGTCGACCAC
CGCGGCTCGC TGGCGACGGG GCGGCCGGAC GAAACCGAAC TCGACTACCT CCGCGACGTC
GCGCTCGCGA CGGCCGCGAG CGCGCGCCGA CTCGGCGACC CCGTTGGACT GCGTACCGTC
GGCGACGAGG GGATCACGTT TCGCCTCGAC CCGACGGCGA CGCCGGTGGC GTACGATCGG
ATCCGGCGTC GATTGCTCGA CCTGGAGCCG ACGGTCGATC CGACGACGCT CGACGGAAGC
GGCCGAGAGG GGCGGCGGAG ACGGACCCCA ACACCCCGGG GCGGCGGCTT CACCGCGGCC
GACGCCCGGG CGAAACGCAT CGGCCTCGGG GACGACGACG ACCCGTTCGC TTCGACCCTT
CGCCCGTTCT ACGCCGCGCG GGAGGGCTAC CGCGAACGCA TCGAATCGGA TCCGCTCTAC
GGCGCCGTCA AGCGTGCCCA CAGCGGCAAC ACCGAGGGGC TCTGGACGAT CCTCTTCACC
GACGACTCGC GGCCGGCGGA GCTCCGCGAG ACGGTCAAAC TCGCCCGCGG CAACGGCAAC
TCGGTGCTGG TGCTGCTCGC GCCGACGGTG CTCTACGAAT CCGACGGTCT CGCGGACGTC
GAGGACGCCT ACGATCGCTA CGTCGAGTTC GAGAACCTGC GTCGCGACCT CGCCCGGATG
CCCCGCGTGA CCGCCCTCGA GGTCGGCCCG CGGGATCGCC TCTCGACGAT CCTCTCGGAC
GGCCGCGCCG CTCGAGGTGA GCGCGCGTGA
 
Protein sequence
MKPTRRLWAV ASLAAFLAGV AVVTARPLLL GGAGLVGSWI VARQYRFYRA LEETVDALAV 
EQSAVRAGVR TGDTVPVTLS ARLASPSPLA VAIEAGLPTT AVADESFSLS LDPSTSATTR
TVDVSWPVAG RHRFDEPTVT ATDGFLRETV SLGTTSTVTV EPRGPRTIHV GEGGDRITMA
YGEHEAGRLG SGIEPAELRE YMPGDTADRI DWKATARLAT PHVREYEAET DRRTLLVVDH
RGSLATGRPD ETELDYLRDV ALATAASARR LGDPVGLRTV GDEGITFRLD PTATPVAYDR
IRRRLLDLEP TVDPTTLDGS GREGRRRRTP TPRGGGFTAA DARAKRIGLG DDDDPFASTL
RPFYAAREGY RERIESDPLY GAVKRAHSGN TEGLWTILFT DDSRPAELRE TVKLARGNGN
SVLVLLAPTV LYESDGLADV EDAYDRYVEF ENLRRDLARM PRVTALEVGP RDRLSTILSD
GRAARGERA