Gene Htur_0036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_0036 
Symbol 
ID8740599 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp37566 
End bp41906 
Gene Length4341 bp 
Protein Length1446 aa 
Translation table11 
GC content62% 
IMG OID646510599 
Productvon Willebrand factor type A 
Protein accessionYP_003401610 
Protein GI284163331 
COG category[R] General function prediction only 
COG ID[COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGTTCA ACAAAGAGCG TGTCGTTGCA GTCTTCTTCG CCGTCCTGAT GGTGACGTCG 
GCGATCGCCA TGCCCGCACT CGCGAGCGGT CCGAGTGCGA CTGCGGCAAA CGAACGATCC
GCATCGGTCG CGGCCCATCC CGGAAACGGA CCCGCCCCTA ACGCCGGTCC GCCCGGACAG
AACGGCGGCC CGCCGGGGCA CGACCGCGGT GACGCGAATG TCACGATCCC GGATCTCGAG
GAGAACACGA GCGTCGATGC GATCCTGAAC GCGACCTACC GCCTCGAGGA ACTCGAGATC
GAGAACGAGA CGGCAGCCGC GGTAGCGGTG AACGATACCG TCGACGCGGT CAACGCACTG
GTCGGGGAGT ATCGACGGGT TCGGTACGCC GACTCACGTG CAGCGTTCGA CCGTCTGGCC
GATGCCCAAC GGTCGCTTGC AGTGCTAAAA GACGAGGTCG ACGGCGACGA CGAGGCGATC
GTCGACGCGA TCAGCGAAGA GCTGTACGCG GCGGGCAACG CGAGCGCTCG TCTCGCTGTC
TCCGACGCGA ACGCTGTCGT CGCCGCGAAC GAGGGCGAGT TCCGCAACCC CGGCCAGCGA
CAAAAGGCTG AGAGCGCGCT CGGAAACGCT GTCGACGGGC TCGAACGAGC AGATAGAGCC
GTCTCGGACG GGGTGTCCGG CAACGGAACG GGCAAGTCCA AAGCCAAGAA ATCCGATCGG
CCGATCGGCC CGACTGACCG CGCAAAAGCG CTGACCCACC TCGAGAACGC GTGGAAACAC
GCGGAACGAA CGCTCGATAC GGTCGAGGCC AACACCGAAC CGTCGCTGTC GCTGTCACAG
GGACGGCCCT TCGAGCGGAA CGGGACCGTC CGGGTGTCGA TCCGGGCAGT CCTCTCCGAC
GTTCGTCCGT ACGCGTACGA TAACGCGACG GTGACCGTCA ACGGTGATGC CGACGCCGAC
GCTGTCTCGT TTGCCACAGA CGAGGCGGCC GGGACAGACG CCATCGGATC GACGCTCGTC
GATCTCGGAT CCGATCCCGA AAACGTGACC ATCACGGTGA CGGCGACGGC CGCACACGAC
GCCGATCGAA CGGTCGAAGC GACTCACGAC ATTCGCATTG CGGAGGAGGC TGTCGTCCGG
GAACGACCCG ATCCGGACGA GTACCGGAAC GTCGAGGTCG TAAACGAATC GTCGGGCGTT
TCGGTCGCCG TCGGCGGTGA CGGTCTCGAT GACGCCGACG TTTCGATCAC CGACGAGACG
CCGACGACCG ACGACCCCTA CCGCGCCGGT CCGATGGTTC GCATCGAGAA CGAGAGACCG
ATCGACGACG CGACGGTCGA GATTCCGATC GACGAGAACG CGCTCGAGGC GGACGCCAAC
CTCTCGATCG TGACGTGGGA TCCGACCAGC GACGAACCGT GGACACCGGT CGAGACCGAG
ATCGATCGCG ACGCCGGCGT CGCAACTGCC GAGGTCGATC ACTTCTCGTT CTTCTCGGTG
TTCCGGATCG AGGAGTGGGA AGACGAGACC AGCGACACGA TCACGCTCGA CGGGAACGAG
ACCGATGGTG AGATCGGTAA CGGAAGCGGG ATCGAAACGG CGGACTTCGT CTTCGTGAAC
GACGAAAGCG GCAGCATGAG CGGCTCTCCG ACCCACTACG CCGAACTCGC CGGTAAGCGC
TTCGTCGGCG CACTCACCGA TTCTGAACGA GCGGGGCGGG TCGGTTACGC CTCCGGTGCG
AACCTCGACC AGCCGCTGAC GACCGACCAT GACGCGGTTA ACAGCAGTCT CGAGCGACTG
TCTGCGAGCG GCGGTACCAA CACGAGAGCC GGGCTCCGGG TCGGTCTCAA CCACCTCGAG
GAGGAAGGGT GGGAAAACCG CTCCGCGGTG ATGATCCTGC TGTCCGACGG CAAAAGCGGG
TCCGATCCGT TGCCCGTCGC CGAGGACGCC GCCGAGGCGG GCGTCGAGAT CAGCACGGTG
GGTCTCGGAA ACAACATTAA CGAGAACGAA CTCCGCGAGA TCGCGGCCAT CACCGGCGGC
GACTTCTATC ACGTCGAACG GGAAGAAGAC CTGCCCGACA CGTTCGAACG GGTCGCGGAG
AACCAGACCG GTCCCGGCCT GCAAGACACC AACGGCGACG GCATCCCCGA CCTCGTCGCC
GAAATGGACC TCTCGATGCC GACCGGCGAA CCGGGCGTCG TCGGCGAACC GCTGAATCTC
GATCCGACCG CGCTCGATAC CAGCGGTGAC GGGATTCTCG ACAACGAAAC GGTTGACATC
AAGTACCGCG TGTTCCAGGA GGACAACGAA ACGAAGCTCC ACGCCGCCGT TACGTACGCC
GAACACCACC CGGCACGAAT CGACACGACC GGTGACGGCC TGACCGACGC CGAACAGCTC
TCGGATCGGA CGATCACCTA TACGGATTCG CGGTCCGATT CACTCGAGTT CCTCTCGGAA
CTCGAGGATG CCGACGATAT CGACGACCTG GACGGACTCG AGGGCGACGT GTTGACGACC
GACACCGTTC GCTCCGATCC GCTCGTCGAC GACAGCGACG GCGACGGTGT GACCGACGCC
GAGGAGGTGC GTCTCGGGAC CGATCCCGAG TCGCGCGACA CCACGGGAGA CGGAATCTCA
GACTCCGAGG GCCTGAACGG CGACTACGAT CCAACCCTCT TCGATATCGA GCCGCCGGAG
ATCACGGTCA CGTACGCGAC GTTTAACGAT CCGGACGCGG ACGTCGAACT GAAAGATCCC
GTCGACGTCG ACTGGTCGAA CGGAAAGGTC AACTTCAATG ATCCGGTCGA CGCCTCGGTG
CGGGTCTCCG GAAGCTACGA GGTCGATTTC ACCGTCACCG ACTCGGCGGG CCTCGACGAG
GCGCGGGTCG TTCGCGACGG TGACGTCGAA GAGACGGTAT CGCTGTCCGG CCAATGGGAT
GACGCTGACG TCGAATTCGA TGTCGGGACG GTCGATACGT TCACCGACGC GTTCGCGGGC
TCGCGAGTCA CGGTGCAGGC CGACGACCGT CACGGGATCG TCGACGGAGT CGGCACGACC
GAAGCCGCTG CCGTCGAAGT CGGCGGCGTT TGGGCCGCGG CGTCAGACGA GCTCCGCGCG
CAAGGAGTCT CCGATCCGCG ACTCGAGCAG GATCTTGGAA CCCTGCAGGG GATGACGACC
GGTGCCGGCG AGTCGATCGA TTCGCTGCGA GCGCTGTACA ACGAACCGAT CGAGACCATC
ACCGCGGTGC GGGAGATCCC CGGCGCCATC GCGAACTTCG ACGAGATCAT CGCGGCGATG
CCCGATTCGA TCGAAGCCCA ACAGCAACGC AACAATCCCC ACGACCCCGA CGAGAGCCCT
CGTCTCTACG AGTCCTTCCG ACAGGGATGG TACGAGGGCT ACATCGCCTG GTTCGTCATC
GAAGCCGCGA TCCCCGCCGG GGAAGCCGGC AAGGCTCTCA AGAGCTCCGA TCGCGTCCGG
AAGACCGTCG ACAAAATTAG CACGCCTCGG ATCCGCCAGG CGGCCCAGAT GGCGGGCCGA
GCGGGTCATA CTGCAAAAAC GCCGGTCAGA TACGGCAGGC TTCAGTTCTC GCGCGGTCTC
TCCACCGGAA TCGGCCTTAC GCAAAAAGCC GGTGAAAACG TACTCAGCAG AGTATCAACG
GTCGGCCAGC AGTATCGGGT GGCGAAACTC CTCAATCGCC ACGATGTCGA CGGTGCTGCT
ATTAACCGAC TCGATGCTGA CGGACAAGAA GCCATTGGCA AGGCAACAGC TCGGAACGGT
GACGACGCAA GTCGGGTAAT GGCCGACGGT GGCCCTGATC CAGTTGCTCG TGCCTATCAA
CTGGACCTCG ACGTGAACAA TGAGAATCTC GTATCAAATC TGTTTAGACA CAGTGAAGCG
GTTGACTTTG AACGGGTTAT CGACAACCTC GAGGAGTTGA ATCGACCGAA CGCAGATATC
GAAGGCGTGG ACGATCTTGC AAAGCGTCTG GCAGCAGGAG ATCAGAGTAA CGTCAAAGGT
GCGGCATTCG AAGCTGAAGT TGCTGTAGTC CGCGGGAGTG ATAACGTAGA GGCGGTTGGG
AAACCAAATC CCTACTCCCG TGGTGAAATT GATATCGAAA CGAGTGATGG ACGAGTCATC
GAAACGAAAA GTGGAGACTA CAGCCAAGCT GCAGATGGTA GCGATAAATA CATCGAGCTA
GAGAACCAGA TCGGTCACTA CCAACAGTAC ACAGAGGTAG AGGGCGGAAC GATCGAGGTT
GCATTCCGAG AAGAACCTCA CGACGATATC AAAGATATGC TCAACGATAA CGATGCAGAG
GTAGAAATTT ACAATGAGTG A
 
Protein sequence
MRFNKERVVA VFFAVLMVTS AIAMPALASG PSATAANERS ASVAAHPGNG PAPNAGPPGQ 
NGGPPGHDRG DANVTIPDLE ENTSVDAILN ATYRLEELEI ENETAAAVAV NDTVDAVNAL
VGEYRRVRYA DSRAAFDRLA DAQRSLAVLK DEVDGDDEAI VDAISEELYA AGNASARLAV
SDANAVVAAN EGEFRNPGQR QKAESALGNA VDGLERADRA VSDGVSGNGT GKSKAKKSDR
PIGPTDRAKA LTHLENAWKH AERTLDTVEA NTEPSLSLSQ GRPFERNGTV RVSIRAVLSD
VRPYAYDNAT VTVNGDADAD AVSFATDEAA GTDAIGSTLV DLGSDPENVT ITVTATAAHD
ADRTVEATHD IRIAEEAVVR ERPDPDEYRN VEVVNESSGV SVAVGGDGLD DADVSITDET
PTTDDPYRAG PMVRIENERP IDDATVEIPI DENALEADAN LSIVTWDPTS DEPWTPVETE
IDRDAGVATA EVDHFSFFSV FRIEEWEDET SDTITLDGNE TDGEIGNGSG IETADFVFVN
DESGSMSGSP THYAELAGKR FVGALTDSER AGRVGYASGA NLDQPLTTDH DAVNSSLERL
SASGGTNTRA GLRVGLNHLE EEGWENRSAV MILLSDGKSG SDPLPVAEDA AEAGVEISTV
GLGNNINENE LREIAAITGG DFYHVEREED LPDTFERVAE NQTGPGLQDT NGDGIPDLVA
EMDLSMPTGE PGVVGEPLNL DPTALDTSGD GILDNETVDI KYRVFQEDNE TKLHAAVTYA
EHHPARIDTT GDGLTDAEQL SDRTITYTDS RSDSLEFLSE LEDADDIDDL DGLEGDVLTT
DTVRSDPLVD DSDGDGVTDA EEVRLGTDPE SRDTTGDGIS DSEGLNGDYD PTLFDIEPPE
ITVTYATFND PDADVELKDP VDVDWSNGKV NFNDPVDASV RVSGSYEVDF TVTDSAGLDE
ARVVRDGDVE ETVSLSGQWD DADVEFDVGT VDTFTDAFAG SRVTVQADDR HGIVDGVGTT
EAAAVEVGGV WAAASDELRA QGVSDPRLEQ DLGTLQGMTT GAGESIDSLR ALYNEPIETI
TAVREIPGAI ANFDEIIAAM PDSIEAQQQR NNPHDPDESP RLYESFRQGW YEGYIAWFVI
EAAIPAGEAG KALKSSDRVR KTVDKISTPR IRQAAQMAGR AGHTAKTPVR YGRLQFSRGL
STGIGLTQKA GENVLSRVST VGQQYRVAKL LNRHDVDGAA INRLDADGQE AIGKATARNG
DDASRVMADG GPDPVARAYQ LDLDVNNENL VSNLFRHSEA VDFERVIDNL EELNRPNADI
EGVDDLAKRL AAGDQSNVKG AAFEAEVAVV RGSDNVEAVG KPNPYSRGEI DIETSDGRVI
ETKSGDYSQA ADGSDKYIEL ENQIGHYQQY TEVEGGTIEV AFREEPHDDI KDMLNDNDAE
VEIYNE