Gene Information |
Locus tag | Htur_0036 |
Symbol | |
ID | 8740599 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 37566 |
End bp | 41906 |
Gene Length | 4341 bp |
Protein Length | 1446 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 646510599 |
Product | von Willebrand factor type A |
Protein accession | YP_003401610 |
Protein GI | 284163331 |
COG category | [R] General function prediction only |
COG ID | [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGTTCA ACAAAGAGCG TGTCGTTGCA GTCTTCTTCG CCGTCCTGAT GGTGACGTCG GCGATCGCCA TGCCCGCACT CGCGAGCGGT CCGAGTGCGA CTGCGGCAAA CGAACGATCC GCATCGGTCG CGGCCCATCC CGGAAACGGA CCCGCCCCTA ACGCCGGTCC GCCCGGACAG AACGGCGGCC CGCCGGGGCA CGACCGCGGT GACGCGAATG TCACGATCCC GGATCTCGAG GAGAACACGA GCGTCGATGC GATCCTGAAC GCGACCTACC GCCTCGAGGA ACTCGAGATC GAGAACGAGA CGGCAGCCGC GGTAGCGGTG AACGATACCG TCGACGCGGT CAACGCACTG GTCGGGGAGT ATCGACGGGT TCGGTACGCC GACTCACGTG CAGCGTTCGA CCGTCTGGCC GATGCCCAAC GGTCGCTTGC AGTGCTAAAA GACGAGGTCG ACGGCGACGA CGAGGCGATC GTCGACGCGA TCAGCGAAGA GCTGTACGCG GCGGGCAACG CGAGCGCTCG TCTCGCTGTC TCCGACGCGA ACGCTGTCGT CGCCGCGAAC GAGGGCGAGT TCCGCAACCC CGGCCAGCGA CAAAAGGCTG AGAGCGCGCT CGGAAACGCT GTCGACGGGC TCGAACGAGC AGATAGAGCC GTCTCGGACG GGGTGTCCGG CAACGGAACG GGCAAGTCCA AAGCCAAGAA ATCCGATCGG CCGATCGGCC CGACTGACCG CGCAAAAGCG CTGACCCACC TCGAGAACGC GTGGAAACAC GCGGAACGAA CGCTCGATAC GGTCGAGGCC AACACCGAAC CGTCGCTGTC GCTGTCACAG GGACGGCCCT TCGAGCGGAA CGGGACCGTC CGGGTGTCGA TCCGGGCAGT CCTCTCCGAC GTTCGTCCGT ACGCGTACGA TAACGCGACG GTGACCGTCA ACGGTGATGC CGACGCCGAC GCTGTCTCGT TTGCCACAGA CGAGGCGGCC GGGACAGACG CCATCGGATC GACGCTCGTC GATCTCGGAT CCGATCCCGA AAACGTGACC ATCACGGTGA CGGCGACGGC CGCACACGAC GCCGATCGAA CGGTCGAAGC GACTCACGAC ATTCGCATTG CGGAGGAGGC TGTCGTCCGG GAACGACCCG ATCCGGACGA GTACCGGAAC GTCGAGGTCG TAAACGAATC GTCGGGCGTT TCGGTCGCCG TCGGCGGTGA CGGTCTCGAT GACGCCGACG TTTCGATCAC CGACGAGACG CCGACGACCG ACGACCCCTA CCGCGCCGGT CCGATGGTTC GCATCGAGAA CGAGAGACCG ATCGACGACG CGACGGTCGA GATTCCGATC GACGAGAACG CGCTCGAGGC GGACGCCAAC CTCTCGATCG TGACGTGGGA TCCGACCAGC GACGAACCGT GGACACCGGT CGAGACCGAG ATCGATCGCG ACGCCGGCGT CGCAACTGCC GAGGTCGATC ACTTCTCGTT CTTCTCGGTG TTCCGGATCG AGGAGTGGGA AGACGAGACC AGCGACACGA TCACGCTCGA CGGGAACGAG ACCGATGGTG AGATCGGTAA CGGAAGCGGG ATCGAAACGG CGGACTTCGT CTTCGTGAAC GACGAAAGCG GCAGCATGAG CGGCTCTCCG ACCCACTACG CCGAACTCGC CGGTAAGCGC TTCGTCGGCG CACTCACCGA TTCTGAACGA GCGGGGCGGG TCGGTTACGC CTCCGGTGCG AACCTCGACC AGCCGCTGAC GACCGACCAT GACGCGGTTA ACAGCAGTCT CGAGCGACTG 
TCTGCGAGCG GCGGTACCAA CACGAGAGCC GGGCTCCGGG TCGGTCTCAA CCACCTCGAG GAGGAAGGGT GGGAAAACCG CTCCGCGGTG ATGATCCTGC TGTCCGACGG CAAAAGCGGG TCCGATCCGT TGCCCGTCGC CGAGGACGCC GCCGAGGCGG GCGTCGAGAT CAGCACGGTG GGTCTCGGAA ACAACATTAA CGAGAACGAA CTCCGCGAGA TCGCGGCCAT CACCGGCGGC GACTTCTATC ACGTCGAACG GGAAGAAGAC CTGCCCGACA CGTTCGAACG GGTCGCGGAG AACCAGACCG GTCCCGGCCT GCAAGACACC AACGGCGACG GCATCCCCGA CCTCGTCGCC GAAATGGACC TCTCGATGCC GACCGGCGAA CCGGGCGTCG TCGGCGAACC GCTGAATCTC GATCCGACCG CGCTCGATAC CAGCGGTGAC GGGATTCTCG ACAACGAAAC GGTTGACATC AAGTACCGCG TGTTCCAGGA GGACAACGAA ACGAAGCTCC ACGCCGCCGT TACGTACGCC GAACACCACC CGGCACGAAT CGACACGACC GGTGACGGCC TGACCGACGC CGAACAGCTC TCGGATCGGA CGATCACCTA TACGGATTCG CGGTCCGATT CACTCGAGTT CCTCTCGGAA CTCGAGGATG CCGACGATAT CGACGACCTG GACGGACTCG AGGGCGACGT GTTGACGACC GACACCGTTC GCTCCGATCC GCTCGTCGAC GACAGCGACG GCGACGGTGT GACCGACGCC GAGGAGGTGC GTCTCGGGAC CGATCCCGAG TCGCGCGACA CCACGGGAGA CGGAATCTCA GACTCCGAGG GCCTGAACGG CGACTACGAT CCAACCCTCT TCGATATCGA GCCGCCGGAG ATCACGGTCA CGTACGCGAC GTTTAACGAT CCGGACGCGG ACGTCGAACT GAAAGATCCC GTCGACGTCG ACTGGTCGAA CGGAAAGGTC AACTTCAATG ATCCGGTCGA CGCCTCGGTG CGGGTCTCCG GAAGCTACGA GGTCGATTTC ACCGTCACCG ACTCGGCGGG CCTCGACGAG GCGCGGGTCG TTCGCGACGG TGACGTCGAA GAGACGGTAT CGCTGTCCGG CCAATGGGAT GACGCTGACG TCGAATTCGA TGTCGGGACG GTCGATACGT TCACCGACGC GTTCGCGGGC TCGCGAGTCA CGGTGCAGGC CGACGACCGT CACGGGATCG TCGACGGAGT CGGCACGACC GAAGCCGCTG CCGTCGAAGT CGGCGGCGTT TGGGCCGCGG CGTCAGACGA GCTCCGCGCG CAAGGAGTCT CCGATCCGCG ACTCGAGCAG GATCTTGGAA CCCTGCAGGG GATGACGACC GGTGCCGGCG AGTCGATCGA TTCGCTGCGA GCGCTGTACA ACGAACCGAT CGAGACCATC ACCGCGGTGC GGGAGATCCC CGGCGCCATC GCGAACTTCG ACGAGATCAT CGCGGCGATG CCCGATTCGA TCGAAGCCCA ACAGCAACGC AACAATCCCC ACGACCCCGA CGAGAGCCCT CGTCTCTACG AGTCCTTCCG ACAGGGATGG TACGAGGGCT ACATCGCCTG GTTCGTCATC GAAGCCGCGA TCCCCGCCGG GGAAGCCGGC AAGGCTCTCA AGAGCTCCGA TCGCGTCCGG AAGACCGTCG ACAAAATTAG CACGCCTCGG ATCCGCCAGG CGGCCCAGAT GGCGGGCCGA GCGGGTCATA CTGCAAAAAC GCCGGTCAGA TACGGCAGGC TTCAGTTCTC GCGCGGTCTC TCCACCGGAA 
TCGGCCTTAC GCAAAAAGCC GGTGAAAACG TACTCAGCAG AGTATCAACG GTCGGCCAGC AGTATCGGGT GGCGAAACTC CTCAATCGCC ACGATGTCGA CGGTGCTGCT ATTAACCGAC TCGATGCTGA CGGACAAGAA GCCATTGGCA AGGCAACAGC TCGGAACGGT GACGACGCAA GTCGGGTAAT GGCCGACGGT GGCCCTGATC CAGTTGCTCG TGCCTATCAA CTGGACCTCG ACGTGAACAA TGAGAATCTC GTATCAAATC TGTTTAGACA CAGTGAAGCG GTTGACTTTG AACGGGTTAT CGACAACCTC GAGGAGTTGA ATCGACCGAA CGCAGATATC GAAGGCGTGG ACGATCTTGC AAAGCGTCTG GCAGCAGGAG ATCAGAGTAA CGTCAAAGGT GCGGCATTCG AAGCTGAAGT TGCTGTAGTC CGCGGGAGTG ATAACGTAGA GGCGGTTGGG AAACCAAATC CCTACTCCCG TGGTGAAATT GATATCGAAA CGAGTGATGG ACGAGTCATC GAAACGAAAA GTGGAGACTA CAGCCAAGCT GCAGATGGTA GCGATAAATA CATCGAGCTA GAGAACCAGA TCGGTCACTA CCAACAGTAC ACAGAGGTAG AGGGCGGAAC GATCGAGGTT GCATTCCGAG AAGAACCTCA CGACGATATC AAAGATATGC TCAACGATAA CGATGCAGAG GTAGAAATTT ACAATGAGTG A
|
Protein sequence | MRFNKERVVA VFFAVLMVTS AIAMPALASG PSATAANERS ASVAAHPGNG PAPNAGPPGQ NGGPPGHDRG DANVTIPDLE ENTSVDAILN ATYRLEELEI ENETAAAVAV NDTVDAVNAL VGEYRRVRYA DSRAAFDRLA DAQRSLAVLK DEVDGDDEAI VDAISEELYA AGNASARLAV SDANAVVAAN EGEFRNPGQR QKAESALGNA VDGLERADRA VSDGVSGNGT GKSKAKKSDR PIGPTDRAKA LTHLENAWKH AERTLDTVEA NTEPSLSLSQ GRPFERNGTV RVSIRAVLSD VRPYAYDNAT VTVNGDADAD AVSFATDEAA GTDAIGSTLV DLGSDPENVT ITVTATAAHD ADRTVEATHD IRIAEEAVVR ERPDPDEYRN VEVVNESSGV SVAVGGDGLD DADVSITDET PTTDDPYRAG PMVRIENERP IDDATVEIPI DENALEADAN LSIVTWDPTS DEPWTPVETE IDRDAGVATA EVDHFSFFSV FRIEEWEDET SDTITLDGNE TDGEIGNGSG IETADFVFVN DESGSMSGSP THYAELAGKR FVGALTDSER AGRVGYASGA NLDQPLTTDH DAVNSSLERL SASGGTNTRA GLRVGLNHLE EEGWENRSAV MILLSDGKSG SDPLPVAEDA AEAGVEISTV GLGNNINENE LREIAAITGG DFYHVEREED LPDTFERVAE NQTGPGLQDT NGDGIPDLVA EMDLSMPTGE PGVVGEPLNL DPTALDTSGD GILDNETVDI KYRVFQEDNE TKLHAAVTYA EHHPARIDTT GDGLTDAEQL SDRTITYTDS RSDSLEFLSE LEDADDIDDL DGLEGDVLTT DTVRSDPLVD DSDGDGVTDA EEVRLGTDPE SRDTTGDGIS DSEGLNGDYD PTLFDIEPPE ITVTYATFND PDADVELKDP VDVDWSNGKV NFNDPVDASV RVSGSYEVDF TVTDSAGLDE ARVVRDGDVE ETVSLSGQWD DADVEFDVGT VDTFTDAFAG SRVTVQADDR HGIVDGVGTT EAAAVEVGGV WAAASDELRA QGVSDPRLEQ DLGTLQGMTT GAGESIDSLR ALYNEPIETI TAVREIPGAI ANFDEIIAAM PDSIEAQQQR NNPHDPDESP RLYESFRQGW YEGYIAWFVI EAAIPAGEAG KALKSSDRVR KTVDKISTPR IRQAAQMAGR AGHTAKTPVR YGRLQFSRGL STGIGLTQKA GENVLSRVST VGQQYRVAKL LNRHDVDGAA INRLDADGQE AIGKATARNG DDASRVMADG GPDPVARAYQ LDLDVNNENL VSNLFRHSEA VDFERVIDNL EELNRPNADI EGVDDLAKRL AAGDQSNVKG AAFEAEVAVV RGSDNVEAVG KPNPYSRGEI DIETSDGRVI ETKSGDYSQA ADGSDKYIEL ENQIGHYQQY TEVEGGTIEV AFREEPHDDI KDMLNDNDAE VEIYNE
|
| |
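The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration, not part of the source record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the reported gene length includes a single terminal stop codon (the gene sequence above ends in TGA). The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table for Htur_0036.
length = gene_length_bp(37566, 41906)
protein = protein_length_aa(length)
print(length, protein)
```

Under these assumptions the computed values match the Gene Length (4341 bp) and Protein Length (1446 aa) fields listed in the record.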