Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_1604 |
Symbol | |
ID | 8383883 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 1580474 |
End bp | 1583776 |
Gene Length | 3303 bp |
Protein Length | 1100 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 644972665 |
Product | von Willebrand factor type A |
Protein accession | YP_003130511 |
Protein GI | 257052678 |
COG category | [R] General function prediction only |
COG ID | [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCGTCT CGCCTGTACT GGCAAGCGAC CCGGTTGCAA CAAAGCTATC GGAGCATCAG TCATTCGATT CACTTTCTGA CTCAGTGAGG GACGCTGACG GTGCGTTCAC AGGGACGGAA TACACTTTAA GAGGCGAAAA GTACGGAAAA AGTAACGGGA ACGGAAACGG CGGGTCTCCA GGTAACGGTG GGCCACCAGG GAAGTCTGGG AACGACTTCA CCGGGCCGGA CAGTAATGTA ACGATCCCAT CATATATCGA GAATGCGACT GTCCAGAATC TCGAGATGCT GTCTTCTTTG GAACAACGGG CCTGGGTTCT GGCACAACTC GAAGATGTTG ACCCGACGAA CCGAAAGGAA GATCGACGAT TGACGGACGC ATTCAAGTCG ATAAACGAGT CAGTGCGTTC CTATGTCGAT TGGTCCAGAG TCGAAGCGTC CGACCTGTTC GGTTCAGATC GTTCGGCACT GCAAGCACTC CGGCACTTCG GGGCCGATTC GACCGTGTCC AACGCCACTA CTGCACTCGT CTTGTCCGAT CGGTCACTGG CAACACAGTC GATCGCAGAC GCTGAGCACG TCTTCGAGGA GTTTGAGGAC GAAGTCGAGA CACCGGGACA GCGACGGAAG GCGGAACGAC AGATCGAAAA CGCAAAGCGT GCGCTGGATC GCGGAGATGA GCGACGTTCC GACGAGAAAC ACGGACACAA TCGGGATCGA CAGGCGATCA AGCACTACGA ACAGGCCTGG AAACACGCAC AAAAGGCGAT CGAAGCCGTC GATAGCGAGG TCGGTCTGTC GTTGTCCGTT TCGACCGGAC AGCACGAACC CGGAAACGAG ACGATCACGT ATCCGGTCAG TGGGACGATC TCGGCACCGT CCGCAACGGT CGAATCTGTC GAGATTTACG TCGATGGCGA GCGCCACAAG ACCGTCAATG TCTCGACCTC GATGATGCCT GGCATTCCCG AGCGCTTCGA GACGGAACTC GAACTGGCGA CGACGGAGGC AACGGTCACC GTGGTAGCGA GAGACGAATC CAGTGGTGAA GAAGTCTCCA AGACGATTAC ACTTGACGCA CCGGGATTCG CCGACGAAGT GTATGATATC GAACTCACGG ATCCCGAAAG TGGCGCGGAA ATATCTGTCA CCGGTGAGGG AATCGTCAAA AGCGATTTCG TGGTCGATCC CGTCCCGGCT GAGGAAAATC GTTCGTTCTA TGCTGGTCCG TTCATCCACA TCCGGAACTT CTCCGATTTC GAGAGCGCGA CTGTCGAGAT GCCACTTGAC GATGATGTCG ATCCGTCCGA CGGAAACCTC TCGGTGTACA AGTGGGATCA ACACGACGAG AAGCCCTGGC ACGCAGTCGA GACTGACGTT CACGTCGAGA ACGGGACGGC CGTCGCGACG GTGGATTCGT TCTCGTATTT TTCGGTGTTC TGGGTGGACA ATTGGAACGA TGCCATCACG GACACGGTGA ACCTCGCTGA ACACCCCGAA TACGTCGCAA ACGAAACCGA GGGATCGATA GAACCGATCG ATCTCGCGTT TGTCATCGAC GAAAGCGGGA GTATGGGTGG CGCTCGTATC CAAGACGCCA AAGCCTCAGC CAAGCGCTTC GTCGGCGGTC TCTACGAGGA CGATAGGGCC GCACTCGTCA GCTTCGCGGG AGGCGCGACA CTCGGACAAT CACTGACGAC CGATCACGGA GCAGTAAACG CAAGTATCGA CCAGTTGAAT GCTGGCGGCG GGACCAATAC CGGTGCTGGA CTCCAGAAAG CAGTTGACGA GCTCACCAGT AATGGTGAGG GCGACACCCA GGAGATCATC CTGCTCGCGG ACGGCGGAAC CGGCCTGGGC CCGGACCCAG TTACAATTGC TCAGACCGCA GATGAGCATC GGATTACAAT TAACACGATC GGAATGGGGA CTGGAATCGA CGCCCAGGAA CTGACGAGTA TCGCCGATGC GACCGGCGGC GAGTTCTATC AGGTCAGCGA TTCCTCGGAA CTTCCAGAGG TGTTCGACCG CGTTGAGCAA AACCGGATTT CGCTGGTCGA TTCCGACGAA GACGGAATCT CGGACGCAGT CGAAGATATG GAGTTGGGGA TGACTTTCGG TCGTCCCGGG ATGGTCGGGA GGCCCCCAGA ACTCGAGCCA GACAATCCAC GTACGGCTGC TGAAGAATAT GTCGACGGCG ATGCTACCGA GTTCGAGCGG GTGACCTACG AAGACGATGG CGACCCGATG GTCACCGCGA AGATGATCGA CGCGAGTGTC CATCCCGGGA GTGGTGAGTC CGTGCGGAAG GAAGCGATTC GCCTCGGTGT GACGATTCCA TCGCGGATCG ATGCCGAAAC GAAGGAAAAT GTGTTCCATC TTAGATGGGG TGAAAACAGG AACCCGTACG GCTCGGAGGG CAAAGATGGA GACGTAGCAA CTGGGTATGC CGCGGTGCAT CGCAAACACC ATGTTCCAGC CGAGGGTGGG TGCTTCATTG GTTGCTGGGT TCCAGGCCAG TCAAAACATC CAGATTGGCT CCCAGATGAA GATGCGATCA AGGACGAGGA TCAGAAACAC ATCCTTCTCG AAAATGTCGA AGTCCATCTT CATTATTCGC ATGACGTAGC CACCGAGGAT ATTCCGACAG AGTACGAATT ACAATTCAAG CCCGGCGATT CGTCACGCGA CATTCATATG TATAATAATG TTGGAGAGAT CAATAGAGAT GAAATCATAA CGCAGACTGA TATCGTAGTA TCTGCTCCAG ATACGGGTGA AAACGATTAC CAATGGGAAG AACTCGGTAT TCTCGAAGCA AACTTCGATA TGTCTCAGGA TAGTCCAATT TATTTCGAAG CAAGAGATGA AGACGGTACG ATCACGATAG AATCCGATGA ACCATATCTC TACGAAAGTA CGGTGAAAAC TAGGTTCCAA GAAACGTACG ATGAAGCAAT GAACACATTG GAAACGGGGC TAATTATGGC TGCAGGCGGT GGCCCTCTAT CTTCAGTAAC AGCATCTGCG AATACGCTAA TCGTAAGTGG TAGCTATGGG ACGGCAGTTG GTGCGTTTGT CCTTGAGGAG GGAACGGGTC AACTTGTTAA TGTCGCGGGT GGAAAAAGTA AGTACAGCAT GTACCATTCT GCCATCTACA AAAACATGTC TTCGGAATTC ACTTGGACTA TCTCAGATGG AGATCCTACT GGAGGTACCA TGGATGTCAC TGTTCAAATA GCTGGTATAG GCCCCTCTGT CGTCCGTGTG AACGATCCAC TATCGGATGG ACAAGAAGAC TGA
|
Protein sequence | MVVSPVLASD PVATKLSEHQ SFDSLSDSVR DADGAFTGTE YTLRGEKYGK SNGNGNGGSP GNGGPPGKSG NDFTGPDSNV TIPSYIENAT VQNLEMLSSL EQRAWVLAQL EDVDPTNRKE DRRLTDAFKS INESVRSYVD WSRVEASDLF GSDRSALQAL RHFGADSTVS NATTALVLSD RSLATQSIAD AEHVFEEFED EVETPGQRRK AERQIENAKR ALDRGDERRS DEKHGHNRDR QAIKHYEQAW KHAQKAIEAV DSEVGLSLSV STGQHEPGNE TITYPVSGTI SAPSATVESV EIYVDGERHK TVNVSTSMMP GIPERFETEL ELATTEATVT VVARDESSGE EVSKTITLDA PGFADEVYDI ELTDPESGAE ISVTGEGIVK SDFVVDPVPA EENRSFYAGP FIHIRNFSDF ESATVEMPLD DDVDPSDGNL SVYKWDQHDE KPWHAVETDV HVENGTAVAT VDSFSYFSVF WVDNWNDAIT DTVNLAEHPE YVANETEGSI EPIDLAFVID ESGSMGGARI QDAKASAKRF VGGLYEDDRA ALVSFAGGAT LGQSLTTDHG AVNASIDQLN AGGGTNTGAG LQKAVDELTS NGEGDTQEII LLADGGTGLG PDPVTIAQTA DEHRITINTI GMGTGIDAQE LTSIADATGG EFYQVSDSSE LPEVFDRVEQ NRISLVDSDE DGISDAVEDM ELGMTFGRPG MVGRPPELEP DNPRTAAEEY VDGDATEFER VTYEDDGDPM VTAKMIDASV HPGSGESVRK EAIRLGVTIP SRIDAETKEN VFHLRWGENR NPYGSEGKDG DVATGYAAVH RKHHVPAEGG CFIGCWVPGQ SKHPDWLPDE DAIKDEDQKH ILLENVEVHL HYSHDVATED IPTEYELQFK PGDSSRDIHM YNNVGEINRD EIITQTDIVV SAPDTGENDY QWEELGILEA NFDMSQDSPI YFEARDEDGT ITIESDEPYL YESTVKTRFQ ETYDEAMNTL ETGLIMAAGG GPLSSVTASA NTLIVSGSYG TAVGAFVLEE GTGQLVNVAG GKSKYSMYHS AIYKNMSSEF TWTISDGDPT GGTMDVTVQI AGIGPSVVRV NDPLSDGQED
|
| |