Gene Huta_1604 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_1604 
Symbol 
ID8383883 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp1580474 
End bp1583776 
Gene Length3303 bp 
Protein Length1100 aa 
Translation table11 
GC content54% 
IMG OID644972665 
Productvon Willebrand factor type A 
Protein accessionYP_003130511 
Protein GI257052678 
COG category[R] General function prediction only 
COG ID[COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCGTCT CGCCTGTACT GGCAAGCGAC CCGGTTGCAA CAAAGCTATC GGAGCATCAG 
TCATTCGATT CACTTTCTGA CTCAGTGAGG GACGCTGACG GTGCGTTCAC AGGGACGGAA
TACACTTTAA GAGGCGAAAA GTACGGAAAA AGTAACGGGA ACGGAAACGG CGGGTCTCCA
GGTAACGGTG GGCCACCAGG GAAGTCTGGG AACGACTTCA CCGGGCCGGA CAGTAATGTA
ACGATCCCAT CATATATCGA GAATGCGACT GTCCAGAATC TCGAGATGCT GTCTTCTTTG
GAACAACGGG CCTGGGTTCT GGCACAACTC GAAGATGTTG ACCCGACGAA CCGAAAGGAA
GATCGACGAT TGACGGACGC ATTCAAGTCG ATAAACGAGT CAGTGCGTTC CTATGTCGAT
TGGTCCAGAG TCGAAGCGTC CGACCTGTTC GGTTCAGATC GTTCGGCACT GCAAGCACTC
CGGCACTTCG GGGCCGATTC GACCGTGTCC AACGCCACTA CTGCACTCGT CTTGTCCGAT
CGGTCACTGG CAACACAGTC GATCGCAGAC GCTGAGCACG TCTTCGAGGA GTTTGAGGAC
GAAGTCGAGA CACCGGGACA GCGACGGAAG GCGGAACGAC AGATCGAAAA CGCAAAGCGT
GCGCTGGATC GCGGAGATGA GCGACGTTCC GACGAGAAAC ACGGACACAA TCGGGATCGA
CAGGCGATCA AGCACTACGA ACAGGCCTGG AAACACGCAC AAAAGGCGAT CGAAGCCGTC
GATAGCGAGG TCGGTCTGTC GTTGTCCGTT TCGACCGGAC AGCACGAACC CGGAAACGAG
ACGATCACGT ATCCGGTCAG TGGGACGATC TCGGCACCGT CCGCAACGGT CGAATCTGTC
GAGATTTACG TCGATGGCGA GCGCCACAAG ACCGTCAATG TCTCGACCTC GATGATGCCT
GGCATTCCCG AGCGCTTCGA GACGGAACTC GAACTGGCGA CGACGGAGGC AACGGTCACC
GTGGTAGCGA GAGACGAATC CAGTGGTGAA GAAGTCTCCA AGACGATTAC ACTTGACGCA
CCGGGATTCG CCGACGAAGT GTATGATATC GAACTCACGG ATCCCGAAAG TGGCGCGGAA
ATATCTGTCA CCGGTGAGGG AATCGTCAAA AGCGATTTCG TGGTCGATCC CGTCCCGGCT
GAGGAAAATC GTTCGTTCTA TGCTGGTCCG TTCATCCACA TCCGGAACTT CTCCGATTTC
GAGAGCGCGA CTGTCGAGAT GCCACTTGAC GATGATGTCG ATCCGTCCGA CGGAAACCTC
TCGGTGTACA AGTGGGATCA ACACGACGAG AAGCCCTGGC ACGCAGTCGA GACTGACGTT
CACGTCGAGA ACGGGACGGC CGTCGCGACG GTGGATTCGT TCTCGTATTT TTCGGTGTTC
TGGGTGGACA ATTGGAACGA TGCCATCACG GACACGGTGA ACCTCGCTGA ACACCCCGAA
TACGTCGCAA ACGAAACCGA GGGATCGATA GAACCGATCG ATCTCGCGTT TGTCATCGAC
GAAAGCGGGA GTATGGGTGG CGCTCGTATC CAAGACGCCA AAGCCTCAGC CAAGCGCTTC
GTCGGCGGTC TCTACGAGGA CGATAGGGCC GCACTCGTCA GCTTCGCGGG AGGCGCGACA
CTCGGACAAT CACTGACGAC CGATCACGGA GCAGTAAACG CAAGTATCGA CCAGTTGAAT
GCTGGCGGCG GGACCAATAC CGGTGCTGGA CTCCAGAAAG CAGTTGACGA GCTCACCAGT
AATGGTGAGG GCGACACCCA GGAGATCATC CTGCTCGCGG ACGGCGGAAC CGGCCTGGGC
CCGGACCCAG TTACAATTGC TCAGACCGCA GATGAGCATC GGATTACAAT TAACACGATC
GGAATGGGGA CTGGAATCGA CGCCCAGGAA CTGACGAGTA TCGCCGATGC GACCGGCGGC
GAGTTCTATC AGGTCAGCGA TTCCTCGGAA CTTCCAGAGG TGTTCGACCG CGTTGAGCAA
AACCGGATTT CGCTGGTCGA TTCCGACGAA GACGGAATCT CGGACGCAGT CGAAGATATG
GAGTTGGGGA TGACTTTCGG TCGTCCCGGG ATGGTCGGGA GGCCCCCAGA ACTCGAGCCA
GACAATCCAC GTACGGCTGC TGAAGAATAT GTCGACGGCG ATGCTACCGA GTTCGAGCGG
GTGACCTACG AAGACGATGG CGACCCGATG GTCACCGCGA AGATGATCGA CGCGAGTGTC
CATCCCGGGA GTGGTGAGTC CGTGCGGAAG GAAGCGATTC GCCTCGGTGT GACGATTCCA
TCGCGGATCG ATGCCGAAAC GAAGGAAAAT GTGTTCCATC TTAGATGGGG TGAAAACAGG
AACCCGTACG GCTCGGAGGG CAAAGATGGA GACGTAGCAA CTGGGTATGC CGCGGTGCAT
CGCAAACACC ATGTTCCAGC CGAGGGTGGG TGCTTCATTG GTTGCTGGGT TCCAGGCCAG
TCAAAACATC CAGATTGGCT CCCAGATGAA GATGCGATCA AGGACGAGGA TCAGAAACAC
ATCCTTCTCG AAAATGTCGA AGTCCATCTT CATTATTCGC ATGACGTAGC CACCGAGGAT
ATTCCGACAG AGTACGAATT ACAATTCAAG CCCGGCGATT CGTCACGCGA CATTCATATG
TATAATAATG TTGGAGAGAT CAATAGAGAT GAAATCATAA CGCAGACTGA TATCGTAGTA
TCTGCTCCAG ATACGGGTGA AAACGATTAC CAATGGGAAG AACTCGGTAT TCTCGAAGCA
AACTTCGATA TGTCTCAGGA TAGTCCAATT TATTTCGAAG CAAGAGATGA AGACGGTACG
ATCACGATAG AATCCGATGA ACCATATCTC TACGAAAGTA CGGTGAAAAC TAGGTTCCAA
GAAACGTACG ATGAAGCAAT GAACACATTG GAAACGGGGC TAATTATGGC TGCAGGCGGT
GGCCCTCTAT CTTCAGTAAC AGCATCTGCG AATACGCTAA TCGTAAGTGG TAGCTATGGG
ACGGCAGTTG GTGCGTTTGT CCTTGAGGAG GGAACGGGTC AACTTGTTAA TGTCGCGGGT
GGAAAAAGTA AGTACAGCAT GTACCATTCT GCCATCTACA AAAACATGTC TTCGGAATTC
ACTTGGACTA TCTCAGATGG AGATCCTACT GGAGGTACCA TGGATGTCAC TGTTCAAATA
GCTGGTATAG GCCCCTCTGT CGTCCGTGTG AACGATCCAC TATCGGATGG ACAAGAAGAC
TGA
 
Protein sequence
MVVSPVLASD PVATKLSEHQ SFDSLSDSVR DADGAFTGTE YTLRGEKYGK SNGNGNGGSP 
GNGGPPGKSG NDFTGPDSNV TIPSYIENAT VQNLEMLSSL EQRAWVLAQL EDVDPTNRKE
DRRLTDAFKS INESVRSYVD WSRVEASDLF GSDRSALQAL RHFGADSTVS NATTALVLSD
RSLATQSIAD AEHVFEEFED EVETPGQRRK AERQIENAKR ALDRGDERRS DEKHGHNRDR
QAIKHYEQAW KHAQKAIEAV DSEVGLSLSV STGQHEPGNE TITYPVSGTI SAPSATVESV
EIYVDGERHK TVNVSTSMMP GIPERFETEL ELATTEATVT VVARDESSGE EVSKTITLDA
PGFADEVYDI ELTDPESGAE ISVTGEGIVK SDFVVDPVPA EENRSFYAGP FIHIRNFSDF
ESATVEMPLD DDVDPSDGNL SVYKWDQHDE KPWHAVETDV HVENGTAVAT VDSFSYFSVF
WVDNWNDAIT DTVNLAEHPE YVANETEGSI EPIDLAFVID ESGSMGGARI QDAKASAKRF
VGGLYEDDRA ALVSFAGGAT LGQSLTTDHG AVNASIDQLN AGGGTNTGAG LQKAVDELTS
NGEGDTQEII LLADGGTGLG PDPVTIAQTA DEHRITINTI GMGTGIDAQE LTSIADATGG
EFYQVSDSSE LPEVFDRVEQ NRISLVDSDE DGISDAVEDM ELGMTFGRPG MVGRPPELEP
DNPRTAAEEY VDGDATEFER VTYEDDGDPM VTAKMIDASV HPGSGESVRK EAIRLGVTIP
SRIDAETKEN VFHLRWGENR NPYGSEGKDG DVATGYAAVH RKHHVPAEGG CFIGCWVPGQ
SKHPDWLPDE DAIKDEDQKH ILLENVEVHL HYSHDVATED IPTEYELQFK PGDSSRDIHM
YNNVGEINRD EIITQTDIVV SAPDTGENDY QWEELGILEA NFDMSQDSPI YFEARDEDGT
ITIESDEPYL YESTVKTRFQ ETYDEAMNTL ETGLIMAAGG GPLSSVTASA NTLIVSGSYG
TAVGAFVLEE GTGQLVNVAG GKSKYSMYHS AIYKNMSSEF TWTISDGDPT GGTMDVTVQI
AGIGPSVVRV NDPLSDGQED