Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0691 |
Symbol | |
ID | 5732592 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 794783 |
End bp | 796522 |
Gene Length | 1740 bp |
Protein Length | 579 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641277821 |
Product | von Willebrand factor type A |
Protein accession | YP_001543467 |
Protein GI | 159897220 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02226] N-terminal double-transmembrane domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTTGC TTGTGCCATT GGGTTTAATT GGCTTAATCG CCTTGCCGAT CATCGTGGTG TTGCATATGG TGCGCCAGCG CCGCCAACGG CTGAGGATTC CGACGATTCG GCTGTGGGCG GCATTAACGC CACCACCTGA GCGCCAACAA CGCAAATTGC CCCTAACCTT GCTGTTGGCT TTGCATTTGC TGGTTGCTGC TTGTTTAGCC TTGAGTTTAG CCCAACCAGC TTGGATTTTT GGAGCCGCTG CACCACGCCA CCTCGTAATT ATTCTTGATA CCACCTCTAG TATGGCGGCC AATCGCTCAT TTACTCAAGC TCAACAACAA ACCGAGGATT TGATTAACGA TTTGGGGCGT GATGATAGCC TGGCGCTGGT TGAGCTGAAT CACGAAGCGC GTTTGCTTGG CTATGGCGGC TATGCTGAAC GCCAACAATT GCGCCAAATT GTGGCCGAGC TGGCTCCCGC TGGTAATAAT GCCAACTTGG CTCAATCGCT GAGCATTGCC AATGCCACCC TCGCCAACGA TCGCCAAAAC CAACTGATTG TACTAAGCGA TGGCGCATTG CCCGCCAACA GCACGCCGTT ATCGGTTGCC GCCGAATTAG AATGGCGGAT GTTGGGCGAA AGCACCGCCA ACAGTGGCAT TGTCAATTTT GCGAGCCGCC GTTTGCCCAA CCAGCGCAAC GCCCTGTATG CCCGTGTGAC CAATTTTAGC GATCTGCCTG CGGCCCGAAC CTTGACTCTT TTGGTTGATG GTGAAATTGA ATCGGAGCAA AATTTGGTCA TTCAGCCTGG TGGCAGCGAG GAGCGCACGT GGGAAGTAGC CAATGGCGAG TTGGCTGAAT TACAACTTAG CCCTAACGAT GGTTATGCGC TTGATGATCG GGCGGTGCTT GCGCTCAGTC GCTCTGGCTC ATTGCGAGTT CATTTGGCCA CATTAACTCC CTCGCCACTT GAACGAATGC TGCGCAGTTT GCCCAATATT GAGCTAAGCG TTGGCCCAAG CGTCAGCAAT CAACGGGTGG ATTTGACTGT GTTGAATGGA GTTTTACCGC AGCAATTGCC AACCAGCGCC TTGTTGATTG TCAATCCGCC GAGCGACCCA CGCTTGCCAA CCCAAGATAG TGTGCTAGGT GAGCAGGCTA GCAGCGCGGT CTTGGATGCC GATTTTGCTG GCATCGACCT TTCAAGTGTG CAATGGGGTG GTCGTCGCCC GATCAAGCGT GAAGATATTC CCGCAGGCTT GAGCAGTGTG ATCGAAACTG ATACGCAAGC GCCCTTGGTG CTGCGTGGCA CATGGCAAGA GCGAGCAACC ATCGTTTGGC TGTTTAATTT AGATAATGCG AACCTTAGCG CAAAATTGGC ATTTCCTTTG TTGACAGCGG CCAGCATCGC CAACTTAACG GGTGGATCAT TGCCTGAGCA ACTGGCGGCG GGCAGTTTTG CCCCCAATAC GCCGCTAACC CGCCCTGATA GCGAGGCCCA AGCGCTTGAT CAGCGCCTGA ATCAAGCAGG TTTGTATCGG GTCGTTGGGA GTAATCGTGG CGGGATTGCG GTCAACTTTG GCGATCCACA GGAGTCAAAT CTGCAACAAC AAACCCAGCC AACGATTAGC CAAAGCCCGC AACCTGAGGG TGATCGCTTG CCGCCCCAAG GTACGCCATT ATGGCCGATG CTGGTTGGTT TGGCCTTGGT CGGATTGATT TTTGAATGGT GGTATAGCTT TCGATCGTAG
|
Protein sequence | MNLLVPLGLI GLIALPIIVV LHMVRQRRQR LRIPTIRLWA ALTPPPERQQ RKLPLTLLLA LHLLVAACLA LSLAQPAWIF GAAAPRHLVI ILDTTSSMAA NRSFTQAQQQ TEDLINDLGR DDSLALVELN HEARLLGYGG YAERQQLRQI VAELAPAGNN ANLAQSLSIA NATLANDRQN QLIVLSDGAL PANSTPLSVA AELEWRMLGE STANSGIVNF ASRRLPNQRN ALYARVTNFS DLPAARTLTL LVDGEIESEQ NLVIQPGGSE ERTWEVANGE LAELQLSPND GYALDDRAVL ALSRSGSLRV HLATLTPSPL ERMLRSLPNI ELSVGPSVSN QRVDLTVLNG VLPQQLPTSA LLIVNPPSDP RLPTQDSVLG EQASSAVLDA DFAGIDLSSV QWGGRRPIKR EDIPAGLSSV IETDTQAPLV LRGTWQERAT IVWLFNLDNA NLSAKLAFPL LTAASIANLT GGSLPEQLAA GSFAPNTPLT RPDSEAQALD QRLNQAGLYR VVGSNRGGIA VNFGDPQESN LQQQTQPTIS QSPQPEGDRL PPQGTPLWPM LVGLALVGLI FEWWYSFRS
|
| |