Gene Information |
Locus tag | Haur_2141 |
Symbol | |
ID | 5734043 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2695539 |
End bp | 2697371 |
Gene Length | 1833 bp |
Protein Length | 610 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641279282 |
Product | von Willebrand factor type A |
Protein accession | YP_001544909 |
Protein GI | 159898662 |
COG category | [R] General function prediction only |
COG ID | [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
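The coordinate, gene length, and protein length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TAA). The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    # Values taken from the record: Start bp, End bp.
    length = gene_length(2695539, 2697371)
    print(length, protein_length(length))
```

Under those assumptions the computed values can be compared directly with the Gene Length and Protein Length rows of the record.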
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGTCAA TTCGCAAATC GAAGTTCGAA GAGCAGAGTA GAAACCTACA GATACAAAAG GGAACCAAAG AGGAGCAAGC AACGTTTGAT CGGTGTGGTC CACAATGCCT TATCAATTTA ACCAAAAAAG AGGATGTTAT GCGACTAAAA CGAAGCAGCA TTGTGCTGAT CGCATTGATC ATCAGCGCTT GTGGGGGAGA AGCGAGCCTA CCAACGATCA ATCCGCAACC ACAACGGCCA GCGCCGCAGC CACGGCCAAC CAGCGCCGCC GACGATCAAT CAGCGCAATG GCCGACTGCC GAAGCAACGA GCGTAGCGCC AGCGCCACAA CCAATGCCGA CTCAAGCAGC AGATGCCGGT CAGCCAGTGC CAAATCCTGC TGCTGGTAAA CCCTTGGTTG ATACGTGGGA GCTGCCAACC CAACCGATCG ATCCAAATCC AAATTACGCC TACGAACAAG ATCAAGAAAT CTTTGATTCG ATGTATTTTA AAAATTATGG CACAAATCCA TTCGTGCGGA CAGAAACCGA CCCCTTATCA ACCTTTGCGA TGGATATTGA CAGTGCTTCG TACAGCCTGA TGCGCAGTAG CATCAACCAA GGCCTCTTAC CGCCAGCCGA TTCAGTGCGA GTCGAAGAAT ATCTGAACGC CTTTGATTAC GAGTATCCCC AGCCCGAAGA TGGCGATTTT GCGATCTACA GCGAAGTAGC GCCATCGCCA TTTGGCGGCC CCAACTACGA GCTAGTGCAA ATTGGCATTC AAGCTCGAAG TATCGAAGTA GCTGATCGCA AGCCTGCCGC CCTTACCTTT GTGATCGATA CATCGGGATC GATGGCCCAA GATAATCGCT TGGAAATGGT CAAAAATGCC CTGATTTATT TGGCTGGGCA ACTTGAGCCT GACGATAGTT TGGCAATTGT GGCCTTTAAC GATGGAATGC GAGTGGTGTT AAACCCAACT TCGGGCGAAA ATCAGATGGA TATCATCACC GCAATCAATT CACTTGAGCC AGCTGGCAGC ACCAACGCCG AAGCTGGACT TTATAAAGGC TTTGAATTAG CCTGGCAAGC CTTCAAACCG GAAGGCATCA ACCGGATTTT GCTCTGCTCA GATGGCGTGG CTAACAGCGG CATGACCGAA CCAAGTCAAC TGCTCGCGAC CTTCCAACAA TATCTTGATG CAGGCGTTCA GCTTTCGACC TATGGCGTGG GTATGGGCAA CTACAACGAC ATTTTGTTAG AGCAACTGGC CGACAAAGGC GATGGCAATT ATGCCTATTT CGATTCAGCC GATGAAGCCC AACGCCTGTT TGGCGAGCAA TTGACTGGTT CGCTGCAAAC CATCGGGCGC GAAGCCAAAA TCCAAGTTAA TTTTGACCCA AATGTAGTGA AACGGTATCG CTTGATTGGC TATGAAAATC GTGCGGTAGC CGATAGCGAC TTCCGCAACG ACAGTGTTGA TGGTGGCGAA GTTGGCGCGG GCCATAGTGT GACAGCGCTG TATGAAATCA AGCGCCATCC TGATGCCCAA GGCCCAATCG CCCAAGTTAA TATTCGCTAT ATCAGCATGG ATACTAACGC ACCAGTTGAA GAAAGCCTGA ATATTTCAAC GGCGCAAATT CATAGCAGTT TTGATCGCGC CAGTGCGCGA ATGCACCTAG CAACGAGCGT CGCCGAATAC GCCGAACTAT TACGCCATTC ACGCTGGAAT AACGGCACTG ATATCCTTGA TGTGCTTGAT CTGGCTGAAG AAGCGGCGCT AGATTTACCC AATAATCAAA GTGCCGTTGA ATTTGTTACC 
CTGCTACGGC GGGCTGAGCA GATGCACCAA TAA
|
Protein sequence | MRSIRKSKFE EQSRNLQIQK GTKEEQATFD RCGPQCLINL TKKEDVMRLK RSSIVLIALI ISACGGEASL PTINPQPQRP APQPRPTSAA DDQSAQWPTA EATSVAPAPQ PMPTQAADAG QPVPNPAAGK PLVDTWELPT QPIDPNPNYA YEQDQEIFDS MYFKNYGTNP FVRTETDPLS TFAMDIDSAS YSLMRSSINQ GLLPPADSVR VEEYLNAFDY EYPQPEDGDF AIYSEVAPSP FGGPNYELVQ IGIQARSIEV ADRKPAALTF VIDTSGSMAQ DNRLEMVKNA LIYLAGQLEP DDSLAIVAFN DGMRVVLNPT SGENQMDIIT AINSLEPAGS TNAEAGLYKG FELAWQAFKP EGINRILLCS DGVANSGMTE PSQLLATFQQ YLDAGVQLST YGVGMGNYND ILLEQLADKG DGNYAYFDSA DEAQRLFGEQ LTGSLQTIGR EAKIQVNFDP NVVKRYRLIG YENRAVADSD FRNDSVDGGE VGAGHSVTAL YEIKRHPDAQ GPIAQVNIRY ISMDTNAPVE ESLNISTAQI HSSFDRASAR MHLATSVAEY AELLRHSRWN NGTDILDVLD LAEEAALDLP NNQSAVEFVT LLRRAEQMHQ
|
| |
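The sequences above are displayed in space-separated blocks of ten characters. As a rough sketch of how the GC content field could be recomputed from the gene sequence, the helpers below strip the block spacing and count G and C bases. The names `clean_sequence` and `gc_fraction` are hypothetical, and the exact rounding the database applies to its reported percentage is not stated in the record, so any comparison with the GC content row is approximate.

```python
def clean_sequence(raw: str) -> str:
    """Remove whitespace between display blocks and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # Only the first two display blocks of the gene sequence are used here;
    # paste the full Gene sequence field to evaluate the whole gene.
    first_blocks = "ATGCGGTCAA TTCGCAAATC"
    seq = clean_sequence(first_blocks)
    print(len(seq), gc_fraction(seq))
```

Stripping whitespace first also makes it easy to confirm that the cleaned sequence length matches the Gene Length field.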