Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_0254 |
Symbol | |
ID | 3682941 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 325570 |
End bp | 327417 |
Gene Length | 1848 bp |
Protein Length | 615 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637715582 |
Product | von Willebrand factor, type A |
Protein accession | YP_320775 |
Protein GI | 75906479 |
COG category | [R] General function prediction only |
COG ID | [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000000406076 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAAAAA CAAGTTATGA ATTCGACCAA CCTATTTTAC CCGCCGGATT TTCATTAAAG GCTAATATCC TACTACGTTT TCGTGCGGAA ATACCCGAAT CTCCCCGGCG CAACCTTAAC CTTTCCCTTG TAATTGACCG CTCAGGTTCT ATGGCAGGTG CAGCTTTACA TCATGCCCTC AAGGCGGCTG AATCTGTGGT AGATCAACTT GAGCCAAAGG ATATTCTCTC AGTGGTCGTT TACGATGATG CGGTAGATAC GGTTGTTTCA CCCCAACCTG TAACTGACAA ACCTGCGCTC AAAAAGTCCA TACGTCAGGT GAGGGCAGGT GGTATTACTA ACTTATCGGG AGGATGGCTC AAGGGGTGCG AATATGTCAA GCATCAACTC GATCCGCAAA AAATAAATCG TGTGCTACTG CTGACTGATG GTCATGCCAA TATGGGTATT CAAGACCCAA AGATACTCAC AGCCACATCA GCCCAAAAAG CTGAAGAAGG GATTACTACA ACTACTTTGG GGTTTGCTCA AGGTTTCAAT GAGGATCTAC TAATTGGGAT GGCGAGAGCT GCTAATGGCA ACTTCTACTT CATTCAAAGC ATTGATGAAG CAGCAGAAGT TTTTAGCATT GAACTAGATA GTCTTAGAGC TGTAGTAGGT CAAAACTTGA AAGTAACACT AGAGTTAGCT GATGGTATCA CTCTGGTTGA TACTTTAAGT CTTGCTAAAG TTAGCCAAAA TGAAGCTGGT CAACCTGTAA TTACCTTGGG GGAGCTTTAC GAGGGAGAAG ATAAGCTTCT GGGTTTGAGT TTGATGATAT CCTCTGCTCA AGTAGGTAAT TTACCAGTGA TGAAGCTGCA TTACAGTGCT GATGTAGTGC AGAATGACGT TATTCAAAGG GTTTCCGGAA CAACAGATGT CATCGCCAAA GTTGGCACAG TTGAGGAATC AGCGTTAGCC TCTTCCAGTC ATATTATCCT CGACCTCAGC CGCCTGACGA TCGCCAAAGC TAAAGAAACA GCCCTCGAAT TAGCAGAACA TGGTCAGCAT CAAGCAGCTG AAAAAACCCT CCGTGATTTG GTGCAGTATC TCCGAGACCA AGGCTTAAAT GAGAATTTTG AAATTGCAGA AGAGATTGAT CAGCTGGAGT ATTTCGCAGG TCGAATTGCA CAACAAGCTC TAGGTAACGC CGGGCGGAAA GAACTACGTG ATCAAAGTTA TCAAACAATG ACGCGCAATC GCGGTGATCT GGTGGGGCGC GGTGTCACTG CTGGCGATGA AGTACACGCA ATGCCGGTTG TCAATGAAAT CGGTACTGGG GTGGAACTTG CTTGCGTCCG TGAAGGCGGT AAACTACGGA TTAAAGTTAT ATCTGATGGT TACGACCAAA CTAAAAATGT TCAATTTCCC CGTTCCATTC GTGCTGAAGG AGCGCGGTAC ATTGTTGAAG GGCTAGAATT GTCAAGTAAT GGCTCTTTCT ACCGTGTAGT CGGCAAAGTC AGTCGTTTTG CCAAACCTGG TGAAACAGAC ATTTTTGTTG CTCCTAGACA ATCGAGATCA ACTAACACAA GTAAAGCCTC CAAAGCTCCA GCTACTGCGG CCGATCTTCC CACCACTGAC ACCATAGATC ATGGTGTTCT CATTCAATGC GTCAAAGATG GTAGCAAGTT ACGCGCTAGG GTAGTCTCCG ACGGATATGA ACCGGATTGG AATATGCGTT TTCCCCGTTC CATTCGTGAG GAAGGAATGC TTTATGTGGT TGAGGAAGTG AAGACAGCAC CTGATGGCAA GTCTTATATT GCTTCTGGCG AGATTAAGCG ATTTTTACAA CCGAATATTA CTAACTAA
|
Protein sequence | MVKTSYEFDQ PILPAGFSLK ANILLRFRAE IPESPRRNLN LSLVIDRSGS MAGAALHHAL KAAESVVDQL EPKDILSVVV YDDAVDTVVS PQPVTDKPAL KKSIRQVRAG GITNLSGGWL KGCEYVKHQL DPQKINRVLL LTDGHANMGI QDPKILTATS AQKAEEGITT TTLGFAQGFN EDLLIGMARA ANGNFYFIQS IDEAAEVFSI ELDSLRAVVG QNLKVTLELA DGITLVDTLS LAKVSQNEAG QPVITLGELY EGEDKLLGLS LMISSAQVGN LPVMKLHYSA DVVQNDVIQR VSGTTDVIAK VGTVEESALA SSSHIILDLS RLTIAKAKET ALELAEHGQH QAAEKTLRDL VQYLRDQGLN ENFEIAEEID QLEYFAGRIA QQALGNAGRK ELRDQSYQTM TRNRGDLVGR GVTAGDEVHA MPVVNEIGTG VELACVREGG KLRIKVISDG YDQTKNVQFP RSIRAEGARY IVEGLELSSN GSFYRVVGKV SRFAKPGETD IFVAPRQSRS TNTSKASKAP ATAADLPTTD TIDHGVLIQC VKDGSKLRAR VVSDGYEPDW NMRFPRSIRE EGMLYVVEEV KTAPDGKSYI ASGEIKRFLQ PNITN
|
| |