Gene Ava_0254 details

Gene Information

Locus tag: Ava_0254
Symbol:
ID: 3682941
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 325570
End bp: 327417
Gene Length: 1848 bp
Protein Length: 615 aa
Translation table: 11
GC content: 45%
IMG OID: 637715582
Product: von Willebrand factor, type A
Protein accession: YP_320775
Protein GI: 75906479
COG category: [R] General function prediction only
COG ID: [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain
TIGRFAM ID:
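
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(325570, 327417)
print(length)                     # 1848
print(protein_length_aa(length))  # 615
```

Both values agree with the Gene Length and Protein Length fields of this record.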


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000406076
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTAAAAA CAAGTTATGA ATTCGACCAA CCTATTTTAC CCGCCGGATT TTCATTAAAG 
GCTAATATCC TACTACGTTT TCGTGCGGAA ATACCCGAAT CTCCCCGGCG CAACCTTAAC
CTTTCCCTTG TAATTGACCG CTCAGGTTCT ATGGCAGGTG CAGCTTTACA TCATGCCCTC
AAGGCGGCTG AATCTGTGGT AGATCAACTT GAGCCAAAGG ATATTCTCTC AGTGGTCGTT
TACGATGATG CGGTAGATAC GGTTGTTTCA CCCCAACCTG TAACTGACAA ACCTGCGCTC
AAAAAGTCCA TACGTCAGGT GAGGGCAGGT GGTATTACTA ACTTATCGGG AGGATGGCTC
AAGGGGTGCG AATATGTCAA GCATCAACTC GATCCGCAAA AAATAAATCG TGTGCTACTG
CTGACTGATG GTCATGCCAA TATGGGTATT CAAGACCCAA AGATACTCAC AGCCACATCA
GCCCAAAAAG CTGAAGAAGG GATTACTACA ACTACTTTGG GGTTTGCTCA AGGTTTCAAT
GAGGATCTAC TAATTGGGAT GGCGAGAGCT GCTAATGGCA ACTTCTACTT CATTCAAAGC
ATTGATGAAG CAGCAGAAGT TTTTAGCATT GAACTAGATA GTCTTAGAGC TGTAGTAGGT
CAAAACTTGA AAGTAACACT AGAGTTAGCT GATGGTATCA CTCTGGTTGA TACTTTAAGT
CTTGCTAAAG TTAGCCAAAA TGAAGCTGGT CAACCTGTAA TTACCTTGGG GGAGCTTTAC
GAGGGAGAAG ATAAGCTTCT GGGTTTGAGT TTGATGATAT CCTCTGCTCA AGTAGGTAAT
TTACCAGTGA TGAAGCTGCA TTACAGTGCT GATGTAGTGC AGAATGACGT TATTCAAAGG
GTTTCCGGAA CAACAGATGT CATCGCCAAA GTTGGCACAG TTGAGGAATC AGCGTTAGCC
TCTTCCAGTC ATATTATCCT CGACCTCAGC CGCCTGACGA TCGCCAAAGC TAAAGAAACA
GCCCTCGAAT TAGCAGAACA TGGTCAGCAT CAAGCAGCTG AAAAAACCCT CCGTGATTTG
GTGCAGTATC TCCGAGACCA AGGCTTAAAT GAGAATTTTG AAATTGCAGA AGAGATTGAT
CAGCTGGAGT ATTTCGCAGG TCGAATTGCA CAACAAGCTC TAGGTAACGC CGGGCGGAAA
GAACTACGTG ATCAAAGTTA TCAAACAATG ACGCGCAATC GCGGTGATCT GGTGGGGCGC
GGTGTCACTG CTGGCGATGA AGTACACGCA ATGCCGGTTG TCAATGAAAT CGGTACTGGG
GTGGAACTTG CTTGCGTCCG TGAAGGCGGT AAACTACGGA TTAAAGTTAT ATCTGATGGT
TACGACCAAA CTAAAAATGT TCAATTTCCC CGTTCCATTC GTGCTGAAGG AGCGCGGTAC
ATTGTTGAAG GGCTAGAATT GTCAAGTAAT GGCTCTTTCT ACCGTGTAGT CGGCAAAGTC
AGTCGTTTTG CCAAACCTGG TGAAACAGAC ATTTTTGTTG CTCCTAGACA ATCGAGATCA
ACTAACACAA GTAAAGCCTC CAAAGCTCCA GCTACTGCGG CCGATCTTCC CACCACTGAC
ACCATAGATC ATGGTGTTCT CATTCAATGC GTCAAAGATG GTAGCAAGTT ACGCGCTAGG
GTAGTCTCCG ACGGATATGA ACCGGATTGG AATATGCGTT TTCCCCGTTC CATTCGTGAG
GAAGGAATGC TTTATGTGGT TGAGGAAGTG AAGACAGCAC CTGATGGCAA GTCTTATATT
GCTTCTGGCG AGATTAAGCG ATTTTTACAA CCGAATATTA CTAACTAA
 
Protein sequence
MVKTSYEFDQ PILPAGFSLK ANILLRFRAE IPESPRRNLN LSLVIDRSGS MAGAALHHAL 
KAAESVVDQL EPKDILSVVV YDDAVDTVVS PQPVTDKPAL KKSIRQVRAG GITNLSGGWL
KGCEYVKHQL DPQKINRVLL LTDGHANMGI QDPKILTATS AQKAEEGITT TTLGFAQGFN
EDLLIGMARA ANGNFYFIQS IDEAAEVFSI ELDSLRAVVG QNLKVTLELA DGITLVDTLS
LAKVSQNEAG QPVITLGELY EGEDKLLGLS LMISSAQVGN LPVMKLHYSA DVVQNDVIQR
VSGTTDVIAK VGTVEESALA SSSHIILDLS RLTIAKAKET ALELAEHGQH QAAEKTLRDL
VQYLRDQGLN ENFEIAEEID QLEYFAGRIA QQALGNAGRK ELRDQSYQTM TRNRGDLVGR
GVTAGDEVHA MPVVNEIGTG VELACVREGG KLRIKVISDG YDQTKNVQFP RSIRAEGARY
IVEGLELSSN GSFYRVVGKV SRFAKPGETD IFVAPRQSRS TNTSKASKAP ATAADLPTTD
TIDHGVLIQC VKDGSKLRAR VVSDGYEPDW NMRFPRSIRE EGMLYVVEEV KTAPDGKSYI
ASGEIKRFLQ PNITN