Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_2951 |
Symbol | |
ID | 9340755 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 3034881 |
End bp | 3036137 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | |
Product | von Willebrand factor type A |
Protein accession | YP_003721885 |
Protein GI | 298491708 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGTTA ACTTACAGCC TGTTCTCAAT GACGCTAAAT TAAACGCGTG TCAGCCAAGC AGTCAACGTC AGTTGGCGGT TTCTATTTCG GCTGTTGGCG AAACTCTGGA TCGCAGAGTG CCATTAAATT TATGCTTAAT TTTAGATCAT AGTGGTTCGA TGAATGGCCG AGCACTCGAA ACTGTGAAAA AAGCAGTTTC TTTGCTGGTT GATCAACTCA GTTCCGAAGA TCGGCTGAGT ATTGTCGTTT TTGACCACCG TGCTAAGATT TTAGTCCCTA ATCAGATAAT TTCTGATCGC AACCAGATCA AACAGCAAAT CAATCGCCTA ACAGCTGATG GAGGAACTGC TATTGATGAG GGTTTGCGGC TGGGGATTGA GGAGTTAGCC AAGGGTAAAA AGGACACTAT TTCCCAAGCA TTTTTACTCA CAGATGGGGA AAATGAACAT GGTGATAATA ATCGCTGCTT GAAATTCGCT CAGTTAGCAG CCAGCTATAA TTTAACCTTG AATACGTTGG GTTTTGGTGA CAATTGGAAT CAGGATATTT TGGAAAAAAT AGCCGATGCG GGGCTAGGGA ATCTGTCTCA TATTGAACAC CCTAATCAAG CAGTGGATAA GTTCAGTCGC TTGTTTAGCA GAATGCAAAC GGTGGGATTG ACTAACGCTT ATCTGCTATT TTCACTGTTA CCCAATGTCC GTTTAGCTGA ACTAAAACCT ATTGCCCAAG TGGCACCAGA TACAATTGAG TTACCAGTAC AACCAGAAGC TGATGGTCGG CTTGCAGTGC GGTTGGGAGA TTTAATGAAA GACGTAGAAC GGGTAATTTT GGCTAATATT TATTTGGGAC AGTTGCCAGA AGGTAAACAG GCGATCGCTA ATGTGCAAAT CCGCTACGAT GACCCCGCCC TGAACCAAAC CGGTTTATTT TCCCTGAAAA TACCAGTTTA TGCGAATGTA GAGCGCATTT ACCAACCAAG TTCTAATCCC CATGTGCAGC AGTCAATTTT GGCTTTAGCC AAATATCGCC AAACCCAGTT AGCCGAAGCG AAATTACAAC AGGGTGATCG GTCTGGGGCA GCGACAATGC TACAAACTGC AGCAAAAACA GCTTTGCAAA TGGGAGATGC TAGTGCAGCA AGGGTGTTGC AAACTTCGGC TACTCGCTTA CAGGCTGGGG ACGAGTTATC CGAAAGCGAT CGCAAAAAAA CCAGGATTGT CTCAAAAACT GTGTTGCAGG ATGCTCCTCC TAAATGA
|
Protein sequence | MKVNLQPVLN DAKLNACQPS SQRQLAVSIS AVGETLDRRV PLNLCLILDH SGSMNGRALE TVKKAVSLLV DQLSSEDRLS IVVFDHRAKI LVPNQIISDR NQIKQQINRL TADGGTAIDE GLRLGIEELA KGKKDTISQA FLLTDGENEH GDNNRCLKFA QLAASYNLTL NTLGFGDNWN QDILEKIADA GLGNLSHIEH PNQAVDKFSR LFSRMQTVGL TNAYLLFSLL PNVRLAELKP IAQVAPDTIE LPVQPEADGR LAVRLGDLMK DVERVILANI YLGQLPEGKQ AIANVQIRYD DPALNQTGLF SLKIPVYANV ERIYQPSSNP HVQQSILALA KYRQTQLAEA KLQQGDRSGA ATMLQTAAKT ALQMGDASAA RVLQTSATRL QAGDELSESD RKKTRIVSKT VLQDAPPK
|
| |