Gene Information |
Locus tag | Aazo_2950 |
Symbol | |
ID | 9340754 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 3033604 |
End bp | 3034884 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | |
Product | von Willebrand factor type A |
Protein accession | YP_003721884 |
Protein GI | 298491707 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
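The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends with one stop codon that is not counted in the protein length. Neither convention is stated in the record, but both are consistent with its values. The function names are hypothetical, chosen only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 3033604
END_BP = 3034884
GENE_LENGTH_BP = 1281
PROTEIN_LENGTH_AA = 426


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))   # compare with Gene Length
    print(protein_length(GENE_LENGTH_BP))  # compare with Protein Length
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields in the record.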
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGTTC AATTACTATC GGCACTCAAC GATACTAATG TTGACGCTAC TCAATTGAGT AACCAACGTC AACTGGCAAT TTCCATTTCT TCCATTGCTG ATGAGCTTGA CCCAAGTTTA CCGTTGAATT TATGCCTGAT TCTGGATAAA AGCGGTTCTA TGCACGGCGA ACCCATTAAC ACCGTAATTC AGGCTGTAGA ACAATTATTA GCTCAACTCC AGCCAGGCGA TCACATCTCA ATTGTCGCGT TTGCAGGTAC TTCTGAGGTC ATTATCCCTA ACCAAATCGT CCAAGATGCT GAGAGCATCA AATGCCAGTT GCACAAAAGA CTCAAAGCTG GTGGTGGCAC AATCATTGCC GAAGGTTTAT CTTTGGGAAT TACTGAATTA CTCAAAGGGA CAAAAGGCGC TGTTTCCCAA GCATTTTTGT TGACAGATGG ACATGGTGAC AGGGGGTTAA AAATTTGGAA GTGGGAGATG GGCCCCAATG ACAAGAAACG TTGTTTGGAA CTAGCACAAA AAGCCACTAG AGTAAGCCTA ACGCTCAACA CCTTCGGTTT TGGCAATGAC TGGAACCAGG ATTTGCTGGA AAAAATTGCT GATGCTGGTG GTGGTACTCT GGCTTATATT GAGCGTCCAC AACAAGCCGT AGATCAATTT AGTCGCTTGC TTAAGCGAAT TCAGTCTGTG GGCTTAACCA ATGCCCACTT ATTGCTGTCT CTAGTCCCTA GTGTGCGTTT AGCAGAACTA AAACCCATTG CCCAAGTTGC CCCAGAAACC ATTGAGTTAC CAGTGGAAAC AGAACCCAAT GGTAGCTTAA TTGTGCGTTT GGGAGACTTG ATGAAAGATG TAGAACGGGT AGTTTTAGCG AATATTTATT TGGGACAGTT GCCAGAAGGG GAACAAGTAA TTGGACATAT CCAAATACGC TATGATGACC CAGCTATTAA CCAAGAAGGT TTACTTTCCC CACTCATACC AGTTTATGCC AATTTCACTA AAACTTACCA ACCACTACTT GATTCACAAG TGATCAAATC AATTTTGGTA TTGGCAAAAT ATCGCCAAAC TCAAGTAGCA GAAGCAAAAC TGGAACAGGG TGATCGCACT GGTGCTATCA CAATGCTACA AACAGCCGCT AAGACTGCTT TACAAATCGG TGATATTGGT GCAGCAACAG TGCTGCAATC TTCCGCTACT CGTTTGCAAG CCGGTGAAGA ACTCTCTGAA GCAGACCGCA AAAAAACCAG GATTGTCTCG AAAACTATTG TGAGGGAGTG A
|
Protein sequence | MKVQLLSALN DTNVDATQLS NQRQLAISIS SIADELDPSL PLNLCLILDK SGSMHGEPIN TVIQAVEQLL AQLQPGDHIS IVAFAGTSEV IIPNQIVQDA ESIKCQLHKR LKAGGGTIIA EGLSLGITEL LKGTKGAVSQ AFLLTDGHGD RGLKIWKWEM GPNDKKRCLE LAQKATRVSL TLNTFGFGND WNQDLLEKIA DAGGGTLAYI ERPQQAVDQF SRLLKRIQSV GLTNAHLLLS LVPSVRLAEL KPIAQVAPET IELPVETEPN GSLIVRLGDL MKDVERVVLA NIYLGQLPEG EQVIGHIQIR YDDPAINQEG LLSPLIPVYA NFTKTYQPLL DSQVIKSILV LAKYRQTQVA EAKLEQGDRT GAITMLQTAA KTALQIGDIG AATVLQSSAT RLQAGEELSE ADRKKTRIVS KTIVRE
|
| |
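The sequence fields can be related to the GC content and Translation table entries with a short script. This is a minimal sketch, not the database's own method. It assumes the gene sequence is shown 5' to 3' on the coding strand (it starts with ATG even though the strand is listed as "-"), and that the 10-base blocks are separated only by whitespace. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. Translation table 11 assigns amino acids to codons the same way as the standard code, so a single codon table is used here.

```python
# Codon table in NCBI order (bases T, C, A, G); amino-acid assignments
# are the same for the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(sequence: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(sequence.split()).upper()


def gc_content(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(sequence)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(sequence: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    seq = clean(sequence)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First 18 bases of the gene sequence in the record.
    print(translate("ATGAAAGTTC AATTACTA"))
    # Last 15 bases of the gene sequence, including the TGA stop codon.
    print(translate("ATTGTGAGGG AGTGA"))
```

Passing the full Gene sequence field to `translate` and `gc_content` gives values that can be compared against the Protein sequence and GC content fields.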