Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_2351 |
Symbol | |
ID | 3683466 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 2918993 |
End bp | 2921485 |
Gene Length | 2493 bp |
Protein Length | 830 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 637717696 |
Product | filamentous haemagglutinin-like protein |
Protein accession | YP_322864 |
Protein GI | 75908568 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3210] Large exoproteins involved in heme utilization or adhesion |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGTAA CTTCTGTGTG CTTGGGTCTA ACTTGTGTGT TCTTAGCTGC TGGAATGTCT TTACCAGCCA CTGCTCAAGT GACTTCTGAT GATACCACAA ACACTACTGT CAACCCAAGC GGTAATAATT TTCAGATTAT TAACGGTACT GCCAGAGGAA ATAATTTATT TCACAGCTTT AGTAATTTCT CTGTACCCAA AGACTCTTCA GCAACATTTG ACTTAACTAA TACACCAAAT ATTACAAATA TATTTAGCCG AGTTACTGGT GGTAATGTTT CTAATATTGA TGGAATGATT GAAACTGTTA AGAGTAATAA TCCAGTAAGT TTGTTTTTAA TGAATCCGGC AGGGATTGTG TTTGCTCCAA ATGCCTCCTT GAATATTGGC GGGTCGTTTG TAGGAACAAC GGCAAATAGT ATTAAGTTTG CTGATGGGAC AGAGTTTAGT GCCAATAATT CCAGTGTGAC ACCATTACTG ACGATGAGTG TTCCAATTGG CTTGCAAATG GGAAGTAATC CAAACCCCAT CACAGTTCAA GGTACAGGTT ATGCCCTAAC ATCTGTCAGT ACCGTTGCCC CCATCACTCA AAGTCCCAGT ACCTCAGAAC TAAGGGTAAA GCCAGTAAAG ACCTTGGCAC TGGTAGGGGG CGACTTAAAT CTTACTGGGG CAACCTTGAA CGCGCCCGAA GGACGTGTCG AATTGGGAAG TTTAAGCGGA GTGGGAATGG TCAGTTTGAA CCCCATTTCT CAGGGTTATC AGTTAAGTTA CGAGGGCGGG CAAAGCTTTG CAGATATCCA ACTTACGCAA AAATCCTTGC TGACTGTAGG AGCATTATTA AGTGCAGGGG CATTAAATGC TGGTTCGGTA CAATTACAGG GAAGGCACAT TCAAATCAGT GATGGTTCAA TTATATTTTC CAAAAATCTG GGAAATGTCG CTGGAGGTGA AATTATTTTA CAAGCCTCGG ACAGTATCGA TATTATAGGT ACAACAGCCA ACGCCCAAAT TCGCAGTGGC ATCCGCTCTG AAGGTTTGAA TACTGGCACA GGTTCACCCA TCCATATCAT CACTCCCCAC TTAACCCTCT CACAAGGAGC AGGAGTTAAC AACAATGCTT TTGGATTTGC TGCCAGTGGT GGGATTCAGA TTGATGCACA GACAGTTGAG TTATCTGGTT TTTCTCCCAT CAATCCTACA GGAGTAACGT CCCTGACCAT CTCTTCCCGA ACTCCCAAGT CAGCAGGCAA TCTCTCCATT AACACTAATA GTCTACTGGT TTCTCAGGGA GCAGCCATCT CCTCAGTAGC ATTTGGGACT GGTTCTACAG GGCAAGTCAC CATTCGCAGT CAGAATACTA CTGTCACGGG AGACAATCCG GCAGGACTCT ATAGCAATAT TAGTGCAATC ACATATGGCA CAGGCGATGC CCAAACCTTG ACGTTAGATA CGAAACGATT ACAACTACTC GATGGGGGAG TAGTCGCAAC CACATCCTTT TTGATTGGTA AAGCTGGAAA TTTAAACATT AACGCGACTG AATCGATTCT GATTGACGGG CGCAGCCAGG CTAACAATAG CAGTATTAAC TCAGCCGTCC TGATCCCGCC AGCATTAATC AAACAGTTAT TCAGGTTGCC TAATATTTTG TCGGCGAATG GTGGAACTGT GAATGTAACC ACCCCTACCT TGACGTTAAG CAATGGCGGC GCAGTTAGTG TGACTAATCA GGGTGCAGGC GATGGGGGCC ATATAAACAT TACTGCCAAT ACGGTATTTT TAGATCGTCA AGGTAGCGTT CAAGCTCAAA CGTTGTCGGG CGAGGGCGGT AATATTACCT CACAAGTCAG TGAGCTGCTG TTACTACGTC ACAATAGCCT GATTAGTGCG ACATCCGGGA ATACAGGTAA TGGTGGCAAT ATTAGCATTA ATGCTCCTGT TATCGCTGGG CTAGAAAACA GTGACATTAT TGCTAATGCA GTGAAAGGAA GAGGTGGCAA TATTGATATC AGGACTCAAG GTATTATTGG TCTAGCATTC CGTAATACTC TCACCCCAAG AGTTGAGCAG ACCAACGACA TCACAGCCAG TTCCGAATTT AACATTAATG GCACAGTAAA AATTAATAAC GTTGGTGTTG ATCCCAATTC GGGTTTAGTT GCACTACCCG CAAATATTAT CGATCCATCC CAGCAAATAG CTAGTGGTTG TTCTGCCGAT ACTGGCAGTA GTTTCGTCGC CACAGGGCGG GGTGGAATAC CGCAAAATCC TAGTCAAGAA ATGAGGAGCG ATCGCACTTG GTCTGATACC CGTGATATAT CTGCATTCCA AAACAGACAG CAGACACAAG CACAAGCACC CACACAGCCA CGAATCCCCG TCCAAGCAAA TTCCTGGCAT CGTAACAACC AAGGCAAAAT TGAATTAATT GCGGATCAAT CCCTAACACA GGGACAAACA GCATTAACCT GCATGGCTAT ACCCCAGAAT TAA
|
Protein sequence | MKVTSVCLGL TCVFLAAGMS LPATAQVTSD DTTNTTVNPS GNNFQIINGT ARGNNLFHSF SNFSVPKDSS ATFDLTNTPN ITNIFSRVTG GNVSNIDGMI ETVKSNNPVS LFLMNPAGIV FAPNASLNIG GSFVGTTANS IKFADGTEFS ANNSSVTPLL TMSVPIGLQM GSNPNPITVQ GTGYALTSVS TVAPITQSPS TSELRVKPVK TLALVGGDLN LTGATLNAPE GRVELGSLSG VGMVSLNPIS QGYQLSYEGG QSFADIQLTQ KSLLTVGALL SAGALNAGSV QLQGRHIQIS DGSIIFSKNL GNVAGGEIIL QASDSIDIIG TTANAQIRSG IRSEGLNTGT GSPIHIITPH LTLSQGAGVN NNAFGFAASG GIQIDAQTVE LSGFSPINPT GVTSLTISSR TPKSAGNLSI NTNSLLVSQG AAISSVAFGT GSTGQVTIRS QNTTVTGDNP AGLYSNISAI TYGTGDAQTL TLDTKRLQLL DGGVVATTSF LIGKAGNLNI NATESILIDG RSQANNSSIN SAVLIPPALI KQLFRLPNIL SANGGTVNVT TPTLTLSNGG AVSVTNQGAG DGGHINITAN TVFLDRQGSV QAQTLSGEGG NITSQVSELL LLRHNSLISA TSGNTGNGGN ISINAPVIAG LENSDIIANA VKGRGGNIDI RTQGIIGLAF RNTLTPRVEQ TNDITASSEF NINGTVKINN VGVDPNSGLV ALPANIIDPS QQIASGCSAD TGSSFVATGR GGIPQNPSQE MRSDRTWSDT RDISAFQNRQ QTQAQAPTQP RIPVQANSWH RNNQGKIELI ADQSLTQGQT ALTCMAIPQN
|
| |