Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_1390 |
Symbol | |
ID | 3682684 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 1714143 |
End bp | 1717244 |
Gene Length | 3102 bp |
Protein Length | 1033 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 637716727 |
Product | filamentous haemagglutinin-like protein |
Protein accession | YP_321908 |
Protein GI | 75907612 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3210] Large exoproteins involved in heme utilization or adhesion |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.379495 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGTC ATTGCATCAA AATAGGTATC TCTATCATTT TAGGCGTAGG TGCAATAACT GGCATTGAGC ATCCTAGCAA CGCCGAAATT ATTCAAGATA GTACACTACC TGAAAATACT AATGTGACTG TTAATTTCGG TAAAGAGGGT GATTCTACCT TCATTATTAA TGGCGGAACA GCAATAGGCA ACAATCTATT TCATAGTTTT AGCAAATTTT CCTTACTCAA AGGTCAAAAT GCTTACTTTA ATAGTACTGA CAATATTCAA AATATCTTTA GTCGGGTTAC TGGTGGAGAA GTATCAAATA TTGATGGCTT AATCACAGCT AGAAGCAGGG TTAATTTATT TCTCATTAAT CCCAATGGAA TTATTTTTGG GCCTAATGCT CAGATAAATA TTCAGGGTTC TTTCGTTGCT AGTACTGCTC ATAGTGTCAG ATTTGCTGAG GGTGGAGAGT TTATAGCTAA AACTGCAAGT GTTACGCCCT TACTCAATGT CACAACTCCT ATTGGTTTAC AATATGGGGC TAATTCTGGC AGCATTCAGT TACAAGGAGA ATCAATACAT CAGCCTAGTA GGCTGCAAGT TGAAGGCAAT CAAACTTTAG CCTTAATTGG TGGAGATGTT ATTTTAGGAA ACGCTAATCT TAGCACTATA AAGTCTGAAG GCAGGATAGA ACTAGGCAGT GTTGCATCAG CAAGTTTAGT GGGAATTAAA TCAGTCAGTT CAGGCTTCTT GTTTAAATTT GACGGGGTAT CTGAATTTGG TGATATTCAA CTATCTTCAG GCACTAATAT TAATAGTTTA GGTGACCTCA ATCTCACGGG AAAAAATATT GTACTTCAAG ATGCGTCTTT GAATGTGTTC AATAATCTCA CAGTCAATGC CCAAGAGAGC ATTCAGTTAA TAGATAGAAG TGCAATTGAT ACTAGAAGTT CTAATAATTC GCCAGGAAAT CACACTATTA ATACTCGCAG ATTATTACTG AGTAATCAAT CTTTTATCGG TATAAACGGC GGTAATCTGG AGGTGAATGC CTCAGATGTG GTGGAACTGA CAAGTGATCT CAAAGGTACT CCCAGCCGTA TCTATGCCAC AACTTCAGAT GGTGACGCAG CCAATGGAAA TTTAACAATT AATACTGGGG ATTTATTAGT CGAGAATGGC TCACAAATTA TCACTAATTC TTCTACTGCA TTTTCCGGCA TTATTCCCCT TAATGTGCAG ACAAAAGCTA GTTTAACTAT TAATGCCGCA AATTCAGTGA CTTTGCGGGG TAGTTCATTG GACGAAATAT ATTCTAGCGG TGTGTTTAGT CAAACTGATG GTGATGGCAA GGCTGGAGAC CTGACGATTA ATACTCGTGT GTTACGAATT GAAGACGGGG CGCAGGTCAT TGCCAGAAAT TTGAGTGTCG GTAAAGGCGG AAACTTGACT GTAAATGCTT CTGATAAGGT GCTAATAATT GGTACTTCTC CCAAAGGTTC AATTCCTGGG AACTGGCCTA ACGATTCAGA GCTAGATGAG CAAGTACCTA GAGCTGTCGG CCCCTTAAAA GAGTTGTTGA TAACAGGGGT GTTACGCAGA GATAGTTTAC CTAGTGGTAT ATTTACTGAC TCAGTTTCTT CCGGTGATGC TGGCATGATC ACGATTAACA CTGGTGAGTT AACAGCACAA AATGGAGGGC GAATTAGTGC CGATGCTTTT TCAGCAGGTA AAGGGGGAGA TTTAATGATC AGTGCTACTG ATAAGGTGGA ATTGATTGGT ACTGCTGTCA ATGGGGTTGC TAGTGGTTTG TTCACGAGAA CAGGTTCTTC AGCTACGGGA AATGCCGGAT CTTTGACAAT TGTCACTGGT AATTTATTAG TAAAAGATGG GGCGCAAGTC AGTGTTAGTA CTTTTGGTAC GGCTAAGGGT GGCAACTTGT TGGTGCAGGC GGCTGAGGGG ATAAAACTCA TCGGTGTTTC TCAGAGGAAT ATTGCCAGTG GTTTGTTTGC CCAAGCTAAT CGTTACGCAA CAGGAGATGC GGGTAGTTTA AAGATTGATA CCTCGACCTT ATTGGTACGT GATGGCGCAC AAGTTAGTGC TAGCACTTTC GGGGCAGGTA AAGGTGGAGA TTTGTTTGTC CAAGCTTCTG ATATCAAACT AATCGGTACT GCTGCTGATG CTTCGTTCTC TAGTGGCTTG TTTACTGTAG CAACGGCAAA CTCTACTGGT AGCGCTGGTA AACTCACGGT GAACGCGGAT GTATTGGATA TTGAGCAGGG AGCGGGAGTA GGAGTGCAGA GTAGTGGTAA GGGGAGTGCA GGCAACTTAA ATATTAATGC TCATAGAATC AGATTGGATG ATCAAGCTTT TATCAGTGCC GATACTCGTG ATAATGGGAG TGACCCTAAC CGATCGCAAG CAAATATCAA TCTGCGATCG CGCAATCTCA TCCTATCTCG TGGCAGTAGC ATCACCACTA ACGCCACAGG CAGTAATGTC ATTGGCGGCA ATATAGACAT TGATACTAAC ACTCTAGTTG CTATTCAAAA TAGTGATATT AGTGCTAATT CTGCTGACTT TCGTGGTGGT CGAATCAATA TCAATGCCCA AAATATTTTT GGAACCAAAT TTCGCAATCA ACGTACCCCC AATAGCGATA TTACTGCTAC TGGTGCTAGT CCTGAGTTAA GCGGTGCAGT GGAAATTACT ACCCCTGATG TAGACCCTAG TCAAAGTTTA AGCCAACTCC CATCAGAGGC TGTTGATGTG TCTAATCAAA TCTCTCAAGA ATGCCGGATT GATGAAGCGA CGGCACAGAG ACAAAATCAG TTTATTATTA CTGGACGCGG TGGTGTACCA ACAAACCCCT ATGAAACCTT AGATAACACG GCGATAATTA CGGATTGGGT AACTGTTAAT GATGTCAATA CTGTTGCTCA CAAAGAAAAT AATTTCGCCC AAGAGGAAAA CACTGTTGCC AATAGTATTG TTGAGGCTCA AGGCTGGGTT TATGATACCC AAGGAAATTT GGTTCTGACT GCCGAAGCAA CTAAAATAAC AGGTCATGGT TCAGGATTGA CAACCGATTT CTGCCAGGTA AATAAGGGAT AA
|
Protein sequence | MSRHCIKIGI SIILGVGAIT GIEHPSNAEI IQDSTLPENT NVTVNFGKEG DSTFIINGGT AIGNNLFHSF SKFSLLKGQN AYFNSTDNIQ NIFSRVTGGE VSNIDGLITA RSRVNLFLIN PNGIIFGPNA QINIQGSFVA STAHSVRFAE GGEFIAKTAS VTPLLNVTTP IGLQYGANSG SIQLQGESIH QPSRLQVEGN QTLALIGGDV ILGNANLSTI KSEGRIELGS VASASLVGIK SVSSGFLFKF DGVSEFGDIQ LSSGTNINSL GDLNLTGKNI VLQDASLNVF NNLTVNAQES IQLIDRSAID TRSSNNSPGN HTINTRRLLL SNQSFIGING GNLEVNASDV VELTSDLKGT PSRIYATTSD GDAANGNLTI NTGDLLVENG SQIITNSSTA FSGIIPLNVQ TKASLTINAA NSVTLRGSSL DEIYSSGVFS QTDGDGKAGD LTINTRVLRI EDGAQVIARN LSVGKGGNLT VNASDKVLII GTSPKGSIPG NWPNDSELDE QVPRAVGPLK ELLITGVLRR DSLPSGIFTD SVSSGDAGMI TINTGELTAQ NGGRISADAF SAGKGGDLMI SATDKVELIG TAVNGVASGL FTRTGSSATG NAGSLTIVTG NLLVKDGAQV SVSTFGTAKG GNLLVQAAEG IKLIGVSQRN IASGLFAQAN RYATGDAGSL KIDTSTLLVR DGAQVSASTF GAGKGGDLFV QASDIKLIGT AADASFSSGL FTVATANSTG SAGKLTVNAD VLDIEQGAGV GVQSSGKGSA GNLNINAHRI RLDDQAFISA DTRDNGSDPN RSQANINLRS RNLILSRGSS ITTNATGSNV IGGNIDIDTN TLVAIQNSDI SANSADFRGG RININAQNIF GTKFRNQRTP NSDITATGAS PELSGAVEIT TPDVDPSQSL SQLPSEAVDV SNQISQECRI DEATAQRQNQ FIITGRGGVP TNPYETLDNT AIITDWVTVN DVNTVAHKEN NFAQEENTVA NSIVEAQGWV YDTQGNLVLT AEATKITGHG SGLTTDFCQV NKG
|
| |