Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_2352 |
Symbol | |
ID | 3683467 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 2921564 |
End bp | 2924026 |
Gene Length | 2463 bp |
Protein Length | 820 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 637717697 |
Product | filamentous haemagglutinin-like protein |
Protein accession | YP_322865 |
Protein GI | 75908569 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3210] Large exoproteins involved in heme utilization or adhesion |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTAA CCTTGGTTGG GTTTGGCTTT CTGGGTATAT TCTGCCTATC TGCTGTTGAT AATAATTGCG TTTACGCACA AGTAATTCCC GATCACACCC TCAAAACTTC TGTAAGTGGC AGTAATCCTT ACATCATCAC TGATGGTACT CGTATCGGCA ATAATTTATT TCATAGCTTC AGCCAATTCT CTATTCTTAA AGGTGATTCT GCATCATTTG ACAATGCTAT AGATATTCAA AATATTTTTA GTCGTGTGAC TGGCGGTAAT ATTTCCAACA TTGATGGTTC CATCAGTGCT AATGGTAACG CCAACTTATT CTTAATCAAC CCGGCTGGGA TTATATTTGG GCAAAATGCC AGTTTAAATA TTGGTGGATC ATTTGTCGGG ACAACAGCTG AGACTATCAC ATTTGCTGAT GGGAAGAGAT TTAGTAGCAC AGACACTGAT ACATCACCCT TGTTAACCAT GAATGTACCC ACTGGGCTGC AATTTGGTAG TAATTCAGGA GGGATAACAG TTCAGGGTAC AGGACATAAT GCCCAATTAA GTGACTCACT ACAAGTTTCT GGACTAAATC TCAGTACCAG AGGATTACAA CTACAACCAG GAAAAACACT AGGGTTATTA GGCAGCAATA TTGCCTTGGA TGGTGGACTG CTCTCTGCAC CAGGAGGACT AATAGAATTG GGTGGTATCA CCAGTGGCAA CGTCATTCTC AATTCTACTC CGCAAGGATT TGCCTTCCAG TATTCCCAAG TTCCGAGTTT TGGCAATCTT CAAATCACTC AACGAGCTTT AGCATCAACA CGGGATTTCA ATGGGGAAAG TGGAGGAGCC ATCCATATCC AGGGAAAACA AGTCAGTATC CGCGATGGTT CGTTGATATT AGTACAAAAT CGCAGCCAAA AGACAGCTGG TGATATTGCT GTTGATGCTT CAGAGTCTTT AGAACTTATT GGCAAGTCTC CTGATTTTCA AAGTTCCAGC AGTTTAGTGA ATGAAAGCAC TTCCTCTGGT AAGGCTGGGA ATATTATCAT TAACACTCCA CGATTGAACA TTGATCAGGG TGGGTATATT TATAACCGGA CGTTCAGCAC AGCACCAGGT GGCAATATTA TCGTCAATAC CGATGAAACG CGGGTTAATG GTTTTGCTTA TGGTGATCCT TCCGCTTTTC GAGCAGTTAG TCAAATATTA GCTGCTTCCT ATGCAGGCGG GAAAGGCGGT AATATTTCTG TCTCTACTCA AAAGCTATCT GTTTTAGCTG GGGGAAATAT AGCAGCCAGA CCCTATGCTG GAGGTAATGG GGGTGATGTA ACAGTAAAAG CAGATACGAT AGAAGTAGAT AATACAGGCT CTCCGACTGG TTTTTATTTT AGCTTAATTT CGACTGCCAC CTTTGGGTCA GGTAACGCGG GAAATTTGGC ACTCGATACT CGTAAATTAT CCGTGCAAGC TGGCGGCAGA GTGTCTGCTT CTAGCATAAT TTTAGGTAAT GCTGGTGTAC TCACTATTAA TGCTTCGGAA TCGATTGATG TAAGTGGGGT TAAGGATGCA GACAATCCCA GCTATATCGG CACTGCTGTG CGTCCTTTTG GTGCATTTGC CCGAACTTCT CGTGCTAATT CAGGTAACAC CACCATAAAC ACTCCAGTTT TGAATGTTAC TGATAGGGCG ACAGTTTTTG TGGAAAATTC TGGTTTGGGT ACGGCTGGTA CGCTGCAAAT TAATGCCAAC ACCCTTAAAC TTGACAATCA TGCAAGTATT CTAGCATCTA CAAAAGCTGG AGAGGGAGGC AACATCAATC TTCAACTACG GGATGTATTA TTGATGCGTC ATGGTAGCTT TATCAATGCC GAAGCAGGTA GTAATGGCAA TGGTGGCAAT ATAAGTATTA ACTCTCCTAA TGTTGTTGGT CTAGAAAATA GTGACATCAT TGCCAATGCC GTACAAGGAA AGGGTGGCAA TATTCAAATC ACAACTCAGG GAATTATTGG TTTGGAGTAC CGTAATCTTC TCAATCCTAG AGAAATTTTG AGTAATGACA TTACCGCTAG TTCCCAATTC AGTATTAGTG GTACAGTCCA AATTAATAAT GTTGGTGTTG ATCCCAATTC TGGTTTAATA GATTTACCGA CCAATCTCTC AGACCCATCA CAACAAGTTG CCACGGGTTG TTCTAATACT AATAGTAGTA GTTTTGTGGC TACGGGACGG GGTGGAATAC CACAAAATCC TACACAACAA ATGAGGAGCG ATCGCACTTG GTCTGATATC CGTGATATCT CTGCATTCCA CACCAAAGAA CTAGCACAAG CCCAAATACC ACAATCCCCA GAAAAACTTG TCCAAGCTAC TTCCTGGCAT CGTAACGCCC AAGGCAAAAT CGAGCTGGTT GCCGCTAAAT CCTCTTCGCA GATGTCACCA ACCTTAACCT GTGCTGCTGC ACCTCAAAAT TAA
|
Protein sequence | MKLTLVGFGF LGIFCLSAVD NNCVYAQVIP DHTLKTSVSG SNPYIITDGT RIGNNLFHSF SQFSILKGDS ASFDNAIDIQ NIFSRVTGGN ISNIDGSISA NGNANLFLIN PAGIIFGQNA SLNIGGSFVG TTAETITFAD GKRFSSTDTD TSPLLTMNVP TGLQFGSNSG GITVQGTGHN AQLSDSLQVS GLNLSTRGLQ LQPGKTLGLL GSNIALDGGL LSAPGGLIEL GGITSGNVIL NSTPQGFAFQ YSQVPSFGNL QITQRALAST RDFNGESGGA IHIQGKQVSI RDGSLILVQN RSQKTAGDIA VDASESLELI GKSPDFQSSS SLVNESTSSG KAGNIIINTP RLNIDQGGYI YNRTFSTAPG GNIIVNTDET RVNGFAYGDP SAFRAVSQIL AASYAGGKGG NISVSTQKLS VLAGGNIAAR PYAGGNGGDV TVKADTIEVD NTGSPTGFYF SLISTATFGS GNAGNLALDT RKLSVQAGGR VSASSIILGN AGVLTINASE SIDVSGVKDA DNPSYIGTAV RPFGAFARTS RANSGNTTIN TPVLNVTDRA TVFVENSGLG TAGTLQINAN TLKLDNHASI LASTKAGEGG NINLQLRDVL LMRHGSFINA EAGSNGNGGN ISINSPNVVG LENSDIIANA VQGKGGNIQI TTQGIIGLEY RNLLNPREIL SNDITASSQF SISGTVQINN VGVDPNSGLI DLPTNLSDPS QQVATGCSNT NSSSFVATGR GGIPQNPTQQ MRSDRTWSDI RDISAFHTKE LAQAQIPQSP EKLVQATSWH RNAQGKIELV AAKSSSQMSP TLTCAAAPQN
|
| |