Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_4181 |
Symbol | |
ID | 3681042 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 5231460 |
End bp | 5233739 |
Gene Length | 2280 bp |
Protein Length | 759 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 637719528 |
Product | filamentous haemagglutinin-like protein |
Protein accession | YP_324675 |
Protein GI | 75910379 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01731] adhesin HecA family 20-residue repeat (two copies) [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000153742 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGTCCGTAG TATCAATATT GACCAGTGAT ATTAGCTCGG CACAAATTAC ACCGGATACG ACTTTGGGTA ATCAAAGCTC CCGTGTTACA ACTGGTGTCA ACATCAAAGG ACATGAGACT GACTTAATTG AAGGTGGTGT CCAACGAGGA AGCAGCCTGT TTCACAGTTT TACAGAATTT AATGTCAACA ACGGACAAAG AGTATATTTT GCCAACCCTA CAGGCATTGC AGATATTTTT AGTCGAGTAA CTGGGAGTAA CGCTTCTCAT ATTTTGGGTA CTTTGGGCGT ACATGGTGCA GCTAATCTAT ATCTGCTCAA TCCCAATGGT ATTATTTTTG GGACAAATGC CCAGTTAGAT ATTCAAGGTT CTTTGTTTGC TACCACAGCC GACAGTTTTG AATTTCCTGA TGGTAGTGAA TTTAGTGCCA CTAATCCCCA AGCACCACTG TTAACAATGA GTGTACCTGT AGGTGTACAG TATGGTTTAC AGCAAACAGC AAGTATTACT AATCAAGGGA ATTTGGTAAC AGGAAAAAAT TTGACGCTAA ATGCTGGAAG TTTAGATTTA CAGGGCAAAC TCGAAGCTGG TGGTGATATT TCCCTAAACA GTCAGCAAGG AACAATTGAT ATAAATGGGG AGATTATATC TACAACTCCC AATGGAATTG CTGGAGATGT TACGATTCAA GCTGTAGGCG ATATTCATAT ATTTAAGAAC ATAAAAGCCT CTAGCAACAG TGATATTATC ACTAATAATG ATCAGAAAAG ATATAATAAT ATCGCAATTA ATTCTACCGG AGGTTCTATT TATTTAAATG GTTCTCAATT AAATGTTACT AACTCTGGTT CCGGCTTTGC CGGAGATATC ATGATTAACG CTAATGACCA AATATCCATT CTCAATGGTA GTAAGATATC TAGTAATGGA AATATGGGAC GGATATTTAT TGGCGTTGGC GATAGTAATA AAATTACGCC TAAAAGAGTT AATATTGATA ATTCTGGTCT GAATGTAGAT AATTATTCAT CTGATGGTAA TACAGGAAGT ATAACTATTT ACAGCTTTGA AGAAATTTCG ATTAAAAATA GTTATATCTT AAGTGGCAGA GGCAGCATAG CTAGTAGCCG TAATGATGAC GGCAGTGGGG CAAATGGTGG CACTGTAAGT ATTGAATCAC AAGGCTTAGT AAGTTTGGAC AGTTTAGTAA TATTAGCTGA TGTTTTTTCT TCAGACCAAA ATGGTACTGG AGGTGACATT AATATTTCCG CAGGCTCACT AATTATTAAG AATGGGTCTG AATTAGATAC TTCCACATCT GGTAAAGGGA ATGCGGGTGA TGTGACTATT ACGGCTTCAG ATTCTATTTT GTTAGACAAT AGTAAAATTT CCAGTAGTCT TAGGAACGCA GCAACAGGTA AGGGTGGAAA TATTGCAATT ACAACTAATT TTTTGACTGC TGACAAAGGA ACAATAGAAG CTCAGAGTCC AGGTAAGGGA GATGGTGGCA ATATCAACAT CACAACACTA AATTTGTGGC TTCGCCGCCA GAGTAAAATT TCCACTACAG CAGGTGCAGA AGGATCGGGT GGTAATGGAG GTAGCATCAG GATTAATGCT AAGGGTGGAT TTATCGTAGC AGTACCATCA GAAGACAGTA ATATCGTTGC TAATGCTTTT GGTGGTAATG GTGGCAAGAT TAACATTAGT GCCAACCGTA CTCTAGGATT TCAAAATCGG GGAAAGTTGA GTCCAAATGA ACTAAACGCG ATTATTACCA ATGGCACTAG CGAAATTAGC GCCAGTTCTG ATTATGGTGA AGACGGGGAG GTAGCGATTG AAACCCTGAG CATTGACCCT ACTCAGGGGC TAGTTGAATT GCCGACACAG TTAGTAGACC CTTCTCGATT AATTGCCCAA GGCTGTGGCT CTCCTAACAA TAGGGTTGCT AAAGGGCAGA GTGAATTTGT GATTACTGGA CGTGGGGGAC TACCGCCTAG TCCTGATGAT ATGCTCAAGC CTGGGATTAG GTCGCCGGAA TGGGTGGTAA ATAATACTAG AGATTACAGT AATAATTCTG AAAGCATAAT TGAAAAAGAG ATGAATTTAA AGCAATTATC GTCAAATACT TCTACTCCGT TAGTAGAAGC AGTGGGGATG GTTCACAATG CCAATGGAGA TGTTGTTTTG ACTACTCAAC CAGCAGTCGC CACTCCACTA CATTATTCGG GGTTATCTAG CCAAGTCTGT GGTCTTATTC AAGGGAATGT CAGAGAATGA
|
Protein sequence | MSVVSILTSD ISSAQITPDT TLGNQSSRVT TGVNIKGHET DLIEGGVQRG SSLFHSFTEF NVNNGQRVYF ANPTGIADIF SRVTGSNASH ILGTLGVHGA ANLYLLNPNG IIFGTNAQLD IQGSLFATTA DSFEFPDGSE FSATNPQAPL LTMSVPVGVQ YGLQQTASIT NQGNLVTGKN LTLNAGSLDL QGKLEAGGDI SLNSQQGTID INGEIISTTP NGIAGDVTIQ AVGDIHIFKN IKASSNSDII TNNDQKRYNN IAINSTGGSI YLNGSQLNVT NSGSGFAGDI MINANDQISI LNGSKISSNG NMGRIFIGVG DSNKITPKRV NIDNSGLNVD NYSSDGNTGS ITIYSFEEIS IKNSYILSGR GSIASSRNDD GSGANGGTVS IESQGLVSLD SLVILADVFS SDQNGTGGDI NISAGSLIIK NGSELDTSTS GKGNAGDVTI TASDSILLDN SKISSSLRNA ATGKGGNIAI TTNFLTADKG TIEAQSPGKG DGGNINITTL NLWLRRQSKI STTAGAEGSG GNGGSIRINA KGGFIVAVPS EDSNIVANAF GGNGGKINIS ANRTLGFQNR GKLSPNELNA IITNGTSEIS ASSDYGEDGE VAIETLSIDP TQGLVELPTQ LVDPSRLIAQ GCGSPNNRVA KGQSEFVITG RGGLPPSPDD MLKPGIRSPE WVVNNTRDYS NNSESIIEKE MNLKQLSSNT STPLVEAVGM VHNANGDVVL TTQPAVATPL HYSGLSSQVC GLIQGNVRE
|
| |