Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_4131 |
Symbol | |
ID | 3681207 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 5145463 |
End bp | 5148009 |
Gene Length | 2547 bp |
Protein Length | 848 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 637719477 |
Product | filamentous haemagglutinin-like protein |
Protein accession | YP_324625 |
Protein GI | 75910329 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3210] Large exoproteins involved in heme utilization or adhesion |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.343736 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0350194 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCAACT TACAAACAAC AAAAATTTTA TTTTTTTTTA CATTGTGTCT AGTGCTAGAA ACACCTTTAA AAGGATTAGC ACAAACTCAA CTGAACCCTG ATCATACTCT ACCCACTAAT GTCAATAGTA TTGGTGGTGT GTACGATATT ACTGGCGGCA ATAGACCGAA TAATGGTGCT AATCTTTTTC ACAGTTTCCA AGATTTTTCA ATTCAATCAG GAGATACTGC TAGATTTATT TACGATACAG GAATTAGCAA TATCATCACA CGGATTACGG GAGGGTCACC TTCCCAAATT AACGGGACTA TTCAAACACT TCTCAATGGT ACTAATAATA TAGGTAATGC CAATTTATTT CTGATTAATC CACATGGAAT TATCTTTGGT GCAAATGCCA AATTAGATAT AGGCGGCTCA TTTATTGGGA CTACAGCAGA TAGTATCAAA TTCAATGATG GGAAAGAATT TAGTGCTATC AACCCTACAG TTAACCCGAT TTTGACTGTT AATGTACCTA TCGGTTTACA ATTTGGTTCT CACCCCACCA GCACCATTCA AGTACAAGGT TCAGGCAACA ATTTCCAACT CAATCCTGAT TTATCTGTTG ATAACAGTAA CCGTCCATCA GGATTGAGCT ATCAAACCCC AAATGCTCAG ACTTTAGCAC TAGTTGCAGG CAAGGTGGAA TTAGCTGGAG GAAATATTAC TGTACCCCAG GGAAGAATTG AATTATGGTC TGTGAATAGA GGTGAGGTGA CAATCACCAA TCCTAGTGGA CATCTGCAAC TACAACCCAC ACCAGGAATT AGTTACGGTA ATGTTGACCT TGTAAATGCT GCTTCTGTAG ATGCTAGTGG TAATAGTGGT GGTTCTATCG AGGTGCGAGG GCAAAATGTG ACTCTCGACA ACGGTTCAGT CATAGTTACA GATACCACAG GTAGCGGTTC TGGAGGGATA TTGAATATAT CTGCATCAGA GGTATTGACT GTTAAAGGCT TTGTTTTGAA CCCTAATAAC CAGATATCTA GCGGTATATC AGCTGATGTT GCTTCAGGTG CAAGTGGAGA AGGAGGTAAA GTCACAGTTA CTACAAAAAC TTTGCAAGTG AGCAATGGGG GTCAAATTTC CAGTGGTACT TTTGGCACTG GAAATGCTGG AGAATTGAAC GTTACAGCTC AGGATGTGCA GATACGTGGT ATTTCTCTCT TTGGGCCTAG TGGTTTATTT GCTCCTGTTG CTCCTGGCGC AAGAGGAAAC GGGGGAAACT TAACAGTTGA AACTAATAAA TTACAAGTTA CTGATGGTGG ACAGATATTT ACTAATACCT TGGGCTTTGG TAAAGCTGGT GACTTGAAAA TTCTCGCTCA AGATGTAGAG GTCAGTGGTG GGACAGAATT TGGGCCTAGT ACCATTGCAG CCACAGTCCA AAAGATATTG AGTATTCCAG AGCCAGCTGC AACTTTTTTA GGCGCTGGTT TTGGTAATGC TGGTAATTTA ATCATTGAAA CCAGCAATTT ACGAGTTACT GACGGGGGTC AGATTGCTGT TAGCACCTCT GGGAATGGCT CGGCTGGTAA CATGACAATT AATGCTAACT CAGTAGAATT AGCAGGTACT AATCAATTTG GTCGTAGTGG TTTATTCGCT AATGCTATTG TTGGTAAAGG TCAAGGTGGC GATGTTAATA TCAGTAGCGA TCGCTTAGTT GTTCGTGATG GTGCAACTAT TAATGTCAGT AGTTTCCTTA GTAGAGACCC AGGAAATCTG CGGGGTTTAG CTGGAAAAGG GGCGGCGGGA AATATAAATC TCAATTCTGC TGATATTTTA CTAGCAAATC AAGGGATCAT TACTGCTGAT ACTAATGCCG GGGATAAAGG CAATATTACG ATTCAATCGG ACACCCTGCA AATACTACGT GGTAGTCAAA TTAGCACCAA TGCGCGCAAT AGTGCAGTTG GGGGAAATAT TAATATTACT ACCAATACTT TAGTTGCTTA CGAAAATAGT GATATTAGTG CTAATGCTCA AAAAGGTTTT GGTGGTAGGG TAGTTGTCAA TGCCAAAGCA GTTTTTGGGA TTCAATTCCG TCCCCAACCG ACTCCAGACA GTGACCTGAC GGCTTCTTCT GATTTGGGTG CGGAGTTTAA TGGTACTGTA GAACTGAATA CACTAGATGT TGATCCTACT AGCGGATTAG TGAAGCTACC GACTAACTTT AGCGATCGCT CACAGCAGAT AGCTAGTGGT TGTAGTGTGA CGCAAAAGAA TCGTTTCGTT GTTAGTAACC GTGGTGGCTT ACCCACCAAC CCTACCGATA CCCTCAGAGG TGAGATAGTT TGGTATGATG TCCGTGATTT ATCCAATGAA GTAGCTAACT CAACAGCAGG CAGTAACTAT CAAACTGTTA ATAATCAAGA ACCAATTGTT GAGGCTCAAG GATTAATTGT TGGTGCAGAT GGTACAATGC AGCTACTAGC ATCCATCCCA CAGGTAACGC CTCTCACTCC GTGGCAAGTA TCACCTTCAT GTGATGTTAA ACCCTAA
|
Protein sequence | MFNLQTTKIL FFFTLCLVLE TPLKGLAQTQ LNPDHTLPTN VNSIGGVYDI TGGNRPNNGA NLFHSFQDFS IQSGDTARFI YDTGISNIIT RITGGSPSQI NGTIQTLLNG TNNIGNANLF LINPHGIIFG ANAKLDIGGS FIGTTADSIK FNDGKEFSAI NPTVNPILTV NVPIGLQFGS HPTSTIQVQG SGNNFQLNPD LSVDNSNRPS GLSYQTPNAQ TLALVAGKVE LAGGNITVPQ GRIELWSVNR GEVTITNPSG HLQLQPTPGI SYGNVDLVNA ASVDASGNSG GSIEVRGQNV TLDNGSVIVT DTTGSGSGGI LNISASEVLT VKGFVLNPNN QISSGISADV ASGASGEGGK VTVTTKTLQV SNGGQISSGT FGTGNAGELN VTAQDVQIRG ISLFGPSGLF APVAPGARGN GGNLTVETNK LQVTDGGQIF TNTLGFGKAG DLKILAQDVE VSGGTEFGPS TIAATVQKIL SIPEPAATFL GAGFGNAGNL IIETSNLRVT DGGQIAVSTS GNGSAGNMTI NANSVELAGT NQFGRSGLFA NAIVGKGQGG DVNISSDRLV VRDGATINVS SFLSRDPGNL RGLAGKGAAG NINLNSADIL LANQGIITAD TNAGDKGNIT IQSDTLQILR GSQISTNARN SAVGGNINIT TNTLVAYENS DISANAQKGF GGRVVVNAKA VFGIQFRPQP TPDSDLTASS DLGAEFNGTV ELNTLDVDPT SGLVKLPTNF SDRSQQIASG CSVTQKNRFV VSNRGGLPTN PTDTLRGEIV WYDVRDLSNE VANSTAGSNY QTVNNQEPIV EAQGLIVGAD GTMQLLASIP QVTPLTPWQV SPSCDVKP
|
| |