Gene Information |
Locus tag | Ava_2349 |
Symbol | |
ID | 3683464 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 2913995 |
End bp | 2916409 |
Gene Length | 2415 bp |
Protein Length | 804 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637717694 |
Product | filamentous haemagglutinin-like protein |
Protein accession | YP_322862 |
Protein GI | 75908566 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3210] Large exoproteins involved in heme utilization or adhesion |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
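The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2913995
end_bp = 2916409
gene_length_bp = 2415
protein_length_aa = 804


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one for the stop codon.

    Assumes an unspliced CDS whose length is a multiple of three
    and which ends in a terminal stop codon.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

print(computed_length == gene_length_bp)       # True
print(computed_protein == protein_length_aa)   # True
```

Both checks agree with the record: 2916409 - 2913995 + 1 = 2415 bp, and 2415 / 3 - 1 = 804 aa.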
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAATGC TACCGTGCTG TGATCCTGTC AGTGGGCAGG TAATTCCCGA TGGTAGTGTG AATACCAGAG TCTCTCAAAG CGGCGATCTC TTCACCATCA CTGACGGTAA TCGAGTAGGA GATAACTTAT TTCATAGTTT CAGTCAATTC TCAGTTCCCA GCAACGGCAC TGCTTTCTTC AATAACGCCT CAGATGTTCA AAATATTTTT AGTCGTGTCA CAGGAGCCAG TGTTTCCCTA ATTGACGGTT TAATACAAGC TAATGGTAAC GCTAATCTAT TTTTGCTCAA TCCCAAGGGA ATCATTTTCG GAGCAAATGC TCAGTTAAAT ATTGGCGGAT CATTCATCAG TACAACTGCC AACAGTATCG AATTTGCCGA TGGCATTGAG TTTAGTAGCA TCAATTCTCA ACCTGCTCCA TTGTTGAGTA TCAATGTTCC CGTTGGTCTA CAACTAGGAA ACAATCCTGC ACCAATTAAC CTTCAAGGTA CAGGACACAC TTTAACCAAT ACCAGTGGAC TGACTCTAGC TCCCCTAATT CGGACTCCTC ATCCCACAAA ACTACAAGTA CCATCGGGAA AAACCATAGC CCTAGTCGGC GGTGACATCA ACCTCAATGG GGCAACCTTA ATCGCGGAAA CGGGAGGAAT AGAATTAGGT AGTGTGATTA GTCCGGGATT AGTCAACCTC ATACCTAGCG CCCAGGGCTA CACCTTGGGA TATAGTAATA TACAACACTT TGGCAATATT CAACTAGCCG AGCGATCGCT ATTAGATATA AGCGGTGTAA ATGCTGGTTC CGTGCAGATT CGAGGTGGGC AACTGCGATT CACCGACGGT TCTCTAATCC TGTCACAAAA TTTCGGCGCT CGCCCTGGTG GTGAAATTCG CCTCCAGACT ACAGAATCCA TTGAATTAAT AGGCAGAACA TCTGATGTTG GAATCCGGGG TGGGATACGT AGTGAAGCCT TAGGTATAGG AAACAGTAGC AATATCAATA TTATTACCCC TATTCTCACC GTCAGTCAGG GAGCAGGTGT GACTAACTCA ACTCTAGGAT CTGCTTCTAG TGGCAATATC AACATTGAAG CAACTGCAAC TTACCTCTCC GGCTTTTCAC CCACAAATCC TGTTGAAGTT ACCACAATTA ATACTTCCAC ACTAGGCAGT GGAAACGCGG GTGATCTTTT CTTCAATGGC AGTAGTCTAT TAATATCAGA TGGTGCTTCA CTGTCTTCGA CCACCTTTGG CAACGGTTCA AGTGGTAAAG TCACAATTCG CAACACTAAT ACCACCGTCA TCGGAGAAAG TCCCTCTGAA CTGTACAGCA ACATTAGCTC AACGACATTT GCAACTGGAA ATGCTCAAAC CCTGACCCTA GATACTAAGA ACTTACAAAT CTTGGATGGG GGAGCAGTGG CCACCACAGC ATTTTTTGTA GGTAGTGGCG GAGACTTAAG CATCAACGCC AGCGAATCCA TCACCATTAG TGGTCAAGGT CGAACCATTA GCAGTAGCAT TAACGCCTCC ACTTTGCGAC CCGATCCTTT TTTACGACAA AGATTTGGTC TGCCGGACAT ACTAACAGCA AATCCCGGCT CTGTCAGCAT CACCACACCT AAGCTAACAT TAACCAACGG TGGAACCGTC GGTGTCACAA ATCAAGGTAG CGGCAACGGT GGTAATATGA GCATCGCTAC TAATACCATC CAATTAAAAA ATCAAGGTTT TATTCAAGCA CAGACAAAAT TTGGCAACGG TGGGAATATT CAATTATCTG CAACAGACCT ATTGCTATTG 
CGGCAAAATA GCCTAATTAC CTCCACATCT GGTGGCTCAG GAAATGGAGG TAACATCAAC ATCAATGCAC CCATCATCAC GGGACTAGAA AACAGCGACA TTATTGCTAA TGCCATTAGA GGTCAGGGAG GCAATATTCA AATTACCACT CAAGGTATGT TTGGACTAAA ATTTCGTGAC CAAATCACCC CGGAAAGTGA CATTACAGCC AGTTCCCAAT TTGGATTAAG TGGCACAGTC CAAGTTAACA CTGTGGGAGT TGACCCAAAC TCAGGTTTAG TCGAATTACC AGCCAATGTT ACCGATCCAT CCCAGAAAAT AGCTACAGGT TGCTCTAATA ATCAAGGTAG TAGTTTTATC GCCATAGGAA GGGGTGGAGT ACCGCAAAAC CCGACACAAG AAATCAGGAG CGATCGCTCC TGGTCAGACA CCCGCGACAT CTCTGCATAC CGGGAAACAC AGACAGTCCA AGCTCAAATC CCGCAACCTC CAGAAACGCT TGTCCAAGCA ACGTCCTGGC AACGTAACCC TCAAGGTAAA ATTGAATTAG TGGTGGCTAA ATCACCTCCG CAGATGCCAC CAGCCCTAAC CTGTGCTGCT GTAGTTGAAA ATTAA
|
Protein sequence | MTMLPCCDPV SGQVIPDGSV NTRVSQSGDL FTITDGNRVG DNLFHSFSQF SVPSNGTAFF NNASDVQNIF SRVTGASVSL IDGLIQANGN ANLFLLNPKG IIFGANAQLN IGGSFISTTA NSIEFADGIE FSSINSQPAP LLSINVPVGL QLGNNPAPIN LQGTGHTLTN TSGLTLAPLI RTPHPTKLQV PSGKTIALVG GDINLNGATL IAETGGIELG SVISPGLVNL IPSAQGYTLG YSNIQHFGNI QLAERSLLDI SGVNAGSVQI RGGQLRFTDG SLILSQNFGA RPGGEIRLQT TESIELIGRT SDVGIRGGIR SEALGIGNSS NINIITPILT VSQGAGVTNS TLGSASSGNI NIEATATYLS GFSPTNPVEV TTINTSTLGS GNAGDLFFNG SSLLISDGAS LSSTTFGNGS SGKVTIRNTN TTVIGESPSE LYSNISSTTF ATGNAQTLTL DTKNLQILDG GAVATTAFFV GSGGDLSINA SESITISGQG RTISSSINAS TLRPDPFLRQ RFGLPDILTA NPGSVSITTP KLTLTNGGTV GVTNQGSGNG GNMSIATNTI QLKNQGFIQA QTKFGNGGNI QLSATDLLLL RQNSLITSTS GGSGNGGNIN INAPIITGLE NSDIIANAIR GQGGNIQITT QGMFGLKFRD QITPESDITA SSQFGLSGTV QVNTVGVDPN SGLVELPANV TDPSQKIATG CSNNQGSSFI AIGRGGVPQN PTQEIRSDRS WSDTRDISAY RETQTVQAQI PQPPETLVQA TSWQRNPQGK IELVVAKSPP QMPPALTCAA VVEN
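The sequences above are displayed in space-separated blocks of 10 characters. The following is a minimal sketch of how such a block-formatted gene sequence might be joined, summarised by GC fraction, and translated. It assumes the standard codon-to-amino-acid mapping, which translation table 11 shares with table 1 for elongation; the helper names `join_blocks`, `gc_fraction` and `translate` are hypothetical, and alternative start codons are not handled.

```python
# Standard codon mapping in NCBI "TCAG" order. Translation table 11 uses the
# same mapping for elongation; it differs from table 1 in permitted starts.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon.

    Alternative start codons (e.g. GTG, TTG) are NOT converted to M here;
    this gene begins with ATG, so that case does not arise.
    """
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and last two blocks of the gene sequence in this record.
head = join_blocks("ATGACAATGC TACCGTGCTG TGATCCTGTC")
tail = join_blocks("GTAGTTGAAA ATTAA")

print(translate(head))  # MTMLPCCDPV
print(translate(tail))  # VVEN
```

The two translated fragments match the first ten and last four residues of the protein sequence listed above. Applying `gc_fraction` to the full joined gene sequence would give a value to compare with the GC content field.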