Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_2530 |
Symbol | |
ID | 3682415 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 3129325 |
End bp | 3131655 |
Gene Length | 2331 bp |
Protein Length | 776 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 637717875 |
Product | filamentous haemagglutinin-like protein |
Protein accession | YP_323040 |
Protein GI | 75908744 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3210] Large exoproteins involved in heme utilization or adhesion |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.957928 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAATT TTTGGGTCTG GATTCCCAAG CTAGGAGTTG CGATCGCCGG CGTATTAACC TGCATGACAG ATTATGCTGT CGCGCAAGTT GTTCATGATA ATACACTAAT TAATAATTCC CAAATTAATA CACACAATAA TATTATTTCT ATTGAAGGTG GAACTAGAGC CGGAGATAAT CTTTTCCACA GTTTTGAGCA GTTTTCTGTA GCTCAAGGAA TTACAGCCGA GTTTAAAAAT GCTGTGGATG TAAAAAATAT TATTAGCCGA GTGACGGGTA ATTCTATTTC TCAGATTGAT GGCATTCTGA AAAGCCAATA TGAAGCCAAT CTTTTTCTGA TTAATCCTAA CGGCATTATT TTTGGTTCTA ATGCTTCTTT GAATATTGGC GGTTCATTCG TAGCGACTAC GGCAGATACT TTGATTTTCG CTGATAGAAG AAATTATAGT ACAAGAGAGT TATCCCTTGT ACCATCTCCT TTATCAATTA GTGTTCCGAT TGGTTTACAA TTTGGTGCAA ATCCAGGTTC TATTCATAAT CAATCTCAAA CAAGCCCTAA TGGAGCTATG AATAGTCTTT TTGAGCCAGT TGGTCTGCAA GTGAAACCAG GTAAAACCCT AGCTTTAGTT GGTGCTGGCT TGATCATGGA AGGTGGGAAT CTGACTGCTC CTTTGGGTCG CATTGAACTA GGAAGTGTAG GAAGTAATAG TTACGTCAGT CTAAGTCTAG TAAATCAAGG TTGGATATTG GGATATCAAG GTATTCAAGA TTTTCAAGAC ATACGATTAA TTCAAAGAAC TGTTGATGGT TCTGCATTTC CATCTCAAAT AGATGTGGGC GGTGATGGAG GCGGTAATAT TCATATCCAA GGTAAAGCTG TAGAGATGAA TGGCCGTGAT GTCCGCTTAT CGACTCAAAC ATTGGGTTTC AATAAAGCTG GAGATATAAC GATTGACGCA AACAAATTAG TAATTCGTAA TGGGGCGCAA ATATTAACAT TCACTTTCGC TCAGGGAGCA GGTGGAAATT TGGTTATCAA CGCGGCTGAA TCTGTAGAAG TGATTGGTGG CTTTCTTGTA CCGAATACAA ATCGTGTTGA ACCCAGTGCA TTGACAAGTT CAACTGCGTC TGATGGTAAA GCGGGTGATA TCACAATTAA TACAAAAAGA TTAATTGTCC AAGTTGGGGG AAACATATCT ACCAGTTCCC TTGGGTTACC AGATTTTAAT AACGAAAGAA TAATATTAGC TACAGGAGCA GGAGGAGATT TAATAGTCAA TGCTGCTGAT TCTGTAGAAA TAAGTGGCAA AGCCAGAAAT GTTCCTAGTT CTTTGTCCGC TTCTACAAAT AGTTATGGTA ATGCTGGCAA GTTAATGATA ATTACAGGAA AATTGACTGT AAAAGATTCA TCTGAAGTAG CTGTTGGCAG CGTCGTTTTA AAATTTGCTG GACTGAGAGT TGATAATAGC AATTTAGGCT CTGCTGGGGA ACTAAACATA ACTGCTCGTT CTATAGTTCT GGATAACCAG GGAAAACTTT TATCAAATAG TGAATCAGGT CAAGGTGGTA ATATTGAGAT CCAGGTGCAA AACACTCTAC TGATGCGCCG TAACAGCCAA ATCTCCACAA ATGCAGGTAT CACATCTGGG CGAGGCGATG GGGGTAATAT TATCATCAAT GCACCTAATG GTTTTATTAT TGGCACTCCT AATGAAAATA GTGATATCAC CGCCAACGCA TTCTCTGGTA GTGGTGGGAA AATCACTATA AATACTAATA GTGTTTTGGG GTTTGTACCT CGTAATCGTG CCGAATTAGT AAAGTTGCTA GCAACTGAAA ATCCTAAAAA ACTAGACCCA AATAAGCTTC CTAGTAATGA TATTACAGCT TTTTCTCAGG AAAACCCTTC ATTCAATGGC ATCGTACAAA TTAATACACC TGATGCTGAT CCTAGTAAAG GGTTAATAGA ATTGCCGAGA AATATCATTG ATACTTCTGA GAAAATTGTT GCTAGTTGTC ATCCTAGTCG AATTTCGCAA AGAAGTTCAT TTGTTTCTAC AGGAAGAGGA GGAATTGCAT CTAGCCCTAC GGATACGCTG ATTGATGATG CTGTGTTGGT AGGTTGGGCT GCATTACCGA CACAGATAGA AAGTAGTGCT GATAGTATTC CTGAACAACA CGCACAGCAG GATAATTTGA ACGTCTCTCC CCAAATTTTA GAAGCACAAG GCTGGCAAAT AGATCGCGAC GGTAATATAG TACTGGTGGC GCAAGCACCT ACTGTAAATC CTCATACTCC AACACTACAT TCGTCAGCTT GTACAGAATA G
|
Protein sequence | MNNFWVWIPK LGVAIAGVLT CMTDYAVAQV VHDNTLINNS QINTHNNIIS IEGGTRAGDN LFHSFEQFSV AQGITAEFKN AVDVKNIISR VTGNSISQID GILKSQYEAN LFLINPNGII FGSNASLNIG GSFVATTADT LIFADRRNYS TRELSLVPSP LSISVPIGLQ FGANPGSIHN QSQTSPNGAM NSLFEPVGLQ VKPGKTLALV GAGLIMEGGN LTAPLGRIEL GSVGSNSYVS LSLVNQGWIL GYQGIQDFQD IRLIQRTVDG SAFPSQIDVG GDGGGNIHIQ GKAVEMNGRD VRLSTQTLGF NKAGDITIDA NKLVIRNGAQ ILTFTFAQGA GGNLVINAAE SVEVIGGFLV PNTNRVEPSA LTSSTASDGK AGDITINTKR LIVQVGGNIS TSSLGLPDFN NERIILATGA GGDLIVNAAD SVEISGKARN VPSSLSASTN SYGNAGKLMI ITGKLTVKDS SEVAVGSVVL KFAGLRVDNS NLGSAGELNI TARSIVLDNQ GKLLSNSESG QGGNIEIQVQ NTLLMRRNSQ ISTNAGITSG RGDGGNIIIN APNGFIIGTP NENSDITANA FSGSGGKITI NTNSVLGFVP RNRAELVKLL ATENPKKLDP NKLPSNDITA FSQENPSFNG IVQINTPDAD PSKGLIELPR NIIDTSEKIV ASCHPSRISQ RSSFVSTGRG GIASSPTDTL IDDAVLVGWA ALPTQIESSA DSIPEQHAQQ DNLNVSPQIL EAQGWQIDRD GNIVLVAQAP TVNPHTPTLH SSACTE
|
| |