Gene Information |
Locus tag | Ava_2811 |
Symbol | |
ID | 3681647 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 3477221 |
End bp | 3480643 |
Gene Length | 3423 bp |
Protein Length | 1140 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 637718157 |
Product | filamentous haemagglutinin-like protein |
Protein accession | YP_323319 |
Protein GI | 75909023 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
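The coordinate and length fields above are consistent with each other, and the relationship can be checked with a little arithmetic. The sketch below is an illustration, not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TGA). The function names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which is not translated.
    return gene_len_bp // 3 - 1


length = gene_length(3477221, 3480643)
print(length)                  # 3423
print(protein_length(length))  # 1140
```

Both values match the Gene Length (3423 bp) and Protein Length (1140 aa) fields reported for Ava_2811.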
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0282223 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.417754 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTAAAC GAAAAAATCT AAATTGGGTT ATTAACTTAG GTTTATTATT TAGTTTGGTT GTGGCAACAA ATAACCGAAC TTCGGCACAA GTCACAGCCG ACCAAACCTT AGAAACACAA GTTATAGATA TTGGTTTCAA CTCCTTTATT TTAGGTGGCA CAACAGTTGG CAACACTAAT TTATTCCACA GTTTTGCTAG TTTTAATGTT CCTAGTAATG GTGCAGCAAT TTTTATCAAT GACCCTAGTT TAACTAATAT ATTTGCACGA GTAACAGGTG GCACAGCTTC CGATGTTCAA GGCAGAATAG GTACTCAAGG GACTGCCAAT TTATATTTAA TCAATCCCAA CGGCATTATT TTTGGTACAA ATGCTAGCTT AAATATAGGT GGTTCTTTTG TCGCTACCAC AGCTAATGCT ATTCAATTTC CTGGTGGTGC TGAATTTTCT TTAAATTCAT CAGTTTCACC AGATAATCAT CTACTTAGGG TCAACCCCAC AGCATTCTTA TTTAATCAAA TCGCTAATCA AGGTATAAAC TCCATTGAAA ATCGCGGTTC TTTAGGAGTT CCTAATAATA AGAGTTTAAT TCTACTTGGT GGTAATATTG CACCCACATC TCATGCCACT GGGGGAATTT TGCTGGATAA TGGAAGACTA GGAGCATTAG ATGGGAGAGT CGAAATTGGC GGATTAACTG CACCTGGCTC CGTAGGAATA AATGTTAAGG GTAATAATTT AAGTTTAAGC TTTCCGCAGG GAGTAGCTAG AACAGATATT GCATTAATAA ATAATACTGG AGTTCTTGTT GCTGGTGTTC GTGGTGGAGA TGTTTTAGTC AACGCTAATA ATTTAAATTT ACTAAATTCT ACTAATATTT TTACTGGCAT ATTTGCTAAT CAAGGAAAAC CTGATAGTAA AACTGGGGAT ATATCCATTA ATGCAACGGG AATTGTCAAC CTAACTGAAG GTAGTGGTAT TGGAAATTTA TCACTAGGGA TAGGAGATAG CGGCAATATC AATATTGTTG CTCAATCACT AAATATGAAG AATGGTAGCA GCATTCAAGC GATTACTGCA CAGGGTGAAT CAGGAATTGT TAATATTAAA GTTACTGATA CACTTTCCCT AAACCAGGGA CTCCTTGCTA ATGTAATTTT AAGTGATAGG ATTAATATCC AAGCACAAAA AGTTTCCCTT GCAAATGGCG CTTTAATCCG TGCTTTTAAT TCATCAGGAA ATGATTCTGT AGGTATTAAA ATTCAAGCTT TTGATTCCCT TGAATTGAAC AATGCTAGCA TTCTTACTGG TACTTTTGAA AACGGTAATT CAGGTGATTT ATCAATTATT ACCAATCGCT TAAATCTAAA AAATGATTCT CAAATAGCAG CTAATACATT TGGCAGTGGC AATGCTGGCC AAATTAATAT CAATGCTGCT GATATCGCTA TAGACTTATC TTCAATTAGT GCGGCTTCAT TATCTTTCCC AGGAGTTATT ACTAATATAG GTAATGCTGG GACTATCAAT ATCCAAACGG AAAGAATTAG TCTCACAAAT CGTGGTTTCA TATCATCTAG TTCTGACGAA CCAGAGCAGG GAACAACAAT CGGTTATGGC GGAAATATTA ATATCACAGC TAATGAATTA ATAGAAATAG ATGCAAAAGG TGCAACAGGT ATAATTACGG GTTTAGTTGC CAGAACTTTT AGTGGCAGCA GTGCTGGTGA CATAAACCTG AATACCAGAA ACCTGATTGT TAGAGATGGT GGTTATATTA GCGTAGAAGC TGCTAATAAT 
TTAGGGGGAA ATGCTGGTAA TATTACTATC AACGCATCAG ATGCTGTTGA GTTAACTAGC TCCAACGGCA ACCGCCCTAG TATAATTTTC ACTGGTGTTT CAGCTTTTAA TGATAATGCG ATCGCTGTAG GTAACGGTGG TGACTTAAAA ATTATTACTC AAAATTTACA GCTAACAAAC TCCACTATAT CAGCCGGGAC TTTCGGTCAA GGTAATGCAG GTAATATCAA TATCTTCAGC AATGATGGGG TGGTAATCGA TAATAGTGTG ATCAATAGCC GAATTGGCGC TGGTGCAGTT GGGAATGGTG GTGATATTGA CATCCAGACA CAGAGGCTAA CTTTAACTAA TGGTGGTGGG ATTGATACTA TAGTTGCCCC ACCAGAAGGA AATTTACCTG GCGGACTAGG TAAAGGTGGA ACTATTCGTA TCAATGCTAC AGATGCAGTC ACTATTTCAG GAACTAATGC AGATGGTTTT GAGAGTGCAA TCTTGACGGA AACCCAAAGC GGTGCATTTG GTCAATCTGG CGATATCATC ATTAATACAA ATTATTTCCG TGTCTTAGAC AACGGTAGTG TATCGACATT AACAGGAAAT AGTAGTGATG CAGGTAATAT TTTGATCAAT ACCCGCGTAA TGGAGGTGTT GAATGGTGGA GTAATCGTGA CAGGCACTGA GGCTAGTGGT AAAGCCGGTG ACATTACCAT TAATGCTACA GATCAAATTA ACATCTTGAG CAACACCGAC TTAAATGGTA ATACTGGTTT GGTTGCTAGC ACTACTTCTA GTGGAGATGC GGGGAATATA TCCCTATTTA CAACTGACCT TAACTTATTA GCTCCTAATT CTAACTTGCC TGCTCCCGTC CTTGTGGCCA CATTAAGTCG AGGAAGCGGG ATTGCCGGGA ATATTAATAT TGTTGCCACA GGCAATTATA ATGCCAATAA TGGCTTAATC AGCGCACGAT CTGAAAGAGC CGGTGGTGGA AATATCAGTG TCAATGCTAG AAATATCAAC CTACGGAATA ACAGCGACAT TCGTACAGAT TTATCTAGTG GTAACGGTAG AGGTGGGGGA ATTTCCCTCA CCGCAGATAT TATCATTTCC TTGGAGGATA GCGATATTCT TGCCTTTGCA CCAGAGGGAC AAGGCGGAGA TATTAAATTT AACACCCGCG CTGTATTTAG CAATGTTCTC TACACGGCTA GACAAACAAT TCCTGACAGA AATAGTCTTC AGTCACTAGT TAATAATACT CGCCCTGACA TTAACGCTAC AGGGACAATC TCTGGCAACA TTATTGGTGT ACCTGATATT AGCTCTATCC AAAACGGACT CACAGATTTA CAAGCTAATC CTATTGATAC TACCGTACTG ATTGCTAATA GTTGCATTGC TCGCAGTCCT AGACAAGAAG GAACTTTTAT TATTACCGGG ACTGGTGGTT TACCAACTCG TCCTGGTGAG CTTAAGGCTT CTAGTTATGC CACAGGAGAT GTGCAGAGTG TCAGTAATAA TATTGTAGCT AGTGCATGGA AGAAAGGCGA CGCTATTATT GAACCACAAG GAGTATATCG ACTGGCGAAC GGGGAGATGG TGATGAGTCG TGAGTGTGGT TGA
|
Protein sequence | MIKRKNLNWV INLGLLFSLV VATNNRTSAQ VTADQTLETQ VIDIGFNSFI LGGTTVGNTN LFHSFASFNV PSNGAAIFIN DPSLTNIFAR VTGGTASDVQ GRIGTQGTAN LYLINPNGII FGTNASLNIG GSFVATTANA IQFPGGAEFS LNSSVSPDNH LLRVNPTAFL FNQIANQGIN SIENRGSLGV PNNKSLILLG GNIAPTSHAT GGILLDNGRL GALDGRVEIG GLTAPGSVGI NVKGNNLSLS FPQGVARTDI ALINNTGVLV AGVRGGDVLV NANNLNLLNS TNIFTGIFAN QGKPDSKTGD ISINATGIVN LTEGSGIGNL SLGIGDSGNI NIVAQSLNMK NGSSIQAITA QGESGIVNIK VTDTLSLNQG LLANVILSDR INIQAQKVSL ANGALIRAFN SSGNDSVGIK IQAFDSLELN NASILTGTFE NGNSGDLSII TNRLNLKNDS QIAANTFGSG NAGQININAA DIAIDLSSIS AASLSFPGVI TNIGNAGTIN IQTERISLTN RGFISSSSDE PEQGTTIGYG GNINITANEL IEIDAKGATG IITGLVARTF SGSSAGDINL NTRNLIVRDG GYISVEAANN LGGNAGNITI NASDAVELTS SNGNRPSIIF TGVSAFNDNA IAVGNGGDLK IITQNLQLTN STISAGTFGQ GNAGNINIFS NDGVVIDNSV INSRIGAGAV GNGGDIDIQT QRLTLTNGGG IDTIVAPPEG NLPGGLGKGG TIRINATDAV TISGTNADGF ESAILTETQS GAFGQSGDII INTNYFRVLD NGSVSTLTGN SSDAGNILIN TRVMEVLNGG VIVTGTEASG KAGDITINAT DQINILSNTD LNGNTGLVAS TTSSGDAGNI SLFTTDLNLL APNSNLPAPV LVATLSRGSG IAGNINIVAT GNYNANNGLI SARSERAGGG NISVNARNIN LRNNSDIRTD LSSGNGRGGG ISLTADIIIS LEDSDILAFA PEGQGGDIKF NTRAVFSNVL YTARQTIPDR NSLQSLVNNT RPDINATGTI SGNIIGVPDI SSIQNGLTDL QANPIDTTVL IANSCIARSP RQEGTFIITG TGGLPTRPGE LKASSYATGD VQSVSNNIVA SAWKKGDAII EPQGVYRLAN GEMVMSRECG
|
| |
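The sequences above are printed in space-separated blocks of 10 characters, so they need to be cleaned before they can be compared with the Gene Length and GC content fields. The following is a minimal sketch of that cleanup, assuming the sequence has been copied into a Python string. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the short example string is just the first two blocks of the gene sequence.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First two 10-base blocks of the Ava_2811 gene sequence.
example = "ATGATTAAAC GAAAAAATCT"
seq = clean_sequence(example)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.2
```

Applied to the full gene sequence, `len(clean_sequence(...))` should agree with the Gene Length field if the sequence was copied completely, and `gc_fraction` can be compared with the reported GC content.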