Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_2353 |
Symbol | |
ID | 3683410 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 2924091 |
End bp | 2926565 |
Gene Length | 2475 bp |
Protein Length | 824 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 637717698 |
Product | filamentous haemagglutinin-like protein |
Protein accession | YP_322866 |
Protein GI | 75908570 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3210] Large exoproteins involved in heme utilization or adhesion |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTAA CTTTTGTTGG ACTTGGTGTG CTGAGTGTAA TCTGCTTATC TGCTGTTGAC AACAATAGTG TTCACGCCCA AGTAATTCAA GATAATACCT TCAATACTTC TGTTACTCCC ACTACTTCCA TGAACGGAAG TAATGCTTAC ACCATCAACA ACGGCACTCG TGTCGATAAT AATTTATTTC ATAGCTTTAG CCAATTCTCT ATTCCCACGG GCGATTCTGC ATTTTTCGGC AATGATCTTA GTATAGAAAA TATTTTTAGC CGGGTGACTG GTGGTAATAT TTCCAAGATT GATGGTTCTA TCAATGCCAA TGGTAAAGCC AATTTATTCT TACTCAACCC TGCGGGCATT ATTTTTGGAA AAAATGCCAG CTTAAATATT GGTGGTTCAT TTGTCGCAAC TACAGCTAAT AGTATTAAGT TTGAGGATGG GACAGAATTT AGTGCTGTAA ATCCTGCCGC TAAAGCGTTA TTAACAATGA GTGTACCTGT GGGTTTGCAA ATGGGAAGTA ACCCTGGTCA AATTACCGTC CAGAACACAG GGCATCAGTT GGCATTTCCG ATTCACCCTT TGGTTTCTTC TCCAAATCGC AGCAACAATC CTGTGGGATT AAATGTTAAT AATGGTAATT TGGCATTAAT CGGGGGAAAA ATCACCTTAG ATGGAGGAGT TTTAAATGCA CCATCAGGAC ATATTGAACT TGGTAGTGCT AGTAACGGAA TAGTTAATTT AAACACTGCC TCTCAGAGTT GGAATTTTGA CTATAGCAAT ATTCAGCAAT TTGGCGATAT TCAGTTGTCC CGTCAATCAC TAGCAGATGC TAGCGGTACT CCTGCGGGTT CTATTCAATT CGTAGGTCAA AATATTAGTT TAAATGATGC TTCTGCTGCC CTATTGGTTA ATGAAGGAAA TGGCAATTCA GGGAATATTA ACATTGATGC GAGTGGATCT TTAACACTAC GAGGAACTGG CACTATAGGA TTTCCTCAAA GTTTATTGCG TGCTGATAAT TTCAGCGATG GTAGTGGAGG TAATATCATT GCTTCGGCTT CCCAAGTATT CCTTCAAGAT GGCGGATCTC TTCATGGCAT TAACTTTGCT GAAGGTACAG GAAGTAATAT TTTTGTAGAA ACTCAGGATT TAATTCAAAT AACGGGAATA TCACCAGTTA GCGGATTTGC CAGTTCATTG AATACTATCA CCAGAGGTTC TGGAAAAAGT GGCGATATTC AAGTAGCTAC CAACAAATTG CAAGTTTTAG ATGGTGCTGT TATTGGTAAT TCGCCTTGGT CAAATGGTGC AGGAGGTAAC ATCACCATTA ACGCCTCTGA TTTTATCGAA GTGGCCGGAG AAAACACAAA AAACTTGGCT GACAGTGTCA TCGTAGTAGG AACATTCGAC GAAGGAAATT CAGGTTTATT AACTATTAAT ACTGCCCAAT TGAGAGTACG AGATGGTGGA GGTATAGTTG TCTCTACCGT CAATCAAGGA AATGCTGGTG ATTTAGTGAT TAATGCTTCA GACACAGTTG AGGTTAGTGG AGTGGGTAGT ATTTCTGGTC TTCCTAGCCG CATTGGCGTG AGGGCAGAAT TACTTGCACC ACCAATTCGA CAATTATTTG GATTACCGGA TGTGATTACA GGTAATATAG GTAAGCTTAC CCTCAACGCC CAACGTTTAC AAGTCACAAA TCAGGCAATT GTAGGGGTTG ATCATCAAGG CATCGGAGAT GCTGGACAAC TGGAAGTTAA TGCAGATTCC ATCCGACTAG ATAATGGTGG TAGCATTACA GCCGCTACAG TTCAAGGTGA AGGCGGTAAT ATTTTGATTA ATTCCAATGA CCTGCTATTG CGTCATGGTA GTTCAATTAT GACTAATGCA GACGGTATTG GTAACGGGGG GAATATGACC ATTAACTCTG CTGTGATTCT TGCTTTGGAA AACAGTGATA TTGTCGCCAA TGCCGTTCAA GGTAACGGAG GTAACATTAA CATTACTTCT CAAAGTATTT TTGGGTTGAA ATACCGTAAC GAACTCACTA CAGAAAGTGA CATCACTGCT AGTTCGCAAT TTGGTTTAAA CGGAACAGTC AATATTTATC ACTTTGGTGT TGATCCCAAA ACTGCTTTAG TAGAATTGCC AAAAAATACT ATAGATCCGT CAAAACAAAT TGCTAGTGGT TGTAATGCTA ATACTGGTAG TAGTTTTGTC GCCACAGGAC GGGGTGGAAT ACCACAAAAT CCCACACAGG AAATTAAGAG CGATCGCACT TGGTCTGATA TTCGTGACAT ATCTGCATTC CACACCAAAC AACAAGCACA AGCACCAAAA AATCCCGTAA CACTTGTCCA AGCTACCTCT TGGCGACGTA ACGCCAACGG CAAAATTGAG CTTTTTGCCG CTAAATCTCC TACAGGTGTG CAAATGTCAT TAACCTGTGC ATCTCTTGCC AAAAGTCAAC CTTAG
|
Protein sequence | MKLTFVGLGV LSVICLSAVD NNSVHAQVIQ DNTFNTSVTP TTSMNGSNAY TINNGTRVDN NLFHSFSQFS IPTGDSAFFG NDLSIENIFS RVTGGNISKI DGSINANGKA NLFLLNPAGI IFGKNASLNI GGSFVATTAN SIKFEDGTEF SAVNPAAKAL LTMSVPVGLQ MGSNPGQITV QNTGHQLAFP IHPLVSSPNR SNNPVGLNVN NGNLALIGGK ITLDGGVLNA PSGHIELGSA SNGIVNLNTA SQSWNFDYSN IQQFGDIQLS RQSLADASGT PAGSIQFVGQ NISLNDASAA LLVNEGNGNS GNINIDASGS LTLRGTGTIG FPQSLLRADN FSDGSGGNII ASASQVFLQD GGSLHGINFA EGTGSNIFVE TQDLIQITGI SPVSGFASSL NTITRGSGKS GDIQVATNKL QVLDGAVIGN SPWSNGAGGN ITINASDFIE VAGENTKNLA DSVIVVGTFD EGNSGLLTIN TAQLRVRDGG GIVVSTVNQG NAGDLVINAS DTVEVSGVGS ISGLPSRIGV RAELLAPPIR QLFGLPDVIT GNIGKLTLNA QRLQVTNQAI VGVDHQGIGD AGQLEVNADS IRLDNGGSIT AATVQGEGGN ILINSNDLLL RHGSSIMTNA DGIGNGGNMT INSAVILALE NSDIVANAVQ GNGGNINITS QSIFGLKYRN ELTTESDITA SSQFGLNGTV NIYHFGVDPK TALVELPKNT IDPSKQIASG CNANTGSSFV ATGRGGIPQN PTQEIKSDRT WSDIRDISAF HTKQQAQAPK NPVTLVQATS WRRNANGKIE LFAAKSPTGV QMSLTCASLA KSQP
|
| |