Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_0583 |
Symbol | |
ID | 3678613 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 735883 |
End bp | 739050 |
Gene Length | 3168 bp |
Protein Length | 1055 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 637715911 |
Product | putative Ig |
Protein accession | YP_321102 |
Protein GI | 75906806 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2931] RTX toxins and related Ca2+-binding proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTAACA CCGCGCCTCT ACTGAACAAT ACAGGTAATC TCACCTTATC TTTTATTACT GAAGATATTC CTGTTACTAG TAACACTGGT ACTCTAGTGT CTGAAATCAT CGGTAACTCT ATAAGTGACC CAGATGCTAG TGCGCTAAAA GGCATTGCCG TTACCTTCGT TGATAACAGT AACGGTGTTT GGGAATATAC CTTGAATAAT GGAACTAGTT GGAATGCCTT TGGAACACCA TTGCTAAATG CAGCACGGTT ACTACCATCA AATGCAAATA CTAAAATCCG CTTTCGCCCC AATGCTAACT TTAGTGGCTC TGCTGATATT AACTTTTACG CTTGGGATCA AACAACAGGT ACATCAGGTA ACACAGCGAA TATTTTGGCT GGTAAAGGTG GAACAACTGC CTTCAGTACG GACTATGAAG GAGCCTCCAT TACTGTTACC TCTGTTAACG ACACAGCACC AACCCTCAAC GCCGGGACTC TCACTCTAGC AACTATTAAT GAGGATACAC CTCTTGTGAG TAACCGGGGT AGTTTACTTG CTGACCTAGT GAGAGGATTA ATTAGCGATA GCGACGCAAA CCCCCAAGGA ATTGCTGTCA CAGGAGCAGA TAATAGCAAC GGTAGTTGGC AATATTCTCT TGATGGCGGT GCTAATTGGC TCAACTTTGG TGCTGTTTCT GATAGCTCAG CTACAGTACT ACTTCCTAGT ATCAGACTCT ATGATGGTTC ATTGATTGGC TCACCAACTT CTCAAGGCTG GCTCAAATTT GGTGCTTCGC CTGCTATTAC TCTTCCTATA CTAGGAAATG TGCTTGGGGT TGGTGGTACT CAGTCTCAGA TTAGTGGTGG AACTCAATTA AATAGTAATA CTACACCTGC TTCATCACTA GCTGGATTTA CAGCCTCAAC TGGGACATCC GGTTACAGTA ATTACAATGG TTACGCCCCA GTTCTATTTA ACCAGTCGTT TCCTGCACTT GACCCCGTAA AAGGCTTCAC TATTAGTTTT GATGTGAAAA TTAACAGTGA AACCCACACC AGTGATGATA ATGGTGATGG CATTCAAGAC CGCGCTGGCT TTAGCGTCAT TGTTGTTACT AGTGACAAAA CCAAAGCAAT TGAGTTAGGT TTCTGGACAG ATGAAATTTG GGCGCAGAAT GCTAGCCCGT TATTTACCCA CAGCACCACA GAACGGGCAT TTAGAAACAC AAGCACAGCA GTTACCCACT ATAATTTGGT AGTTGAAAAT AATACTTACA AGCTGTTTGC GCCGGATTCA TCTACGCCGA TTTTAAGTGG CAATCTCCGC GATTACACGG CATTCAACCA CACCACTGCT GCACCTTCCC CAATCAATTC TTTACCTTTT GACCCTTACG AAACACCCAA TTTCCTTTTC CTCGGTGACA ATACAACCTC CGCTAGGTCT TCTATTGACC TCACACGGGT GGAACTGCAA ACAAATACGA GAGTTCGCTT TGTCCCCAAT GCTGACTATA ATGGCCAAGC TAATCTCACC TTCCGCGCCT GGGATGGTTC TAATGGTGTA GCCAGTGGTA CTACAGGAGT AAACGCTTCA GTTAATGGCA ATGCCACAGC TTTTAGTAGC AATACTCAGA CTGTCGGTAT CACTATTAAT TCTGTTAATG ATGCACCGAT AGTAGCCAAT AGTATTTCTA ACCAAACAGC TATAACAGGC ACAGCATTTA ACTTTCAAAT CGCCGCTAAC ACATTTGCTG ATGCAGACTC AGGCGACACC CTCACGTATT CAGCTACCCG CAGCGATGGT AGCCCTCTAC CCAGTTGGTT ATCTTTTGAT GCCGATACGG GCAGATTTAC TGGTACTCCT ACCACGGATA ATTTAGGTAG CATTAGCCTG AGAGTTACTG CTACAGATAC AACCAACCTC AGTGTTAACA CTATATTTAA CCTAGAAATC AGACGGCCAG ACAATATCAT CAACGGCACC GCCAACAATA ATACAATCAT AGCTACCAGT GCCAAAGATA TATTTGATGG TGGAGATGGA GGTGATATCT TCATTACCAA CATCGCCAAC CTCAGCCAAA ATGACATTCT CAATGGTGGA AATGGACAAG ACACAATCAT TATTCAAGGT GGCATAAATA CGGACACCAT CAGCTTTGAC TTAAGTAATG TCAACAATCA ACTAGCCAGC ATTCTTGGCA CAACCATCAC CAATGTTGAA ACCTTTGACC TGAGAAGCTT TGCCGGCACA GTCACTTTTA CAGGTGGTAG TGGCAACGAT GTAGTCTATG GTGGCGTTGG TAACGATACC CTGACTGGTG GCGCTGGTGA TGATAACCTC AATGGAGGTG CTGGTGATGA TACCCTCATC GGTGGTGATG GTAATGATAT CCTCACAGGT GGTAGTGGTA CAAACACCCT GACTGGTGGT GCAGGTAATG ACCGTTACTA CATAGATAAC GCCAGCGATG TTATTCAAGA GGGGGCGGGT GCTGGTCAAG ATGAGGTTTT TGCCACAGTC AGCTACACCT TAGCTGCTAA TGTCGAGGCT CTGACCCTCA GAGGTACTGC CATACAAGGT ACGGGCAATA GCAGTAACAA TAACATTAGA GGTAATAATG CTAACAACAT TCTGTCTGGT GAAGATGGCA ATGACAATCT TACAGGTAAT GCTGGCAATG ATGTATTGAT TGGTGGTCGT GGTAATGATA CCCTCAATGG CGGTATTGGC AATGATGAGT TGATTGGTGG CGCTGGTAGC GATCGCTTAT TCGGTGGTGC TGGTGCTGAT TACTTCAGTT TTGGTAGTCA GGGTAATCCC TTCAACAGTG GTGATTTTGG CATAGATACC ATTGCTGATT TTGCAGTTGG CGTGGATGAC ATTAAGTTAG ATAAGGTCAG CTTCTCTGCT CTAACTAGTG TGGTTGGCAA TGGTTTTAGT GTAAGTAGTG AGTTTGCTAG TGTAAGTAAC GATACCTTAG CGGCAACTAG CAATGGGTTG ATTGTTTACA GTTTAGGTAG TGGTCGCTTG TTCTATAACC AAAATGGCAG CGCTGCTGGT TTTGGCACAG GCGCGCAGTT TGCGACTCTC TCCGGTACTC CTATTCTGAG TGCTGATGAT TTCTTCATTT TCCAGTCTGT TCAGGAACCT GCTAAAAAAA TAGCGTAA
|
Protein sequence | MANTAPLLNN TGNLTLSFIT EDIPVTSNTG TLVSEIIGNS ISDPDASALK GIAVTFVDNS NGVWEYTLNN GTSWNAFGTP LLNAARLLPS NANTKIRFRP NANFSGSADI NFYAWDQTTG TSGNTANILA GKGGTTAFST DYEGASITVT SVNDTAPTLN AGTLTLATIN EDTPLVSNRG SLLADLVRGL ISDSDANPQG IAVTGADNSN GSWQYSLDGG ANWLNFGAVS DSSATVLLPS IRLYDGSLIG SPTSQGWLKF GASPAITLPI LGNVLGVGGT QSQISGGTQL NSNTTPASSL AGFTASTGTS GYSNYNGYAP VLFNQSFPAL DPVKGFTISF DVKINSETHT SDDNGDGIQD RAGFSVIVVT SDKTKAIELG FWTDEIWAQN ASPLFTHSTT ERAFRNTSTA VTHYNLVVEN NTYKLFAPDS STPILSGNLR DYTAFNHTTA APSPINSLPF DPYETPNFLF LGDNTTSARS SIDLTRVELQ TNTRVRFVPN ADYNGQANLT FRAWDGSNGV ASGTTGVNAS VNGNATAFSS NTQTVGITIN SVNDAPIVAN SISNQTAITG TAFNFQIAAN TFADADSGDT LTYSATRSDG SPLPSWLSFD ADTGRFTGTP TTDNLGSISL RVTATDTTNL SVNTIFNLEI RRPDNIINGT ANNNTIIATS AKDIFDGGDG GDIFITNIAN LSQNDILNGG NGQDTIIIQG GINTDTISFD LSNVNNQLAS ILGTTITNVE TFDLRSFAGT VTFTGGSGND VVYGGVGNDT LTGGAGDDNL NGGAGDDTLI GGDGNDILTG GSGTNTLTGG AGNDRYYIDN ASDVIQEGAG AGQDEVFATV SYTLAANVEA LTLRGTAIQG TGNSSNNNIR GNNANNILSG EDGNDNLTGN AGNDVLIGGR GNDTLNGGIG NDELIGGAGS DRLFGGAGAD YFSFGSQGNP FNSGDFGIDT IADFAVGVDD IKLDKVSFSA LTSVVGNGFS VSSEFASVSN DTLAATSNGL IVYSLGSGRL FYNQNGSAAG FGTGAQFATL SGTPILSADD FFIFQSVQEP AKKIA
|
| |