Gene Information |
Locus tag | Ava_1841 |
Symbol | |
ID | 3681832 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 2289469 |
End bp | 2294454 |
Gene Length | 4986 bp |
Protein Length | 1661 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 637717181 |
Product | WD-40 repeat-containing protein |
Protein accession | YP_322358 |
Protein GI | 75908062 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | [TIGR01543] phage prohead protease, HK97 family |
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTAACT GTGAACTTAA AACGGCGATC GCCAATAATA ATGAGCATTC CTTACAAAGG TTAATTCGAG CAATTAATTT ATCTAAGGGA GAATTTTCTC TCATTTTAGT TCGTTGTAAC TACCAACAGT TACGAGAAGA AATGCGGGAT AATCTTCGAG ATTTAAGTAA AGATATCAAT ATTAGAGAAA TATATGTTCA ATCATCGATT AGTGCTTTAC ACACTACAAT CACATCGCAA CTATTTTTAG ATAATCCCAG TGTTGCTAGC GATTGTTTAC CATCAGCCTT AATGGTTTTT GGTTTAGAAT CAGTAATTGC TCTTGAAGAT TTATTAACTG GAATAAATCA GTCACGGGAT ATTTACGCAG CTAGTTTTCC TTTCCCTTTA ATCTTGTGGT TACAAGATGA AGTGGCATCT TCACTTTCTC GGTTAGCTCC TGACTTTAAA AGCTGGGCTG CAACCACTAT TAAATTCGAG ATGCCCCAGG AAGATTTAAT TGCTTTAATT ACAAAACAGA CAGAATCTCT ATTTTCTAAA GTTTTAGAAG TAGGCGCAGA AAAGTTTATC TCTAATGATG CCCTTGATTT AGCACCGAAA TCCCAACATC GTCACGAGAT TGAATCAGCC CGTAATGATT TACTGCGTTC TTACAATATC AAATTAGAAC CAGGATTAGA GGCTAGTCTG GAGTTTGTTT TAGGCAGAGA TAAGTATGCT AATGATCAAA TTAATAGTGC TTTAGCCCAT TATCAAAGAA GCCTATCTCT ATGGCAGCAA GAAGAAAAAA AGGAAAGTAT AGAAAAGAAA TATTTAGAAC TGAACTACCA ACAAAGACTT TGGCAAGCAA TTATCTTATT CCACTTAGGT TTATGTTATC ACCGGATGGC AGATTTACAT CAGAGTGCTA ATAGTGGCTA TTGGCGACAC GCACTATTGT GGTTTCAGCA ATGCTTAGAT GTGTTGGAAG AGAAGAAAAG ACAAGATTTA GTAGCTAAAT TTATTTTACC TGCGTGTGAG ATGCTTTACC GCTTACAGCT TTGGGAAGAT TTAAAGAAAT TAGCTCAAAA GTCTTTATAT CTACATAAGA GTTATGGCAA TCAAGCCCAA ATAGCACAAG ATTATGGCTT TTTAGCAGCC GTAGCAGGTG CAAATAACGA TTGGATACTA GCTCATGAAT TAGCTAATAC TGCATTAGAT ATTCTGGAGA AAGTAACAGG AACTCGACAG CATGAGAGTT GGTATCTTTT GTTGTTAGCC CGTTCACAAA GGCATTTAGG AGAATATGAA GAATCTATCA ATAACTTAGA ATGGGCAAGG GTTGTTTGTG AGCTACAATA TGAGCCTTCT TTGTACTTGG AAATTTTAGA GGAATTGCGA TCGCTCTATT TTATTGAGCG ACATGAATAT TTAGAAGCCT TTAAACTTAA GCAAGAAAAA ATCCAAATTG AACATCAGTA TGGTTTACGT GCTTTTATTG GTGCAAGTCA ATTACAGGCA CAACGTTACA AAATTAATTC AGTCTTAGAA CCCCAAAGTA TCCCCTTTAT TCCAGAGGAA GTTGCCCAAG AAATATCTGC TTCTGGAAGA CAACAAGATG TCAATCGTTT AATTGAAAGA ATTACTCGTG CTGACTATAA ACTCACAGTA GTTCATGGTC CATCAGGAGT TGGTAAAAGT TCACTGCTCA AAGCTGGTTT ATTACCAGCT GTAAAGAATA AAGTGATTGG TGAGCGGATA CCTCTACCTA TTGTGTTATC TGCTTATACT GATTGGACAA CCAGTTTGGG GCGAGGTTTT 
TATAATGTGT TAGCACCATT AGATATCTCA AGTTCCCTGG AATTTACTCC CGCTATTCTG CTAGAAAAGT TCCGCTCTGC AACTGCACGT AATCATACAA TTATTATTAT TTTTGATCAA TTTGAAGAGT TCTTTTTCGT AAGTAGCTTT CAGCAAAAGT TAGAATTCTA TCAGTTTTTT AGCGAATGTT TGAATATTGC ATATTTAAAA ATTATTATTT CCATCAGAGA AGATTATCTC CATTACTTAT TAGATTTTGA GCGCTGGGAT AAAACACAGC CAGATAGTTT CCATGATTTA GATGTCATTA ACAAAAATAT TTTAGATAAA GATATCCGCT ACTTTCTGGG GAAATTTTCC CGTGAAGATG CCGTTGCTAT TATTCGTTAC TTGACCCAAA CATCTCAATA CGAAGTCAAG GATGATTTAA TTAATGAACT AGTGCAGGAT TTAGCAGGAG AAACCGAAGA AGTCCACCCA ATTGAATTAC AAATAGTAGG CGCGCAGCTA CAAGCAGAAA ATATCACGAA TCTCGAAAAA TACAAACTTT GTGGAGGTAC GAAAAAACTT ATAGAACGCT GGCTAGAGGA AGTTATTCGA GATTGTGGTC AAGAACATGA AGATTTTAGC TGGAAACTAT TATATGAATT AACTGACGAA AAAGGCACAC GTCCCCTCAA AACTAAATCT GATTTAATGC TGGCTTTAGA AAAATATCTT GATAATCAGT CAGATTTTGA TTCACGCTGG GAGTTGATTT TAGAGATTTT GGTTGGTTCA GGTTTAATAT TGTGCTGGCG AGAAGAGTTA GGCGAACGCT ACCAGTTAGT CCATGATTAT TTAGTTGAAC CAATTCGCCA AAGAAATGAT TATGGCATCA TTGCCGAATT AGAAAAAATC AAATCGGAAA AAACTCAAGC AGAGGTAGCC CGAAAATTAT CTCAAGAGCA GCTAAATTCA GTTTTGCGGC GTAGATTGCG AGAAGCACGA GCAGCAGGGG TACTGCTGGC AGTTATGGGC GGCACGATCG CTTCTTTGTG GTGGCAAGCC GATATGCAGA AAAGAACCGC AGAGCTACAA ACCATCCGTG CTGAAACCAG CGAGACCAAT TTGCAAATTA GTGCGATCGC CGCATCTAGC GAAGCTCTAT TTTCCTCCAA CCAAGAATTT GATGCTTTAT TAGAAAGTTT GAGAGCCTGG CAAAAACTTA AACAAGCCAA GGAGGTGCGA CCAGAAACCC GAATGCGCGT GGTTACAGCC TTACAACAGG CAGTTTATGG AGTTACGGAG TTAAATCGCT TAGAGGGGCA TAGTGATATT GTTTGGGGTG TAGCTTTTAG TCCTGATGGT CAATTATTAG CTTCCGGTAG CACAGACAGA ACAATCAAAC TTTGGCGACC TGACGGGACA CTACTCCAAA CCCTTGAAGG TCATACTAGC GCGGTTACCA GTGTCAGTTT TAGTCCTGAT GGTCAGACTA TAGCTTCAAC TAGTCTCGAC CAAACAGTAC GAATTTGGCG GAAAAACCCA ACAACTGGTG AATTCGCCCC AGAGCCGGCC CAAAGCCTCA GAAAACACAA AGATTGGGTT TATAGCGCCA ATTTTAGCCC TGATGGAGAA TTGTTAGCTA CTGCCAGCAG GGATAGAACT ATTAAAATTT GGGATCGGGA TGGTAACTTA ATCAAAACTC TTAAGGGTCA TCAAGGCTCT GTCAACTGGG TCAGTTTCAG TCCAGACAGT CAATTTATTG CTTCAGCTAG TGAGGATAAG ACCGTCAAAA TTTGGCGCAG AGACGGTAGC TTGGTGAAAA 
CTTTATCTGC ACATCAGGAA GGTGTAACTG TCGTCACTTT TAGCCCTGAT GGTAAGCTAC TAGCTTCAGC CGACCGAGAC AATGTTATTC AACTATGGCA ATGGGATAGT AGTAATCACA ATAACCCAGA AGTTGATATC TATAAAACCT TAAAGCAACA TACCAGCACG GTATGGAGTC TCAGTTTCAG TTCTGACAGT AAACAGTTAG CCTCAGCCAG TGATGACAAT ACTATCAACC TCTGGAGTCA TACAGGCAAC TTAATTAAGA CTTTTAAGGG ACACAGTGAC GCTGTTGTCA GTGTAGCTTT CAGTCCAGAT ACTAAAATTC TGGCCTCTGG AAGTTATGAC AAGAGCGTGA AATTGTGGAG TTTAGAAGCG CCTAGACTAC CGATTCTGCG GGGACATGAG GATCGAGTTT TGAGTGTTGC TTGGAGTCCT GATGGTCAGG TATTGGCTTC TAGTAGTCGC GATCGCACTG TAAAACTATG GCGAAGACAG TTGAATAAGG GCAGACTTGA TGCTCATCTT TACAAAACCT TAGTGGGTCA CACCCAGATG GTTCATAGCG TCAGTATTGA CCCCAAAGGT GAAATTTTAG CTTCTGCTAG TGAAGACAAA ACCGTTAAAC TTTGGCGACT GGATGGAACT CTCCTGAAAA CTCTCTCTGG TCATAGTGAT AGCGTAGTCA GTGTTAGTTT CAGCCCTGAT GGTCATTTAT TAGCATCAGC TAGTAGAGAT CACACAATTA AACTGTGGAA TCGTGATGGA AGTTTGCTCA AAACTTTAGT GGGGCATGAA GCGCGAGTCA ATAGCGTCAG TTTTAGCCCT GATGGAGAGG TTTTAGCTTC TGCTAGTGAC GATAAAACAA TCAAGCTTTG GCGGCCAGAT GGGAGTTTAA TCAAAACCTT TGACCCTCAC GATAGCTGGG TATTAGGTGT CAGTTTTAGC CCGACTGAAA AATTATTGGC TTCTGCTGGT TGGGACAACA CAGTCAGACT ATGGCGACAA GATGGGACTT TGTTACAAAC CTTATTAAGA GGATTTAGCG ATAGCGTTAA TGCTGTGAGT TTTAGTCCCA CTGGTGAAAT ACTTGCTGCT GCTAATTGGG ATAGTACAGT CAAATTGTGG AGCCGTGAGG GCAAATTGAT TAAAACCCTC AACGGACACG AAGCTCCTGT GTTGAGCGTC AGTTTTAGCC CCGACGGACA AACGTTAGCT TCAGCTAGCG ATGACAACAC AATTATTTTA TGGAACTTAC ACCTTGATGA CCTGCTGACT CGTGGTTGTG GCTGGGTCAA TAATTATCTG AAGCACAACA ACAATGTGGA TGAGCGCGAT CGCTTGCTTT GTGATGATGT CAGCGATCGT ATCTAA
|
Protein sequence | MTNCELKTAI ANNNEHSLQR LIRAINLSKG EFSLILVRCN YQQLREEMRD NLRDLSKDIN IREIYVQSSI SALHTTITSQ LFLDNPSVAS DCLPSALMVF GLESVIALED LLTGINQSRD IYAASFPFPL ILWLQDEVAS SLSRLAPDFK SWAATTIKFE MPQEDLIALI TKQTESLFSK VLEVGAEKFI SNDALDLAPK SQHRHEIESA RNDLLRSYNI KLEPGLEASL EFVLGRDKYA NDQINSALAH YQRSLSLWQQ EEKKESIEKK YLELNYQQRL WQAIILFHLG LCYHRMADLH QSANSGYWRH ALLWFQQCLD VLEEKKRQDL VAKFILPACE MLYRLQLWED LKKLAQKSLY LHKSYGNQAQ IAQDYGFLAA VAGANNDWIL AHELANTALD ILEKVTGTRQ HESWYLLLLA RSQRHLGEYE ESINNLEWAR VVCELQYEPS LYLEILEELR SLYFIERHEY LEAFKLKQEK IQIEHQYGLR AFIGASQLQA QRYKINSVLE PQSIPFIPEE VAQEISASGR QQDVNRLIER ITRADYKLTV VHGPSGVGKS SLLKAGLLPA VKNKVIGERI PLPIVLSAYT DWTTSLGRGF YNVLAPLDIS SSLEFTPAIL LEKFRSATAR NHTIIIIFDQ FEEFFFVSSF QQKLEFYQFF SECLNIAYLK IIISIREDYL HYLLDFERWD KTQPDSFHDL DVINKNILDK DIRYFLGKFS REDAVAIIRY LTQTSQYEVK DDLINELVQD LAGETEEVHP IELQIVGAQL QAENITNLEK YKLCGGTKKL IERWLEEVIR DCGQEHEDFS WKLLYELTDE KGTRPLKTKS DLMLALEKYL DNQSDFDSRW ELILEILVGS GLILCWREEL GERYQLVHDY LVEPIRQRND YGIIAELEKI KSEKTQAEVA RKLSQEQLNS VLRRRLREAR AAGVLLAVMG GTIASLWWQA DMQKRTAELQ TIRAETSETN LQISAIAASS EALFSSNQEF DALLESLRAW QKLKQAKEVR PETRMRVVTA LQQAVYGVTE LNRLEGHSDI VWGVAFSPDG QLLASGSTDR TIKLWRPDGT LLQTLEGHTS AVTSVSFSPD GQTIASTSLD QTVRIWRKNP TTGEFAPEPA QSLRKHKDWV YSANFSPDGE LLATASRDRT IKIWDRDGNL IKTLKGHQGS VNWVSFSPDS QFIASASEDK TVKIWRRDGS LVKTLSAHQE GVTVVTFSPD GKLLASADRD NVIQLWQWDS SNHNNPEVDI YKTLKQHTST VWSLSFSSDS KQLASASDDN TINLWSHTGN LIKTFKGHSD AVVSVAFSPD TKILASGSYD KSVKLWSLEA PRLPILRGHE DRVLSVAWSP DGQVLASSSR DRTVKLWRRQ LNKGRLDAHL YKTLVGHTQM VHSVSIDPKG EILASASEDK TVKLWRLDGT LLKTLSGHSD SVVSVSFSPD GHLLASASRD HTIKLWNRDG SLLKTLVGHE ARVNSVSFSP DGEVLASASD DKTIKLWRPD GSLIKTFDPH DSWVLGVSFS PTEKLLASAG WDNTVRLWRQ DGTLLQTLLR GFSDSVNAVS FSPTGEILAA ANWDSTVKLW SREGKLIKTL NGHEAPVLSV SFSPDGQTLA SASDDNTIIL WNLHLDDLLT RGCGWVNNYL KHNNNVDERD RLLCDDVSDR I
|
| |
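The Start bp, End bp, Gene Length, and Protein Length fields in the Gene Information table can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes the coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence above ends in TAA). The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information table for locus Ava_1841.
START_BP = 2289469
END_BP = 2294454
GENE_LENGTH_BP = 4986
PROTEIN_LENGTH_AA = 1661


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_length_bp // 3 - 1


computed_gene = gene_length(START_BP, END_BP)
computed_protein = protein_length(computed_gene)

print("gene length matches:", computed_gene == GENE_LENGTH_BP)
print("protein length matches:", computed_protein == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold for the values listed in this record: 4986 bp corresponds to 1662 codons, of which 1661 encode residues.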