Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_2971 |
Symbol | |
ID | 3681264 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 3682727 |
End bp | 3686830 |
Gene Length | 4104 bp |
Protein Length | 1367 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637718318 |
Product | WD-40 repeat-containing protein |
Protein accession | YP_323477 |
Protein GI | 75909181 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGTATC AGTATCAAGT TGGCGGTAGC CTTCCCGCAG ATGCGCCAAC TTATGTCAAA CGACAAGCTG ATGAAGACTT ATATACAGGC TTGAAGGCTG GACAATTTTG TTATGTCCTC AACTCCCGCC AAATGGGGAA ATCTAGCTTG CGGGTGCAGG TGATGGGGCG ATTGCAGGCG GAGGGGTTTG CTTGTGCGGC TGTGGATATT ACCGCTATTG GGACGGCTGA GATTACGCCG GAACAATGGT ATGCAGGGGT GATTGATACC TTGGTAGGAT ACTTTAATTT ATATACTGAT TTCGATTTAG AAACTTGGTG GAATAACAAC GGTTTACTTT CACCTGTGCA GCGTTTTAGT AAGTTTATTG AAACGGTTTT ATTGCCAAGA ATTACCGAAA ATATAGTTAT TTTTATTGAT GAAATTGATA GTGTTTTAAG CCTTGATTTT AATCTCGATG ATTTTTTTGC CGTGATTCGG GATTGTTATA ACCGAAGGGC GGATCATCCT GAATATCATC GCATCACATT TGCCTTAATT GGGGTGTCTA CGCCCTCAGA CTTGATTCAA GATAAAGGAC GCACACCTTT TAATATTGGT AGGGCAATTG ATTTAACAGG TTTTGAGTTA GCCGAAGCTG AACCATTAGC CCAAGGTTTA GCCGCGTTAG GTAATCCCCA AGAAATCATA GCGGCGGTGT TGGCGTGGAC AGGAGGACAG CCATTTTTAA CCCAAAAAGT TTGTAATTTA TTAATCGCAG ATTACCTTGT AGAGACGCGA TTCATCGCGT CTGGTGAAAA CAAAGAGAAA TCATCAATTG AAAATTTAGT TGCAACGGTA GTCAAAAACC GGATTATTGA AAATTGGGAA GGACAAGACG AACCAGAACA TTTAAAGACA ATTCGTGACA GAATCATGCG GAGTGGGGAA CAGCGCACCG GGCGGCTGTT GGGGTTATAT CAGCAGATTT TGCAGCAAAG TGAGCTTGTT GCTGATGATA GTTATGACCA GATGATGCTG CGGTTGACTG GGTTAGTAGT GCGGCGGGAT GGCAAATTGC GAATTTATAA CCGCATCTAT GCAGAGGTGT TTCAGCAAGA GTGGTGTGAG AATATTTTGG CTGGGTTGCG TCCATACTCT GATACGTTTA ATGCTTGGGT GGCTTCTAAT TATCAAGATG AATCGCGCCT GTTACGTGGA CAAGCATTAC AAGAGGCTTT GGCTTGGCGC ATTGGGAAAA ATTTAACTGA TGTTGATGAC CGATTTTTAG ATGCTAGTCA GGAATTACAA AAGCGAGAAG TTGAAAAAAG TTTAGCACTG GCACGCAAAG AACAAGAAAT TTTAACAGCA GCTAATAAAA AAGCTCGGCA AAGAGTTTTT ATTGCTTCGG TGCTGTTGAT TATTTCTGTT GTGGCGGCGG TAGGTTTGGG AGTGTTGGCG GGGCGAAGTA ATAAACAATT GGCTGATGCG AGAACCGAAC GGGACAAAAT TGATAGAGAG AAACAACAGA AAGAACGAGA ACTCGCAACG GCACAGCAGC GAGTTGTTGA TGCTAATAAA AAGGTGATTG ATGCTAATAA AAATCTCCAG GATGCGACGG CTAATTTAAA ACAGCAACAG CTAACTGCCA AGCAACAGCT AAATGCAACA AATCAACAAC TGAAACAAGC ACAAGATAAA GAAAAGCAGG CGCGGGGACA AGTAGAAAAG GCACAAAACG ACTTGCGACA AGCGAGAGAA CAACAACGAC AGGCTTTGGT GGGGTTAAAA ACTGCGGAAG CACAGCAAAA ACGGGCGCAA GATAACCTGA AAAAGACAGA AGCAGAACGA GAAATTGCCT TGACAGGTAC ACGGCTGGAA CGGTCAGGAG TGGCCAATAT TAACCGCTTT GAGTTTAATC AAATAGGTGC TTTGCTGGCA GCAATGCGGG ATGGTAGAGA GTTAAAAAGT TTAATAGACA AGCAGGGAAT CAAGCAACTT AAGGACTACC CGGCGGCTAG CCCAGTGTTG GCTTTACAGA CAATTTTAGA TAACGTGCGG GGTATGACAG TCATGGCTGG GCATGAGAAT TGGGTCAACA GTGCCACCTT TAGTCCAGAT GGGCAGCGCA TCCTCACTGC ATCATCTGAC AAAACAGCGC GTTTGTGGGA CTTACAGGGT CGGCAAATCG CTAAGTTCCA GGGGCATGAG AGTTCGGTCA ACAGTGCCAC CTTTAGTCCA GATGGGCAGC GCATCCTCAC TGCATCATCT GACAAAACAG CGCGTTTGTG GGACTTACAG GGTCGGCAAA TCGCTAAGTT CCAGGGGCAT GAGAGTTCGG TCATCAGTGC CACCTTTAGC CCGGATGGGC AGCGCATCCT CACCCTTTCC GGTGATAGGA CAACGCGGTT GTGGGACTTA CAGGGTCGGC AAATCGCTGA GTTACAGGGG CATGAGGGTT GGGTCAGAAG TGCCACTTTT AGTCCAGATG GGCAGCGCAT CCTCACCGCC TCCGTTGACG AGACAGCGCG GTTGTGGGAC TTACAGGGTC GGCAAATCGC TAAGTTCCAG GGGCATAAGA GTTGGCTCTT CAGTGCCACC TTTAGCCCGG ATGGGCAGCG CATCCTCACT GCATCATCTG ACAAAACAGC GCGGTTATGG GACTTACAGG GTCGTCAAAT CGCTAAGTTC CAGGGGCATG AGAATTCAGT CATCAGTGCT ACATTTAGCC CAGATGGACA GCGCATCCTC ACCCTCTCCG TTGACAAGAC AGCGCGGTTA TGGGACTTAC AGGGTCGGCA AATCGCTGAG TTGCAGGGGC ATGAGGATTG GGTCAACAGT GCCACATTTA GCCCAGATGG GCAGCGCATC CTCACTGCAT CATCTGACAA GACAGCGCGG TTGTGGGACT TACAGGGTCG GCAAATCGCT GAGTTGCAGG GGCATGAGGA TTGGGTCAAC AGTGCCACAT TTAGCCCGGA TGGGCAGCGC ATACTCACCG CCTCTAGAGA CGAGACAGCG CGGTTGTGGA ACTTACAGGG TTGGCAAATC GCTAAATTCC AGGGGCATGA GAATGTGGTC AGCAGTGCCA CATTTAGCCC GGATGGACAG CGCATCCTCA CTGCCTCACC TGACAAGACA GCGCGGTTGT GGGACTTACA GGGTCGGCAA ATTGCTGAGT TGCAGGGGCA TGAGAATGTG GTCAGCAGTG CCACATTTAG CCCGGATGGA CAGCGCATCC TCACTGCCTC ACCTGACAAG ACAGCGCGGT TGTGGGACTT ACAGGGTCGG CAAATTGCTG AGTTGCAGGG GCATAAGGGT TGGCTCTTCA GTGCCATCTT TAGCCCTGAT GGGCAGCGCA TCCTCACCGC CTCCGATGAC AAAACAGCGC GGCTGTGGGA CTTACAGGGT CGACAAATCG CTGAGTTGGG GCATAAGGGT TGGCTCTTCA GTGCCACCTT TAGCCCGGAT GGGCAGCGCA TCCTCACCGC TTCAAGTGAC AGCACAGCGC GGTTGTGGAA CTTACAGGGC AGGGAAATTG CCAAGTTCCA GGGGCATAAG AATTTGGTCA TCAGTGCCAG TTTTAGCCCG GATGGGCAGC GCATCCTCAC TGCATCATCT GACAAGACAG CGCGGCTGTG GGAATTACAG GGCAGGGAAA TTGCCAAGTT CCAGGGGCAT GAGGGTGATG TCATAACTGC CATTTTTAGC CCGGATGGGC AGCGCATCCT CACCGCCTCT AGAGACAAGA TAGCGCGGTT GTGGGACTTA CAGGGTCGGG AAATTGCCAA GTTCCAAGGG CATGAGGATT GGGTCAACAG TGCCATTTTT AGCCCGGATG GGCAGCGCAT CCTCACCGCC TCTAGAGACA AGACAGCGCG GTTGTGGGAC TTACAGGGTC GGGAAATTGC CAAGTTCCAA GGGCATGAGG ATTGGGTCAA CAGTGCCACC TTTAGCCCGG ATGGGCAGCG CATCCTCACC GCCTCTAGAG ACAAGACAGC GCGGCTGTGG CAGGTGGAAA GTTTAGAGCA ATTACTGGCA CGGGGTTGTG GGTGGTTGCG TAACTATTTA ATTTATGCGC CAAATTTGAG TGAGAGTGAT AAGCAGGTTT GTAAAAAGGA GTGA
|
Protein sequence | MQYQYQVGGS LPADAPTYVK RQADEDLYTG LKAGQFCYVL NSRQMGKSSL RVQVMGRLQA EGFACAAVDI TAIGTAEITP EQWYAGVIDT LVGYFNLYTD FDLETWWNNN GLLSPVQRFS KFIETVLLPR ITENIVIFID EIDSVLSLDF NLDDFFAVIR DCYNRRADHP EYHRITFALI GVSTPSDLIQ DKGRTPFNIG RAIDLTGFEL AEAEPLAQGL AALGNPQEII AAVLAWTGGQ PFLTQKVCNL LIADYLVETR FIASGENKEK SSIENLVATV VKNRIIENWE GQDEPEHLKT IRDRIMRSGE QRTGRLLGLY QQILQQSELV ADDSYDQMML RLTGLVVRRD GKLRIYNRIY AEVFQQEWCE NILAGLRPYS DTFNAWVASN YQDESRLLRG QALQEALAWR IGKNLTDVDD RFLDASQELQ KREVEKSLAL ARKEQEILTA ANKKARQRVF IASVLLIISV VAAVGLGVLA GRSNKQLADA RTERDKIDRE KQQKERELAT AQQRVVDANK KVIDANKNLQ DATANLKQQQ LTAKQQLNAT NQQLKQAQDK EKQARGQVEK AQNDLRQARE QQRQALVGLK TAEAQQKRAQ DNLKKTEAER EIALTGTRLE RSGVANINRF EFNQIGALLA AMRDGRELKS LIDKQGIKQL KDYPAASPVL ALQTILDNVR GMTVMAGHEN WVNSATFSPD GQRILTASSD KTARLWDLQG RQIAKFQGHE SSVNSATFSP DGQRILTASS DKTARLWDLQ GRQIAKFQGH ESSVISATFS PDGQRILTLS GDRTTRLWDL QGRQIAELQG HEGWVRSATF SPDGQRILTA SVDETARLWD LQGRQIAKFQ GHKSWLFSAT FSPDGQRILT ASSDKTARLW DLQGRQIAKF QGHENSVISA TFSPDGQRIL TLSVDKTARL WDLQGRQIAE LQGHEDWVNS ATFSPDGQRI LTASSDKTAR LWDLQGRQIA ELQGHEDWVN SATFSPDGQR ILTASRDETA RLWNLQGWQI AKFQGHENVV SSATFSPDGQ RILTASPDKT ARLWDLQGRQ IAELQGHENV VSSATFSPDG QRILTASPDK TARLWDLQGR QIAELQGHKG WLFSAIFSPD GQRILTASDD KTARLWDLQG RQIAELGHKG WLFSATFSPD GQRILTASSD STARLWNLQG REIAKFQGHK NLVISASFSP DGQRILTASS DKTARLWELQ GREIAKFQGH EGDVITAIFS PDGQRILTAS RDKIARLWDL QGREIAKFQG HEDWVNSAIF SPDGQRILTA SRDKTARLWD LQGREIAKFQ GHEDWVNSAT FSPDGQRILT ASRDKTARLW QVESLEQLLA RGCGWLRNYL IYAPNLSESD KQVCKKE
|
| |