Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_0049 |
Symbol | |
ID | 3683558 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 55168 |
End bp | 58002 |
Gene Length | 2835 bp |
Protein Length | 944 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 637715376 |
Product | HEAT repeat-containing PBS lyase |
Protein accession | YP_320570 |
Protein GI | 75906274 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG5635] Predicted NTPase (NACHT family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.229879 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.368167 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAAGAG CGAGTTATGG CCCAGAGGCG AAAAGGCGAT CGCGGCATTT ATTACAGGTA CTATTAGCTT ATGCTAATGA TGAGCTAGGC TGCGATGAAG CGGCTTTAGA TGCCCTGCGT CCCCAAATAC AAACTCGCTG GCAAAGTGAA ACACGCCTAG TTGTCAGAAC TAAAGTCAGA TTTTTGCAGG CTTTAACGGG GTTAACATCA GATGAGTTAA CATCTGAACA AATCAAAGAA GCTTTAAAAC GGTTTGCAGA TTTCTTAGAA ATATTAGAAG ACAATCGCCC ATCTCGCAGT GGTTCCGAAA CTTGGCACTT TACCCTCAAT CTTTGGTACA AACGCCAAGA TACAGCAGCC AACTTACAAA AATTTGATTC CCACTGGGAA AGTCGCCGTC CAGAAAAATC CAAGGAAGTC ACAGCAGCAA AAGCGGACAA ACAAGTACCT CACCCAAATT GGCAAGAACT ATGTCGCGCC AACCTCAACA ACCGCCTGAC AACAAATCCC CTCACCAGTG TAGATGGTGT CACTTTTGAA TTGAATCAGG TTTACGTACC TTTGGGCTTA GTGCAGCGAA AACAGCGTTT ACGTCATCGT GATGATGTCA CTCCCCAAGA GGGTTCGCGG TTATATGAAC CAGAGGAATT GGATATTACC CAAACCTTTG ACCATAATGA ATTTATTCAA CAGGTACTAG GACAAAAGCA AAGTCAGCGA ATTGCCATTA TCGGCGAACC AGGCGCAGGG AAAACTACTT TTTTGCAGAG AATTGCTGCT TGGGTGTTAG ATAATACCGC AGATTTACCA GTTTGGATTT CCCTAGCAGA TTTACAAGGT AAAACTCTGG AACAGTATTT AATTCAAGAC TGGCTACCTT CAGCTATGCG TAAACTTAGG GTTTCGCCAG AACTTGAAGA CGCATTTTGT GAACAGTTCA ACCAAGGAAG AGTTTGGTTA CTGCTGGATG CGGTGGATGA AATGGCAATA GAATCTACCA GCGCCTTGGC AAAAATTGCT AGTTTCTTGA AAAGTTGGGT AGGCGATGCC ACAATTATTC TTACCTGTCG GCTGAATGTT TGGGATGCGG GGAAGAATGC CTTAGAAAAT TTTGACACTT ATGGCAATTT ACATTTTAGC TACGGCCTGG ATCAAACACA GGATCGGATA GGGCAATTTA TTTGTCGATG GTTTCAGTAC AATCCAGCGT TAGGCGAAAA ACTGCGACAT GAGTTAAACC AACCAGAAAG GCGGCGCGTC AAAGATACGG TAAAAAATCC CCTACGCTTG GCTTTGTTGT GTCGGATTTG GGGCATCACT CAGGGAGAAT TACCTTATAG TAAGGCGATG CTTTATCAGC AGTTTGTGGA AGCAATCTAT GAGTGGAAAC AAGACATTTT CCCTACTACT TCCACTCAGC GACAGGAGTT AAACCGGGCG TTGGGAAAAT TAGCACTGTT GGCAATTGCC CAAGAACAGA CAAAGTTTCG CCTCAGACAT CGTTTTGTCT GCCAAGTTTT AGGCACTCCC GATGATGGAC TGTTCCAACT AGCACTCCAG TTAGGTTGGT TGAATCAAGT CGGTATTTTA GAAACTCAAG GGGAAAAAGT TTATGCTTTT TACCATCCCA CATTTCAAGA ATATTTTGCC GCTCAGGCGA TTACAGATTT TTCAGGGGAA AGTGGTTACC GGATTTTTGA ACCGCAATGG CGAGAGGTAA TACTATTGTG GCTGGGTAGA GATGATGTAC CACAGGTAGA AAAAGAGGCT TTGATTAACG CACTAATTCA ATTTGAGGAC GGATGCGGGG GATTCTATAA TTATCAAGCT TATTTCTTGG CAGCCCAGGG AATCACTGAA TTTACCGATT CTCAACAAGC TGAGGCGATG ATTCAGCAGT TAATTAAGTG GCGTTTTGGC TACTTTGATT CACAAAAGCA AAAATGGTGT AGATACCCAT CACCAATTGT AGAAGGTGCG CGCATAGCTT TACTCAAAAC CGATCGCCTA AAAGCGATCG CCTGCTTAGA ACAATTTATC ACTGCCTGTG ACAATGAGTT TGATAGTTGG AATGCAGCCT ATAGTTTAGG TAAGATTTTT GACCCTGGGA ACAAGATTGC GATCGCTTCC TTGGAAAATC TTGTTAAAAT AGCTCGGCAT GAAACTATCC GTTGGCAAGC CGCCTATAGT CTAGGCAGAG TCGATGCAGA AAATCCCACG GCTATGACTG CATTGCTGCA AATTATTGCA ACTACTGACA ATGAATCTAC TCGTCGTAAG GCTGCCTATA GTTTAGGTAA ACTTGATTCC CACAACGCAA CGGCTATTAC CACATTAGAA AAAATCGCCC AATCAGCCAC AGATGTTTCT CAACGCCGCC AAGCCACCGA GAATCTAGCA ACTTTACGCG GTGAAGAAAT TACCCATAAA TGGCAAGGTA AACAACAACA ACAACAAAAG GTACTTTACC CTGTACCTGA AAAAATCACA GCCTTAATCC GTGGTATCTC TTCATGTGAG GATGAAGACA CCAAAAGACG TAGAGCCTAC AAATTAGCAC AACTTGACCC AGGTAATAAG ATTGCATTGA TAACCTTACT ACAATTACTC AAATCAACCC AACGCGAATC GTTACGCAAA CGCACAGCAG ACAATTTAAA AGAAATCCTG ATTGATGAAC AGTTACCACA AGTAATCTTC TACCTGAAAG ATTGCTTTTC CCCCACAGTT AGGGAACACG AGTTGGAACT ACACCGAGAT TGTTACAAGT TGCTGTGGTA CTGTGCCGAA AATCTGACTT ATCAGGAGTT TTACCAAGCT TGGCATAGCA GTTAA
|
Protein sequence | MARASYGPEA KRRSRHLLQV LLAYANDELG CDEAALDALR PQIQTRWQSE TRLVVRTKVR FLQALTGLTS DELTSEQIKE ALKRFADFLE ILEDNRPSRS GSETWHFTLN LWYKRQDTAA NLQKFDSHWE SRRPEKSKEV TAAKADKQVP HPNWQELCRA NLNNRLTTNP LTSVDGVTFE LNQVYVPLGL VQRKQRLRHR DDVTPQEGSR LYEPEELDIT QTFDHNEFIQ QVLGQKQSQR IAIIGEPGAG KTTFLQRIAA WVLDNTADLP VWISLADLQG KTLEQYLIQD WLPSAMRKLR VSPELEDAFC EQFNQGRVWL LLDAVDEMAI ESTSALAKIA SFLKSWVGDA TIILTCRLNV WDAGKNALEN FDTYGNLHFS YGLDQTQDRI GQFICRWFQY NPALGEKLRH ELNQPERRRV KDTVKNPLRL ALLCRIWGIT QGELPYSKAM LYQQFVEAIY EWKQDIFPTT STQRQELNRA LGKLALLAIA QEQTKFRLRH RFVCQVLGTP DDGLFQLALQ LGWLNQVGIL ETQGEKVYAF YHPTFQEYFA AQAITDFSGE SGYRIFEPQW REVILLWLGR DDVPQVEKEA LINALIQFED GCGGFYNYQA YFLAAQGITE FTDSQQAEAM IQQLIKWRFG YFDSQKQKWC RYPSPIVEGA RIALLKTDRL KAIACLEQFI TACDNEFDSW NAAYSLGKIF DPGNKIAIAS LENLVKIARH ETIRWQAAYS LGRVDAENPT AMTALLQIIA TTDNESTRRK AAYSLGKLDS HNATAITTLE KIAQSATDVS QRRQATENLA TLRGEEITHK WQGKQQQQQK VLYPVPEKIT ALIRGISSCE DEDTKRRRAY KLAQLDPGNK IALITLLQLL KSTQRESLRK RTADNLKEIL IDEQLPQVIF YLKDCFSPTV REHELELHRD CYKLLWYCAE NLTYQEFYQA WHSS
|
| |