Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_0996 |
Symbol | |
ID | 3680025 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 1199375 |
End bp | 1202707 |
Gene Length | 3333 bp |
Protein Length | 1110 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637716331 |
Product | HEAT repeat-containing PBS lyase |
Protein accession | YP_321515 |
Protein GI | 75907219 |
COG category | [C] Energy production and conversion |
COG ID | [COG1413] FOG: HEAT repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.399368 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.160233 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCACAA CTCTCCGCCA TCATCGCCAG CCTTCTCGCT ACATTTTTTA TTCTTTAGCC TTCATCCTTT CTTTCCTACT CTGCCTACCT TGGGTGAACG CCCAAGAAAA GCCCAAGCCC CAGCCGGAAG CGTGGCAGAT TAACGGCATA GTAGCTGCCC TTGATGATGG ACATGACCAA GTTAAGGGAT ATGCTTTCAA GCAATTAGGT GAATATAATC TCAAAGATTT AAAATCCCTG GTGAAGAAAC CAGAGGATAT TGCCGAGAAA GCTGCCAAAA TCCTCAAGGA TGAAAAGGTG AGCGCCGGCG TTCGTGGCAG TGCGGCAGAG GCATTGGGCA ACTTGGGAGA AGCCGCAGCC AAGTATGTCC CTGACATCGC TAATATCCTC AAGGATGAAA AAGTTCCCAC CAACGTTCGT GGTAGTGCGG CAGAGGCATT GGGCAACTTG GGAGACACCG TAGCCAAGTA TGTCCCTGAC ATCCTCTATT TCCTCAAGGA TCAAAAGGTT CCCGACTATG ATCGTTCAGG TGCGGCAAAG GCATTGGGCA ACTTGGGGAA CGCAGCAGCC AAGTATGTCC CTGACATCGC TAATATCCTC AAAGATGAAA AAGTAGACAC CATCGTTCGT TTATTTGCGG CATCGGCATT GGGCAACTTG GGAAACGCAG CAGCCAAGTA TGTCCCTGAC ATCGCTAATA TCCTCAAAGA TGAAAAGGTT CCCACCAACG TTCGTGGTAG TGCGGCAGAG GCATTGGGCA AATTGGGAAA CACCGCAGCC AAGTATGTCC CTGATATCGC TAATATCCTC AAGGATGAAA AGGTGGATGC CGATGTTCGT AGAAGTGCGG CACAGGCATT GGGCAACTTG GGAGAAGCCG CAGCCAAGTA TGTCCCTGAC ATCGCTAATA TCCTCAAGGA TGAAAAGGTT CCCGCCACCG TTCGTTCAGG TGCGGCACAG GCATTGGGCA GTATGGGAGA AGCCGCAGCT AAGTATGTCC CTGACATCCT CTATTTCCTC AAGGATGAAA AAGTAGACAC TATCGTTCGT TCAGATGCGG TAAAGGCATT GGGCAACTTG GGAGACACCG CAGCCAAGTA TGTCCCTGAC ATCGCTAATA TCCTCAAGGA TGAAAAGTTG GACAACAACG TTCGTTATTA TGGTGCGGTA TCGGCATTGG GCAACTTGGG AGACACCGCA GCTAAGTATG TCCCTGACAT CGCTAATATC CTCAAAGATG AAAAGGTGGA CACCATCTTT CGTAGAAATG CGGCAGAGGC ATTGGGCAGT ATGGGAGAAG CCGCAGCTAA GTATGTCCCT GACATCCTCA ATTTTCTCAA GGATGAAAAG GTGGAAGCCT CCTTTCGTAG AAATGCAGCA TTGGCATTGG GCAACTTGGG AGAAGCCGCA GTCAAGTATG TCCCTGACAT CCTCAATTTC CTCAAGGATG AAAACGTTCC CGCCGACGTT CGTGGAAATG CGGCATCGGC ATTGGGAAAA ATCAGACAAC TTAAATTAGA AGAGGTTGTC GTAGTTTTGA ATTATATCTA CGAACCAAAT CAGGGAAGTT TTGATTATGG TCAGAATAAT TATGAATTTT GGCGATTTTG GACTTATTTC CTGGGTGGCG GCCATGAAAA AGTCAAAACC CTGCTGACAT GGCTAGGCAG ACCAAAAACA ACTCCTGATA AGCTGGAACA CTCTCAAGCT GTCAAAACAC TAGAACTATT CCGCGACATT TGGCAACCCA GCCAAGAATT TGCACGAATA CGTGATGATT TAGCAAAACA AATCGCCCTA GTCGCCAGAA AAACCCCTTG GCAACCGCAA GATATTCTCC TATTAGAAAC TCACTACAAC AACCTCAAAA AAGCTGGCTA CAACGAAGCT GATTCACTGC AATCAGTCAT AGTCAACCTC AAGGGTTGGC AGTGGTTTTT CAACGCCAGA ATTACCATCC TCACACACGC TACCTTCTGG CTTGCCCTCA TCTTCGCCTA CCCCAAATTC CCCCAAATTC AAGCCATCTT CTTCTGGAAC CCTTGGGTAC GCCGCATCTT AGGGGTGGGT TACGTCGGCT TTCTCCTCAC CTGGTTTCCC CCCTTCCGCC GTAAATTATT TGAACCCTTC AAACCCTCCC TCCTAGCCGA TGCCGGCTTA GATAACTTTA GTGACAAAGG CTACTTCCCA GAATCCAGAG TCAAAGTTCC CGGTACAGGG GATATCTTCC CAGTTACCGC CGCCCTCCCC AGCATCAAAG GGCAAATTAT CTTAGAAGGG GATTCCGGCT TAGGTAAGTC GATGTTTCTC CGCCATCTGT TGCAAAACTC CCCGCGCATA GTCGTTTATC TCCCCGCCCA AAAATGCCAT AAAGGCGTAA TTGAAGCCAT CCAAGACAAG CTACACGGCC AAGCCCAAGA TGCCGACTTC CTGAAAAACC TCATTTACAG TGGTGCAATA GATATCTGCA TCGACGGACT CAACGAAGTC ACAGCCGATA CCAGAGCTAA AATCTGCCAG TTTGTGGAAA GCTATTTCCG GGGCAACATT ATCATGACTA CCCAGCCCCT AGAGTGGACA CCACCCTCAA CCGCCAAAAC CTACCACTTG CAACCCTTAG AACCCAACCA AATTCAAGAG TTTTTGCTCT CCCGTGAACC GCGACTGCCC AAGGATGCCA AAATTCAAGG TGCTGATTAC GAACAAGCCT GCATTAATTA TTTAAAAGAA GTCCTCACTA CCCAGCAACC AGAGGAAGAA TTAAAAGCAG CCAGGCGCAT TCTTTCCAAC CCAATGGATT TAACCGTGGT AGCCCTGATG TTATCACAAG GTCAATATCC CAACTTATTC CGCCTGCAAG AACAGCAATA CAACCTAATA GCGGCTGAAT ACCTGAAGGA ATGGAACCAA GAATTTCCCT TAAGAAAATT CTCCGCCGCA GTCTACCAAA TGCGTATCGA CGACAAACAA GCCTTACCCG CCGATGAATT TTATCAAGTC GTCATGTCTT TGGAAGATGA GAAATATAAA ATGGTAGTGA GCCGTCAATG GCAAGATGAT AAAGGGGAAG CCAAGAAAGA ATGGTATTTC CGCCACGATA AAATCATGGA CTTCTTCCTA GTGCAGAACT TTCTCGGCGA CAGTGATGAA GCGGAAAGAC TATTAGTAGA TAGAATGGGT GACCCCCGCT TTCGTGGTGT TTACTTCCTC TTAGCCAGCT TACTGCCAAT AGATGCAGCC AAGGAATTGC GGGAGAAGTT GATTCAATAC GCCGCAGATA CTAAAGACAA TACGGTGAGT AATACCTTTG TGCAGTTATT ACGGACAAGG TAA
|
Protein sequence | MVTTLRHHRQ PSRYIFYSLA FILSFLLCLP WVNAQEKPKP QPEAWQINGI VAALDDGHDQ VKGYAFKQLG EYNLKDLKSL VKKPEDIAEK AAKILKDEKV SAGVRGSAAE ALGNLGEAAA KYVPDIANIL KDEKVPTNVR GSAAEALGNL GDTVAKYVPD ILYFLKDQKV PDYDRSGAAK ALGNLGNAAA KYVPDIANIL KDEKVDTIVR LFAASALGNL GNAAAKYVPD IANILKDEKV PTNVRGSAAE ALGKLGNTAA KYVPDIANIL KDEKVDADVR RSAAQALGNL GEAAAKYVPD IANILKDEKV PATVRSGAAQ ALGSMGEAAA KYVPDILYFL KDEKVDTIVR SDAVKALGNL GDTAAKYVPD IANILKDEKL DNNVRYYGAV SALGNLGDTA AKYVPDIANI LKDEKVDTIF RRNAAEALGS MGEAAAKYVP DILNFLKDEK VEASFRRNAA LALGNLGEAA VKYVPDILNF LKDENVPADV RGNAASALGK IRQLKLEEVV VVLNYIYEPN QGSFDYGQNN YEFWRFWTYF LGGGHEKVKT LLTWLGRPKT TPDKLEHSQA VKTLELFRDI WQPSQEFARI RDDLAKQIAL VARKTPWQPQ DILLLETHYN NLKKAGYNEA DSLQSVIVNL KGWQWFFNAR ITILTHATFW LALIFAYPKF PQIQAIFFWN PWVRRILGVG YVGFLLTWFP PFRRKLFEPF KPSLLADAGL DNFSDKGYFP ESRVKVPGTG DIFPVTAALP SIKGQIILEG DSGLGKSMFL RHLLQNSPRI VVYLPAQKCH KGVIEAIQDK LHGQAQDADF LKNLIYSGAI DICIDGLNEV TADTRAKICQ FVESYFRGNI IMTTQPLEWT PPSTAKTYHL QPLEPNQIQE FLLSREPRLP KDAKIQGADY EQACINYLKE VLTTQQPEEE LKAARRILSN PMDLTVVALM LSQGQYPNLF RLQEQQYNLI AAEYLKEWNQ EFPLRKFSAA VYQMRIDDKQ ALPADEFYQV VMSLEDEKYK MVVSRQWQDD KGEAKKEWYF RHDKIMDFFL VQNFLGDSDE AERLLVDRMG DPRFRGVYFL LASLLPIDAA KELREKLIQY AADTKDNTVS NTFVQLLRTR
|
| |