Gene Ava_0996 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_0996 
Symbol 
ID3680025 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp1199375 
End bp1202707 
Gene Length3333 bp 
Protein Length1110 aa 
Translation table11 
GC content48% 
IMG OID637716331 
ProductHEAT repeat-containing PBS lyase 
Protein accessionYP_321515 
Protein GI75907219 
COG category[C] Energy production and conversion 
COG ID[COG1413] FOG: HEAT repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.399368 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.160233 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCACAA CTCTCCGCCA TCATCGCCAG CCTTCTCGCT ACATTTTTTA TTCTTTAGCC 
TTCATCCTTT CTTTCCTACT CTGCCTACCT TGGGTGAACG CCCAAGAAAA GCCCAAGCCC
CAGCCGGAAG CGTGGCAGAT TAACGGCATA GTAGCTGCCC TTGATGATGG ACATGACCAA
GTTAAGGGAT ATGCTTTCAA GCAATTAGGT GAATATAATC TCAAAGATTT AAAATCCCTG
GTGAAGAAAC CAGAGGATAT TGCCGAGAAA GCTGCCAAAA TCCTCAAGGA TGAAAAGGTG
AGCGCCGGCG TTCGTGGCAG TGCGGCAGAG GCATTGGGCA ACTTGGGAGA AGCCGCAGCC
AAGTATGTCC CTGACATCGC TAATATCCTC AAGGATGAAA AAGTTCCCAC CAACGTTCGT
GGTAGTGCGG CAGAGGCATT GGGCAACTTG GGAGACACCG TAGCCAAGTA TGTCCCTGAC
ATCCTCTATT TCCTCAAGGA TCAAAAGGTT CCCGACTATG ATCGTTCAGG TGCGGCAAAG
GCATTGGGCA ACTTGGGGAA CGCAGCAGCC AAGTATGTCC CTGACATCGC TAATATCCTC
AAAGATGAAA AAGTAGACAC CATCGTTCGT TTATTTGCGG CATCGGCATT GGGCAACTTG
GGAAACGCAG CAGCCAAGTA TGTCCCTGAC ATCGCTAATA TCCTCAAAGA TGAAAAGGTT
CCCACCAACG TTCGTGGTAG TGCGGCAGAG GCATTGGGCA AATTGGGAAA CACCGCAGCC
AAGTATGTCC CTGATATCGC TAATATCCTC AAGGATGAAA AGGTGGATGC CGATGTTCGT
AGAAGTGCGG CACAGGCATT GGGCAACTTG GGAGAAGCCG CAGCCAAGTA TGTCCCTGAC
ATCGCTAATA TCCTCAAGGA TGAAAAGGTT CCCGCCACCG TTCGTTCAGG TGCGGCACAG
GCATTGGGCA GTATGGGAGA AGCCGCAGCT AAGTATGTCC CTGACATCCT CTATTTCCTC
AAGGATGAAA AAGTAGACAC TATCGTTCGT TCAGATGCGG TAAAGGCATT GGGCAACTTG
GGAGACACCG CAGCCAAGTA TGTCCCTGAC ATCGCTAATA TCCTCAAGGA TGAAAAGTTG
GACAACAACG TTCGTTATTA TGGTGCGGTA TCGGCATTGG GCAACTTGGG AGACACCGCA
GCTAAGTATG TCCCTGACAT CGCTAATATC CTCAAAGATG AAAAGGTGGA CACCATCTTT
CGTAGAAATG CGGCAGAGGC ATTGGGCAGT ATGGGAGAAG CCGCAGCTAA GTATGTCCCT
GACATCCTCA ATTTTCTCAA GGATGAAAAG GTGGAAGCCT CCTTTCGTAG AAATGCAGCA
TTGGCATTGG GCAACTTGGG AGAAGCCGCA GTCAAGTATG TCCCTGACAT CCTCAATTTC
CTCAAGGATG AAAACGTTCC CGCCGACGTT CGTGGAAATG CGGCATCGGC ATTGGGAAAA
ATCAGACAAC TTAAATTAGA AGAGGTTGTC GTAGTTTTGA ATTATATCTA CGAACCAAAT
CAGGGAAGTT TTGATTATGG TCAGAATAAT TATGAATTTT GGCGATTTTG GACTTATTTC
CTGGGTGGCG GCCATGAAAA AGTCAAAACC CTGCTGACAT GGCTAGGCAG ACCAAAAACA
ACTCCTGATA AGCTGGAACA CTCTCAAGCT GTCAAAACAC TAGAACTATT CCGCGACATT
TGGCAACCCA GCCAAGAATT TGCACGAATA CGTGATGATT TAGCAAAACA AATCGCCCTA
GTCGCCAGAA AAACCCCTTG GCAACCGCAA GATATTCTCC TATTAGAAAC TCACTACAAC
AACCTCAAAA AAGCTGGCTA CAACGAAGCT GATTCACTGC AATCAGTCAT AGTCAACCTC
AAGGGTTGGC AGTGGTTTTT CAACGCCAGA ATTACCATCC TCACACACGC TACCTTCTGG
CTTGCCCTCA TCTTCGCCTA CCCCAAATTC CCCCAAATTC AAGCCATCTT CTTCTGGAAC
CCTTGGGTAC GCCGCATCTT AGGGGTGGGT TACGTCGGCT TTCTCCTCAC CTGGTTTCCC
CCCTTCCGCC GTAAATTATT TGAACCCTTC AAACCCTCCC TCCTAGCCGA TGCCGGCTTA
GATAACTTTA GTGACAAAGG CTACTTCCCA GAATCCAGAG TCAAAGTTCC CGGTACAGGG
GATATCTTCC CAGTTACCGC CGCCCTCCCC AGCATCAAAG GGCAAATTAT CTTAGAAGGG
GATTCCGGCT TAGGTAAGTC GATGTTTCTC CGCCATCTGT TGCAAAACTC CCCGCGCATA
GTCGTTTATC TCCCCGCCCA AAAATGCCAT AAAGGCGTAA TTGAAGCCAT CCAAGACAAG
CTACACGGCC AAGCCCAAGA TGCCGACTTC CTGAAAAACC TCATTTACAG TGGTGCAATA
GATATCTGCA TCGACGGACT CAACGAAGTC ACAGCCGATA CCAGAGCTAA AATCTGCCAG
TTTGTGGAAA GCTATTTCCG GGGCAACATT ATCATGACTA CCCAGCCCCT AGAGTGGACA
CCACCCTCAA CCGCCAAAAC CTACCACTTG CAACCCTTAG AACCCAACCA AATTCAAGAG
TTTTTGCTCT CCCGTGAACC GCGACTGCCC AAGGATGCCA AAATTCAAGG TGCTGATTAC
GAACAAGCCT GCATTAATTA TTTAAAAGAA GTCCTCACTA CCCAGCAACC AGAGGAAGAA
TTAAAAGCAG CCAGGCGCAT TCTTTCCAAC CCAATGGATT TAACCGTGGT AGCCCTGATG
TTATCACAAG GTCAATATCC CAACTTATTC CGCCTGCAAG AACAGCAATA CAACCTAATA
GCGGCTGAAT ACCTGAAGGA ATGGAACCAA GAATTTCCCT TAAGAAAATT CTCCGCCGCA
GTCTACCAAA TGCGTATCGA CGACAAACAA GCCTTACCCG CCGATGAATT TTATCAAGTC
GTCATGTCTT TGGAAGATGA GAAATATAAA ATGGTAGTGA GCCGTCAATG GCAAGATGAT
AAAGGGGAAG CCAAGAAAGA ATGGTATTTC CGCCACGATA AAATCATGGA CTTCTTCCTA
GTGCAGAACT TTCTCGGCGA CAGTGATGAA GCGGAAAGAC TATTAGTAGA TAGAATGGGT
GACCCCCGCT TTCGTGGTGT TTACTTCCTC TTAGCCAGCT TACTGCCAAT AGATGCAGCC
AAGGAATTGC GGGAGAAGTT GATTCAATAC GCCGCAGATA CTAAAGACAA TACGGTGAGT
AATACCTTTG TGCAGTTATT ACGGACAAGG TAA
 
Protein sequence
MVTTLRHHRQ PSRYIFYSLA FILSFLLCLP WVNAQEKPKP QPEAWQINGI VAALDDGHDQ 
VKGYAFKQLG EYNLKDLKSL VKKPEDIAEK AAKILKDEKV SAGVRGSAAE ALGNLGEAAA
KYVPDIANIL KDEKVPTNVR GSAAEALGNL GDTVAKYVPD ILYFLKDQKV PDYDRSGAAK
ALGNLGNAAA KYVPDIANIL KDEKVDTIVR LFAASALGNL GNAAAKYVPD IANILKDEKV
PTNVRGSAAE ALGKLGNTAA KYVPDIANIL KDEKVDADVR RSAAQALGNL GEAAAKYVPD
IANILKDEKV PATVRSGAAQ ALGSMGEAAA KYVPDILYFL KDEKVDTIVR SDAVKALGNL
GDTAAKYVPD IANILKDEKL DNNVRYYGAV SALGNLGDTA AKYVPDIANI LKDEKVDTIF
RRNAAEALGS MGEAAAKYVP DILNFLKDEK VEASFRRNAA LALGNLGEAA VKYVPDILNF
LKDENVPADV RGNAASALGK IRQLKLEEVV VVLNYIYEPN QGSFDYGQNN YEFWRFWTYF
LGGGHEKVKT LLTWLGRPKT TPDKLEHSQA VKTLELFRDI WQPSQEFARI RDDLAKQIAL
VARKTPWQPQ DILLLETHYN NLKKAGYNEA DSLQSVIVNL KGWQWFFNAR ITILTHATFW
LALIFAYPKF PQIQAIFFWN PWVRRILGVG YVGFLLTWFP PFRRKLFEPF KPSLLADAGL
DNFSDKGYFP ESRVKVPGTG DIFPVTAALP SIKGQIILEG DSGLGKSMFL RHLLQNSPRI
VVYLPAQKCH KGVIEAIQDK LHGQAQDADF LKNLIYSGAI DICIDGLNEV TADTRAKICQ
FVESYFRGNI IMTTQPLEWT PPSTAKTYHL QPLEPNQIQE FLLSREPRLP KDAKIQGADY
EQACINYLKE VLTTQQPEEE LKAARRILSN PMDLTVVALM LSQGQYPNLF RLQEQQYNLI
AAEYLKEWNQ EFPLRKFSAA VYQMRIDDKQ ALPADEFYQV VMSLEDEKYK MVVSRQWQDD
KGEAKKEWYF RHDKIMDFFL VQNFLGDSDE AERLLVDRMG DPRFRGVYFL LASLLPIDAA
KELREKLIQY AADTKDNTVS NTFVQLLRTR