Gene Ava_3078 details

Gene Information

Locus tag: Ava_3078
Symbol:
ID: 3681058
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 3817350
End bp: 3822485
Gene Length: 5136 bp
Protein Length: 1711 aa
Translation table: 11
GC content: 43%
IMG OID: 637718423
Product: peptidase C14, caspase catalytic subunit p20
Protein accession: YP_323582
Protein GI: 75909286
COG category: [R] General function prediction only
COG ID: [COG2319] FOG: WD40 repeat
TIGRFAM ID:
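
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon (neither assumption is stated in the record itself). The function names are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 3817350
END_BP = 3822485


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # compare with the "Gene Length" field
print(protein_length(length_bp))  # compare with the "Protein Length" field
```

Under these assumptions the computed values agree with the reported gene length (5136 bp) and protein length (1711 aa).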


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.771707
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTCCAC TCGGTGTTGC TACCAGTCAT TCAACTCATA CCCTAGCAAC AGGGAAAGCA 
AAACTTTGGC TGTTGCTAGT GGGAGTTAAC CAATACCAAG ATGAAAGACT ACCAACATTG
CGTTATTCGG CTGTTGACTG TCAAGGTTTA GCAGCAGCTT TAGCCGATGC AACTTACAGG
TTTCCCGATA AATCAGAGTG GGTACATCAT GATTTTGCTA CCCAACTACC CACACTTGCC
ACTGTCAGGA ATAGTTTAAA CAAAGTTACT CACCAGGCTC AACCAGAAGA CACGATTTTA
TTTTACTTTT CTGGTCACGG TATCCTAGAA ACCGGTTCTC AGCAGGTGAT TTTGTGTTTA
GCAGATACTC AAACAGATGA TTTATTGAAT ACAGGTTTAG GTTTACCAGA ACTTTTGCAA
TGCTTGGAAA ATAGCCAAGC ACAGACGCAG CTAGTTTGGC TAGATGCTTG TCATAGTGGT
AGCCTCACAT TTAGGGGTGC GAGAAGTAAC CACACACCAG CATCTTTGCC CAATCCTACA
CCCCAGATAG TGGAATTATT ACGCCAACGG GCAAAACAAA GTAAAGGATT CTACGCTTTA
CTTTCCTGCG ATACCAATCA ACAGTCTTGG GAGTTTCCCG AATTAGGACA TGGGGTATTT
ACTTATTATT TGATGCGGGG GTTGCGGGGT GAAGCAGCAG ACATACAAGG TTTAATTGAT
GCTGATGGGC TTTATCGCTA CGTTTATCAC CAGACACGGC AATATATTGA ACAAACCAAC
CAACAATTAC GGCTAATTAA TCAACAAAAT CGCAGTCGAG GAAACACTCA GGTTTATTCA
GAATATCCAT TGCAAACACC AAAACGTATT GTTGAAGGTG TGGGGGAAGT GATTTTGGGA
GTAAAACCTG CGTTAGTTGT GTCTCCCGAT GCTCGCAAAG CCTTGATTGT GGAAGGGTTA
GCAATTAATC AAACAACTCT GGCTTTTAGT CAACTGTTGG GGACTGTAGG AGGCTTTGGG
ATTGAGTATT GGCCTCTGGC TCATACCAAT CAAGATTTAC AGGCTACAAT TAGCAATTGT
TTACAAACTA GAGAACTGGA GACAGAAAAC CAGCAAAATC ATTTTGCCAC AGTCCTATTA
TATTTGCGGG GTAAATTGGC GCAAAGTTCA ACTGGTGAAC CTGTGTTAGT ACTGGGAGAA
AATATTCAGT TGAGTCGTTC CTGGTTAAGA CAACAATTGA GGCGATCGCT ATACTCCCAA
CAAATTATCA TCTTAGATTG TCCTTTAGAC CAACATAGTC ACATATCCTT ACAAGATTGG
GTAGAAGACT TACAACTGGG ATTCGAGCAA GGACAATGTA TTATAGCGGC TGCTTCATCA
CCAGACAATC CTCAACAATT CCTGCAAATT CTACACTCTA ACTTACAGGC AAATCAAGAA
CAACCAAACC TATCAGCCGC CGCTTGGATT AACCAATTGC AACTATCCTC CCCATTACCC
TTACACATTT GGTTATCAGG TACAAAGGGA GTAATTGAAA TTATCCCTGC AAGTACAGCC
GCTAAAGGTA AACAGCCAAA CGCCATCGTT GATTTAGGTA TTTGTCCTTA CAGAGGATTA
CAAGCCTTCC AAGAAGAAGA TATCCAATAT TTTTACGGTA GAGAAACCCT CACCCAGCAA
CTTATCGCAG ACTTGGAAAC TAAGTCATTT ATGGCTGTAG TGGGTGCATC GGGAAGTGGT
AAATCTTCCG TCGTCCAAGC GGGATTAATT GCCCAACTCC GCCGGGGTCA GCAATTACCA
GGGAGTCAAC AATGGTGGAT GAAAAGCCTC CGTCCTGGGG AAAATCCTCT AGTCAGCTTA
TCCCATTGCT TAGTAGATAG TGGTACCGCA AAAGAAAAAG CGTATCAACA AATGCAGATA
GAAGGGATGT TATATCAAGG GGCGCAAGGA TTTGCCCATT GGCTACATCA TCGCAGTGAA
CCAATGGTAG TGTTGGTGTT AGACCAATTT GAAGAATTAT TTACTCTGGC GGCTAGTGAA
GATAGGCAAA GATTTATTGA CACTGTTTTG GGTGCGTTGG AGTTATCACC AGACAAATTT
AAATTAATTC TTACTCTCCG GGCAGATTTT ATCGCCCCCT GTTTAGAAAT ACCTACCTTA
GCAAAGCTTT TACAGCAGTC AAGTGTTTTG CTACCGCCTT GTTTAACTCA AGAAGAGTAC
CGCCGCATTA TTATTCATCC CGCCGAAAAA GTAGGGTTAA CAGTAGACCC GGAATTAGTC
GAGGTGCTAT TGCAAGAGTT ACATAACTCC CCTGGAGATT TACCATTACT AGAATTTGTT
CTCGAACAGT TGTGGGAATA TCGAGACAAA GGCGTAATTA CCTTACAAGC TTACCAGCAA
TACTTGGGCG GCATTAAAGG CGCATTGGAA AGGAAAGCCC AAGGAGTTTA CGACACTTTA
GACCCAGAAG CCCAAGAATG TACAAGGTGG ATTTTTTTAT CACTGACACA GTTAGGTGAA
GGAACAGAAG ATACCAGACG GCGGCTACTC AAGTCTGAGT TAATTGTCAA AAAATACTCT
GTTGCTTTAG TAGAAAGAAC ATTGCAAGTT TTGACTGCTG CTAAGTTAGT GGTGGTGAAT
GGGGATTGGG AAGAGGCAGG AGGCAAGAGG CAGGGGGCAG GGGGCAGGGG GCAGGGGGAA
AATATCTTGC TTACAACTCC TTCCGTGACC ATAGAAGTAG CCCATGAGGT GATAATTCGT
CACTGGTCAA CTCTACGGTG GTGGTTGGAG GAAAATCGTA GTAGATTGCG ATCGCATCGA
CAAATTGAAC AATCGGCGGC TTTATGGCAG CAAAACAACC AACAGCCCGA CTTTTTATTA
CAAGGTGTCC GGTTAGCAGA AGCCGAGGAA ATTTATCTGA ATTACACAGA TGAATTATCT
TGGGATGTCC AACATTTCAT TGAAGCTTGT CTCCATGAAA GGCGACGCAA ACAACACCAG
GAACAAAGCC GACTTAGACA AGCACAAAGG GCTGTCAGTA TTATCAGTAC GTTGGGTTTA
ACAGCCTTTG GTTTAGCGGT TTTTGCTTAT CAACAAACCC AAAACGCCCA AATCAAAGAA
ATTCAAGCTT TAAATTCTCT GTCGGAAAAC TTTCTCCTCT CCCACAAACA ATTAGAAGCA
CTCATAACTA GTGTGCAAGC CGGGAAGGAA GTACAAAACA TTAGCTTAGG AATCCCCGCA
GATACCCGCA CGCAGACGGC AACCACTTTA CAACAAGCAG TCTACAGCAC TCAAGAACGT
AACCGGTTAC TGCATAATGC TTGGGTAACT AGTGTCAGTT ATTCACCAGA TGGTGAAGTT
ATTGCTTCTG GTAGTGTAGA TAACACTATC CATCTTTGGC GTAGAGATGG TAAATTGCTG
ACCACTCTCA CTGGTCATAA TGATGGGGTA AATAGCGTCA GTTTTTCCCC CGATGGTGAA
ATTATCGCCT CTGGTAGTGC AGACAGTACC ATCAAGCTTT GGCAACGCAA CGGTAAACTC
ATCACCACAC TCAAAGGACA TGACCAAGGT GTCAAGAGTG TTAGTTTTTC CCCCAATGGT
GAAATTATCG CCTCTGGTGG TAGTGACAAT ACCATCAATC TTTGGAGTCG CGCAGGTAAA
TTACTACTTA GTCTCAACGG CCATAGCCAA GGTGTCAATA GCGTCAAATT TTCCCCAGAA
GGTGATACCA TCGCCTCCGC CAGTGACGAC GGAACAATTA GACTGTGGAG TTTAGACGGT
CGCCCTTTAA TCACCATCCC CTCCCACACA AAACAAGTAT TGAGTATTAG TTTTAGCCCC
GATGGGCAAA CCATTGCTTC GGCTGGTGCA GACAATACCG TAAAACTCTG GAGTCGTAAC
GGTACTTTGC TGAAAACTCT AGAGGGACAT AATGAAGCTG TTTGGCAAGT AATTTTCTCC
CCCGATGGAC AGTTAATTGC TACCGCCAGC GCTGATAAAA CTATTACCCT TTGGTCGCGT
GATGGCAATA TCTTAGGAAC TTTTGCTGGA CATAACCATG AAGTCAATAG TCTCAGTTTT
AGTCCCGATG GCAATACATT AGCCTCAGGT AGTGATGATA ATACTGTCAG ACTGTGGACT
GTGAACAGAA CACTACCCAA AACCTTTTAT GGACATAAAG GCAGCGTCAG TTACGTCAAA
TTTAGCAATG ATGGTCAGAA AATTACCTCA CTCAGCACCG ACAGCACCAT GAAAATCTGG
AGTCTGGATG GGAAATTACT GCAAACCTTA TCGTCTCCTC TACCTGATGT CACCAGTGTC
AGTTTCACCC CAGACAATAA CATCGTTGCC TTAGCTAGTC CTGACCATAC TATCCACCTT
TACAATCGGG ATGGCATTTT ACTCCGTAGC TTACCAGGTC ACAACCATTG GATAACGAGC
TTAAGTTTCA GTCCTGACAA TCAAATATTA GCTTCTGGTA GTGCTGATAA AACCATCAAA
CTTTGGAGTG TAAACGGTCG CTTGTTGAAA ACTCTCTCAG GGCATAATGG TTGGGTGACA
GATATTAAAT TTAGCGCTGA TGGAAAAAAT ATTGTCTCTG CTAGTGCTGA CAAAACCATC
AAAATTTGGA GCTTAGATGG TAAGCTAATC AGGACTTTAC AAGGTCATAG TGCTAGTGTG
TGGAGTGTCA ACTTTTCACC CGATGGTCAA ACTCTCGCTT CAACTAGTCA AGATGAAACT
ATCAAACTCT GGAATTTAGA TGGCGAATTA ATCTACACCC TCCGGGGTCA TGGTGATGTA
GTTTACAACT TAAGTTTTTC ACCTGATAGT AAAACAATAG CCTCAGCTAG TGACGACGGC
ACAATTAAGC TATGGAATGT CACTCATGGC ACATTACTAA AAACCTTCCA AGGACATCGC
GGCGGTGTCA GGAGTGTAAG TTTTAGTCCC GACGGTAAAA TTTTGGCATC CGGTGGACAT
GATACTACAA TCAAAGTCTG GAACCTGGAG GGGATAGAAC TACAAACCCT CAATCTAGAT
GAGTTGTTAA ACCGTGCTTG CGATCGCCTG CATAATTATC TCACAACCAA CCCTAATATA
ACTACAGAAG AGTATCAGCT TTGTTTTGGA GATTAG
 
Protein sequence
MSPLGVATSH STHTLATGKA KLWLLLVGVN QYQDERLPTL RYSAVDCQGL AAALADATYR 
FPDKSEWVHH DFATQLPTLA TVRNSLNKVT HQAQPEDTIL FYFSGHGILE TGSQQVILCL
ADTQTDDLLN TGLGLPELLQ CLENSQAQTQ LVWLDACHSG SLTFRGARSN HTPASLPNPT
PQIVELLRQR AKQSKGFYAL LSCDTNQQSW EFPELGHGVF TYYLMRGLRG EAADIQGLID
ADGLYRYVYH QTRQYIEQTN QQLRLINQQN RSRGNTQVYS EYPLQTPKRI VEGVGEVILG
VKPALVVSPD ARKALIVEGL AINQTTLAFS QLLGTVGGFG IEYWPLAHTN QDLQATISNC
LQTRELETEN QQNHFATVLL YLRGKLAQSS TGEPVLVLGE NIQLSRSWLR QQLRRSLYSQ
QIIILDCPLD QHSHISLQDW VEDLQLGFEQ GQCIIAAASS PDNPQQFLQI LHSNLQANQE
QPNLSAAAWI NQLQLSSPLP LHIWLSGTKG VIEIIPASTA AKGKQPNAIV DLGICPYRGL
QAFQEEDIQY FYGRETLTQQ LIADLETKSF MAVVGASGSG KSSVVQAGLI AQLRRGQQLP
GSQQWWMKSL RPGENPLVSL SHCLVDSGTA KEKAYQQMQI EGMLYQGAQG FAHWLHHRSE
PMVVLVLDQF EELFTLAASE DRQRFIDTVL GALELSPDKF KLILTLRADF IAPCLEIPTL
AKLLQQSSVL LPPCLTQEEY RRIIIHPAEK VGLTVDPELV EVLLQELHNS PGDLPLLEFV
LEQLWEYRDK GVITLQAYQQ YLGGIKGALE RKAQGVYDTL DPEAQECTRW IFLSLTQLGE
GTEDTRRRLL KSELIVKKYS VALVERTLQV LTAAKLVVVN GDWEEAGGKR QGAGGRGQGE
NILLTTPSVT IEVAHEVIIR HWSTLRWWLE ENRSRLRSHR QIEQSAALWQ QNNQQPDFLL
QGVRLAEAEE IYLNYTDELS WDVQHFIEAC LHERRRKQHQ EQSRLRQAQR AVSIISTLGL
TAFGLAVFAY QQTQNAQIKE IQALNSLSEN FLLSHKQLEA LITSVQAGKE VQNISLGIPA
DTRTQTATTL QQAVYSTQER NRLLHNAWVT SVSYSPDGEV IASGSVDNTI HLWRRDGKLL
TTLTGHNDGV NSVSFSPDGE IIASGSADST IKLWQRNGKL ITTLKGHDQG VKSVSFSPNG
EIIASGGSDN TINLWSRAGK LLLSLNGHSQ GVNSVKFSPE GDTIASASDD GTIRLWSLDG
RPLITIPSHT KQVLSISFSP DGQTIASAGA DNTVKLWSRN GTLLKTLEGH NEAVWQVIFS
PDGQLIATAS ADKTITLWSR DGNILGTFAG HNHEVNSLSF SPDGNTLASG SDDNTVRLWT
VNRTLPKTFY GHKGSVSYVK FSNDGQKITS LSTDSTMKIW SLDGKLLQTL SSPLPDVTSV
SFTPDNNIVA LASPDHTIHL YNRDGILLRS LPGHNHWITS LSFSPDNQIL ASGSADKTIK
LWSVNGRLLK TLSGHNGWVT DIKFSADGKN IVSASADKTI KIWSLDGKLI RTLQGHSASV
WSVNFSPDGQ TLASTSQDET IKLWNLDGEL IYTLRGHGDV VYNLSFSPDS KTIASASDDG
TIKLWNVTHG TLLKTFQGHR GGVRSVSFSP DGKILASGGH DTTIKVWNLE GIELQTLNLD
ELLNRACDRL HNYLTTNPNI TTEEYQLCFG D
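
The gene and protein sequences above can be checked against each other by translating the DNA. The sketch below is illustrative only: it builds the codon table from the usual compact TCAG-ordered string, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The `translate` helper is a hypothetical name, and only the first and last stretches of the gene sequence are used as input.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments shared by the
# standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 36 bases of the gene sequence, as printed above.
first_block = "ATGTCTCCAC TCGGTGTTGC TACCAGTCAT"
last_block = "ACTACAGAAG AGTATCAGCT TTGTTTTGGA GATTAG"

print(translate(first_block))  # MSPLGVATSH
print(translate(last_block))   # TTEEYQLCFGD
```

Both results match the corresponding ends of the protein sequence; the last block is in frame because the 5100 bases preceding it are a multiple of three.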