Gene Ava_0039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_0039 
Symbol 
ID3683548 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp40066 
End bp45126 
Gene Length5061 bp 
Protein Length1686 aa 
Translation table11 
GC content43% 
IMG OID637715366 
Productpeptidase C14, caspase catalytic subunit p20 
Protein accessionYP_320560 
Protein GI75906264 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0781722 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCACCAA TTGGTCTTTC AACCAGCCGT ACAACTCATA CCAAACAAAC CAACATCCCT 
AAGTTGTGGT TAATTCTGGT GGGAGTAAAT CAATACCAGG ACGAACACCT ACCCAACCTG
AATTACTCAG CAATAGATTG TCAGGGTTTA TCTGAGGCTT TAACTGAGGC GGCCTCCCAA
TTTACCCAAA AGATTGTCAA CATTTACCAC GATTTTGCAC CACAATCACC ATCTTTAGCA
AATGTCCGTC ATAGACTACA AGATATAACG TCTACTGTAT CCCCTATTGA TACGATTTTA
TTTTATTTTT CTGGTCATGG GGTGGTAGAT CCCAAGACAC AAGAAGTATT TTTGTGTTTA
GCAGATACGC AAAAATCAAA CTTGCAAAAT ACAGGTTTAG CTTTACAAGA AATATTGCAA
CTTTTGGGTA ACAGTGGGGT ACAGAATCAG CTAGTGTGGT TAGATGCTTG TCATAGTGGG
GGGATGTCTC TTAGGGGAGT AACACTGCAA CTAGTGGAGC TACTACAACA AAGTGCCGCT
AAGAGTAAGG GGTTTTATGC TTTACTTTCT TGTGATAATA ATCAGCAATC TTGGGAATTT
CCCGAATTGG GGCATGGGGT ATTTACATAT TATTTGATGC GGGGTTTGCA AGGTGAGGCG
GTAGATAGTC AAGGCTTGAT TTATTTGGAT GGTTTATATC GTTATGTATA TCACCAAACC
TTGCAATATA TTGATAAAAC TAATCAACAA TTACGGCTAA TTAATCAGCA GAAACGGGGT
AAAGGTGATA CGCAACTTTA TAACGAATAT CCCCTGCAAA CTCCTAAAAG AATTGTGGAA
GGAGTGGGGG AATTAATTTT AGGGAAAAGA TTAAAGAAAA TTGCCTCTGG GGTGCATCAT
GGTTTGGGGA TGGTGGTGGA GGGGTTATCA AATAGTAAAA TTACATTAGA TATTAGTAAA
CTTCTGGGTA GTACTGGAGA TTTTGCGGTT GAGTATTTGC CAGCTACTAA AACATCGGCT
GAGGATATTA AAGCAGCGAT CGCTCGTCAT TGGCAACAAC CCCAAGCAGA ATTAACTCTG
TTATATCTGC GGGGACGCAT TGAGGAAAGC GAAGCTGGGG AATGGCTACG ACTGGGAGAA
GATATCTGCA TCAAACGTTC CTGGTTAAAA CAACAGTTAC GCCAATGCAC TAGCCAACAG
GTAATTATCT TGGATTGTCC GTTAGGGACT GCATCGTTAG GGGATTGGAT AGAAGATTTA
CAAATTGAAT CTCATCACGG ACAATGTTTA ATTGCTTGCG CCTCACCTCC AGAAGCACCA
GAAAACTTTG CCCAAAAATT CCTCGATACC TTGATGATAT CTGCTCAAGG AAATGGTTTG
TCTGCGGCTG GGGCGATCGC TCAATTACAA TTATCTTTAG CGGATAGTAA AACCCCTCTC
TATGTTTGGC TATCAGGGAC ACAGGGAATT ATCGAAATTT TGACTAACAA CACACACAAA
AGCCAACAGA CAAGCGGGTT GGATTTAGGG GTGTGTCCTT ACATGGGTTT AAATGCTTTT
GCGGAAGCAG ACGCAGCCTA TTTTTATGGG AGAGAAACTG TCACCCAGCA GTTAATTCAT
CATCTGCGGG ATAACTCATT TCTCGCTGTC ATCGGTGCTT CGGGTAGTGG GAAGTCTTCC
GTAGTGCAAG CTGGACTCAT TCCCCAACTG CGACAGGGGA AACACATCCC CAATAGTGAA
CAGTGGGGGA TTAAAACTAT CCGTCCTGGT GTCAATCCTG TGGAGGCTTT AGCCAGGAAG
TTGGGGGAAT GGAGAGAAAC CCATCTTGTG ATTGAGGGAA TTTTACATCA AGGAGTGGAA
AGTTTTGTTT ACTGGCTGCG TAGCCTTCCC CATAGGGTGA CGGTGTTGGT AATAGACCAG
TTTGAGGAAT TATTTACCCT TGCACCATCA CCAGATAGGG AGTTATTTCT AGAATTGTTA
TTGGGGGCTG TACAGTATGC AGGCGATCGC TTTAAATTAA TTATTACTTT AAGGGCTGAT
TTTATTGCTC CTTGTTTGGA AGTACCAGCT TTAGCTGAAG CACTACAGGT AGCTAGTGTG
TTAGTTCCCC CAAAACTGAG TTTGGATGAT TATCGGCGGG TGATTCTCAA CCCAGCACAG
CAGGTAGGGT TGCAGGTGGA AGGGGAACTG GTGGAAGTGC TGTTACGGGA GTTAAATCAA
TCCGTGGGGG ATTTACCTTT ACTGGAATTT GTTTTAGAAC AGTTATGGCA ACAACGAACG
GCTGGTAAAT TAACTCTACA AAGCTATCAA GAACAACTAG GGGGAATTAA AGGCGCATTA
GAGCGATCGT GTCAAGGGGT TTATGAAAGT TTACCACCAC AATTACAAGA ATGTGCCAAG
TGGATTTTTC TTTCATTAAC TCAGTTAGGG GAAGGTACAG AAGATACCAG AAGACGCATA
TATAAGTCAG ATTTAATAGT TAAAAAATAT CCGGTTGGGT TAGTAGAACA AACCCTCAAT
GCTTTGACTA CTGCCAAATT AGTAGTAATT AACTTAGAAA CAGACATAGA AGCACAAGGT
AAAAGTTCCT CCCCTGCTTC CCCAGCCTCC CCTGCTTCCC CTACTCCCTT TGTTACAGTA
GAGGTCGCCC ACGAAATTTT AATACGCCAT TGGTCAACTT TGCGCTGGTG GTTAGAAGAA
AACCGCGATC GCTTGCGTAA ACAAAGACAA ATTAATCATG CTTGTCAGTT GTGGCAACAA
AGCGGCAAAC AAGCAGACTT TTTATTACAA GGTGCTAGGT TAGCAGAAGC CGAAGACATT
TATATTCACT GGACTGATGA ACTAGGGGCG GATGTGCAAG AATTTATCGG CGCTTGTTTA
GCAGAACGCA AGCATCAACA ATTACAAGCA AAAAATCGCC TCAAACAAGC ACAAAGGGCT
GTAGTCGCCT TGAGTGTTTT AGGTATTGCA TCTGTCAGTT TTGGGGGTTT AGCTTATTGG
CAAGGTAGGG AAGCCCAATT TAGGGAAATT GCAGCGTTAA ATTCTTCATC CCAGGCAAAT
CTGTTATCTC ATCAACAATT AGCAGCACTT ATCGCCAGTC TCAAAGCTGC ACAACAGGTA
AATAATGTCA TAGCAGTTCC CAATAATCTC AAATTAGCAA CTGTCACTAC CTTACAACAA
GCCCTGTTGG GGATGCAGGA ACGGAACAGG CTAGAAGGAC ATAAGGACGG CGTGATTAGT
ATTAGCATCA GTGGAGATGG TCAAACTATC GCCTCTGGTG GCTTAGATAA GACTATTAAA
CTTTGGAGTC GAGACGGTCG GTTATTTAGA ACTCTCAACG GACATGAAGA CGCTGTTTAT
AGTGTAAGTT TCTCTCCCGA CGGTCAAACC ATTGCTTCCG GGGGGAGTGA CAAAACCATT
AAACTTTGGC AAACCAGTGA TGGAACCCTA CTCAAAACCA TCACTGGTCA TGAGCAAACA
GTCAATAATG TTAATTTTAG TCCCGATGGT AAAACTCTCG CCTCTGCCAG TAGTGATCAC
AGCATCAAGT TGTGGGATAG TACATCTGGT CAACTCTTGA TGACTCTCAA TGGTCATAGC
GCTGGAGTTA TCAGTGTGCG TTTCAGTCCT GATGGTCAGA CCATCGCTTC CGCTAGCGAA
GATAAAACCG TTAAACTGTG GCATCGCCAA GATGGGAAAT TATTAAAAAC CCTCAATGGA
CATCAAGATT GGGTAAATAG CCTGAGTTTT AGCCCTGATG GTAAAACTCT CGCTTCCGCT
AGTGCTGACA AAACCATCAA ACTGTGGCGC ATCGCTGATG GTAAATTGGT CAAAACCCTA
AAAGGTCACA ATGATTCAGT CTGGGATGTT AACTTTAGCC AAGATGGTAA AGCGATCGCC
TCTGCGAGTA GAGATAACAC TATCAAACTG TGGAACCGTC ACGGCATCGA ACTAGAAACC
TTTACAGGTC ATAGCGGTGG TGTGTATGCT GTAAATTTCC TGCCTGATGG TAAAACTCTC
GCTTCCGCTA GTTTAGACAA CACCATCAGA CTTTGGCAGC GTCCCTTAAT ATCTCCCTTA
GAAGTTCTTG CCGGCAATAG CGGCGTGTAT GCGCTCAGTT TCAGCCCTGA CGGTAGTATC
ATTGCTACAG CAGGTGCAGA TGGCAAGATT CAGCTTTGGC ACAGTCAAGA CGGTAGTTTA
CTAAAAACCC TACCAGGGAA CAAAGCAATT TATGGTATTA GTTTTACACC CCAAGGTGAT
TTAATCGCTA GCGCCAACGC CGATAAAACT GTGAAGATTT GGCGTGTTAG AGATGGTCAG
CTTTTAAAAA CACTCATAGG ACATGATAAC GAAGTCAACA AAGTAAATTT TAGCCCAGAT
GGTAAAGCGA TCGCTTCCGC CAGCCGAGAC AACACAATTA AACTTTGGAA TGTGAGCGAT
GGTAAGTTAA AACAAATCCT CAAAGGTCAT ACAGAGGAAG TATTTTGGGT TAGTTTTAGC
CCCGATGGTA AAATCATCGC CTCTGCTAGT GCGGACAAAA CTATCCGACT CTGGGACAGT
GTTAGCGGCA ACTTAATTAA AAGTCTTCCA GCCCATAATG ACTTAGTATA CAGCGTCAAC
TTCAGTCCTG ACGGTAGTAT GCTTGCTTCA ACCAGCGCCG ACAAAACCGT CAAACTCTGG
AGGAGTCAAG ACGGTCATTT ACTACATACT TTCTCAGGCC ATAGTGATGT AGTTTATAGT
AGCAGCTTCT CTCCCGATGG TCGTTACATT GCATCAGCCA GCGAAGATAA AACAGTCAAA
ATTTGGCAAC TCGACGGTCA CCTGTTAACC ACCCTACCCC AGCATCAAGC CGGAGTCATG
AGTGCAATTT TTAGCCCAGA TGGTAAAACT CTCATCTCCG GCAGTTTAGA CACTACCACC
AAAATTTGGC GTTTTGATAG CCAGCAAGCC CAAACTTCCC AGATAAATAC TTTAGTCATG
TCTGCTTGCA ACTGGCTACA GGATTACCTC AACACCAATC CCCATGTTAC GACCAACGAA
CAAAAACTCT GTCCTAGTTA A
 
Protein sequence
MPPIGLSTSR TTHTKQTNIP KLWLILVGVN QYQDEHLPNL NYSAIDCQGL SEALTEAASQ 
FTQKIVNIYH DFAPQSPSLA NVRHRLQDIT STVSPIDTIL FYFSGHGVVD PKTQEVFLCL
ADTQKSNLQN TGLALQEILQ LLGNSGVQNQ LVWLDACHSG GMSLRGVTLQ LVELLQQSAA
KSKGFYALLS CDNNQQSWEF PELGHGVFTY YLMRGLQGEA VDSQGLIYLD GLYRYVYHQT
LQYIDKTNQQ LRLINQQKRG KGDTQLYNEY PLQTPKRIVE GVGELILGKR LKKIASGVHH
GLGMVVEGLS NSKITLDISK LLGSTGDFAV EYLPATKTSA EDIKAAIARH WQQPQAELTL
LYLRGRIEES EAGEWLRLGE DICIKRSWLK QQLRQCTSQQ VIILDCPLGT ASLGDWIEDL
QIESHHGQCL IACASPPEAP ENFAQKFLDT LMISAQGNGL SAAGAIAQLQ LSLADSKTPL
YVWLSGTQGI IEILTNNTHK SQQTSGLDLG VCPYMGLNAF AEADAAYFYG RETVTQQLIH
HLRDNSFLAV IGASGSGKSS VVQAGLIPQL RQGKHIPNSE QWGIKTIRPG VNPVEALARK
LGEWRETHLV IEGILHQGVE SFVYWLRSLP HRVTVLVIDQ FEELFTLAPS PDRELFLELL
LGAVQYAGDR FKLIITLRAD FIAPCLEVPA LAEALQVASV LVPPKLSLDD YRRVILNPAQ
QVGLQVEGEL VEVLLRELNQ SVGDLPLLEF VLEQLWQQRT AGKLTLQSYQ EQLGGIKGAL
ERSCQGVYES LPPQLQECAK WIFLSLTQLG EGTEDTRRRI YKSDLIVKKY PVGLVEQTLN
ALTTAKLVVI NLETDIEAQG KSSSPASPAS PASPTPFVTV EVAHEILIRH WSTLRWWLEE
NRDRLRKQRQ INHACQLWQQ SGKQADFLLQ GARLAEAEDI YIHWTDELGA DVQEFIGACL
AERKHQQLQA KNRLKQAQRA VVALSVLGIA SVSFGGLAYW QGREAQFREI AALNSSSQAN
LLSHQQLAAL IASLKAAQQV NNVIAVPNNL KLATVTTLQQ ALLGMQERNR LEGHKDGVIS
ISISGDGQTI ASGGLDKTIK LWSRDGRLFR TLNGHEDAVY SVSFSPDGQT IASGGSDKTI
KLWQTSDGTL LKTITGHEQT VNNVNFSPDG KTLASASSDH SIKLWDSTSG QLLMTLNGHS
AGVISVRFSP DGQTIASASE DKTVKLWHRQ DGKLLKTLNG HQDWVNSLSF SPDGKTLASA
SADKTIKLWR IADGKLVKTL KGHNDSVWDV NFSQDGKAIA SASRDNTIKL WNRHGIELET
FTGHSGGVYA VNFLPDGKTL ASASLDNTIR LWQRPLISPL EVLAGNSGVY ALSFSPDGSI
IATAGADGKI QLWHSQDGSL LKTLPGNKAI YGISFTPQGD LIASANADKT VKIWRVRDGQ
LLKTLIGHDN EVNKVNFSPD GKAIASASRD NTIKLWNVSD GKLKQILKGH TEEVFWVSFS
PDGKIIASAS ADKTIRLWDS VSGNLIKSLP AHNDLVYSVN FSPDGSMLAS TSADKTVKLW
RSQDGHLLHT FSGHSDVVYS SSFSPDGRYI ASASEDKTVK IWQLDGHLLT TLPQHQAGVM
SAIFSPDGKT LISGSLDTTT KIWRFDSQQA QTSQINTLVM SACNWLQDYL NTNPHVTTNE
QKLCPS