Gene Ava_1841 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_1841 
Symbol 
ID3681832 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp2289469 
End bp2294454 
Gene Length4986 bp 
Protein Length1661 aa 
Translation table11 
GC content40% 
IMG OID637717181 
ProductWD-40 repeat-containing protein 
Protein accessionYP_322358 
Protein GI75908062 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID[TIGR01543] phage prohead protease, HK97 family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTAACT GTGAACTTAA AACGGCGATC GCCAATAATA ATGAGCATTC CTTACAAAGG 
TTAATTCGAG CAATTAATTT ATCTAAGGGA GAATTTTCTC TCATTTTAGT TCGTTGTAAC
TACCAACAGT TACGAGAAGA AATGCGGGAT AATCTTCGAG ATTTAAGTAA AGATATCAAT
ATTAGAGAAA TATATGTTCA ATCATCGATT AGTGCTTTAC ACACTACAAT CACATCGCAA
CTATTTTTAG ATAATCCCAG TGTTGCTAGC GATTGTTTAC CATCAGCCTT AATGGTTTTT
GGTTTAGAAT CAGTAATTGC TCTTGAAGAT TTATTAACTG GAATAAATCA GTCACGGGAT
ATTTACGCAG CTAGTTTTCC TTTCCCTTTA ATCTTGTGGT TACAAGATGA AGTGGCATCT
TCACTTTCTC GGTTAGCTCC TGACTTTAAA AGCTGGGCTG CAACCACTAT TAAATTCGAG
ATGCCCCAGG AAGATTTAAT TGCTTTAATT ACAAAACAGA CAGAATCTCT ATTTTCTAAA
GTTTTAGAAG TAGGCGCAGA AAAGTTTATC TCTAATGATG CCCTTGATTT AGCACCGAAA
TCCCAACATC GTCACGAGAT TGAATCAGCC CGTAATGATT TACTGCGTTC TTACAATATC
AAATTAGAAC CAGGATTAGA GGCTAGTCTG GAGTTTGTTT TAGGCAGAGA TAAGTATGCT
AATGATCAAA TTAATAGTGC TTTAGCCCAT TATCAAAGAA GCCTATCTCT ATGGCAGCAA
GAAGAAAAAA AGGAAAGTAT AGAAAAGAAA TATTTAGAAC TGAACTACCA ACAAAGACTT
TGGCAAGCAA TTATCTTATT CCACTTAGGT TTATGTTATC ACCGGATGGC AGATTTACAT
CAGAGTGCTA ATAGTGGCTA TTGGCGACAC GCACTATTGT GGTTTCAGCA ATGCTTAGAT
GTGTTGGAAG AGAAGAAAAG ACAAGATTTA GTAGCTAAAT TTATTTTACC TGCGTGTGAG
ATGCTTTACC GCTTACAGCT TTGGGAAGAT TTAAAGAAAT TAGCTCAAAA GTCTTTATAT
CTACATAAGA GTTATGGCAA TCAAGCCCAA ATAGCACAAG ATTATGGCTT TTTAGCAGCC
GTAGCAGGTG CAAATAACGA TTGGATACTA GCTCATGAAT TAGCTAATAC TGCATTAGAT
ATTCTGGAGA AAGTAACAGG AACTCGACAG CATGAGAGTT GGTATCTTTT GTTGTTAGCC
CGTTCACAAA GGCATTTAGG AGAATATGAA GAATCTATCA ATAACTTAGA ATGGGCAAGG
GTTGTTTGTG AGCTACAATA TGAGCCTTCT TTGTACTTGG AAATTTTAGA GGAATTGCGA
TCGCTCTATT TTATTGAGCG ACATGAATAT TTAGAAGCCT TTAAACTTAA GCAAGAAAAA
ATCCAAATTG AACATCAGTA TGGTTTACGT GCTTTTATTG GTGCAAGTCA ATTACAGGCA
CAACGTTACA AAATTAATTC AGTCTTAGAA CCCCAAAGTA TCCCCTTTAT TCCAGAGGAA
GTTGCCCAAG AAATATCTGC TTCTGGAAGA CAACAAGATG TCAATCGTTT AATTGAAAGA
ATTACTCGTG CTGACTATAA ACTCACAGTA GTTCATGGTC CATCAGGAGT TGGTAAAAGT
TCACTGCTCA AAGCTGGTTT ATTACCAGCT GTAAAGAATA AAGTGATTGG TGAGCGGATA
CCTCTACCTA TTGTGTTATC TGCTTATACT GATTGGACAA CCAGTTTGGG GCGAGGTTTT
TATAATGTGT TAGCACCATT AGATATCTCA AGTTCCCTGG AATTTACTCC CGCTATTCTG
CTAGAAAAGT TCCGCTCTGC AACTGCACGT AATCATACAA TTATTATTAT TTTTGATCAA
TTTGAAGAGT TCTTTTTCGT AAGTAGCTTT CAGCAAAAGT TAGAATTCTA TCAGTTTTTT
AGCGAATGTT TGAATATTGC ATATTTAAAA ATTATTATTT CCATCAGAGA AGATTATCTC
CATTACTTAT TAGATTTTGA GCGCTGGGAT AAAACACAGC CAGATAGTTT CCATGATTTA
GATGTCATTA ACAAAAATAT TTTAGATAAA GATATCCGCT ACTTTCTGGG GAAATTTTCC
CGTGAAGATG CCGTTGCTAT TATTCGTTAC TTGACCCAAA CATCTCAATA CGAAGTCAAG
GATGATTTAA TTAATGAACT AGTGCAGGAT TTAGCAGGAG AAACCGAAGA AGTCCACCCA
ATTGAATTAC AAATAGTAGG CGCGCAGCTA CAAGCAGAAA ATATCACGAA TCTCGAAAAA
TACAAACTTT GTGGAGGTAC GAAAAAACTT ATAGAACGCT GGCTAGAGGA AGTTATTCGA
GATTGTGGTC AAGAACATGA AGATTTTAGC TGGAAACTAT TATATGAATT AACTGACGAA
AAAGGCACAC GTCCCCTCAA AACTAAATCT GATTTAATGC TGGCTTTAGA AAAATATCTT
GATAATCAGT CAGATTTTGA TTCACGCTGG GAGTTGATTT TAGAGATTTT GGTTGGTTCA
GGTTTAATAT TGTGCTGGCG AGAAGAGTTA GGCGAACGCT ACCAGTTAGT CCATGATTAT
TTAGTTGAAC CAATTCGCCA AAGAAATGAT TATGGCATCA TTGCCGAATT AGAAAAAATC
AAATCGGAAA AAACTCAAGC AGAGGTAGCC CGAAAATTAT CTCAAGAGCA GCTAAATTCA
GTTTTGCGGC GTAGATTGCG AGAAGCACGA GCAGCAGGGG TACTGCTGGC AGTTATGGGC
GGCACGATCG CTTCTTTGTG GTGGCAAGCC GATATGCAGA AAAGAACCGC AGAGCTACAA
ACCATCCGTG CTGAAACCAG CGAGACCAAT TTGCAAATTA GTGCGATCGC CGCATCTAGC
GAAGCTCTAT TTTCCTCCAA CCAAGAATTT GATGCTTTAT TAGAAAGTTT GAGAGCCTGG
CAAAAACTTA AACAAGCCAA GGAGGTGCGA CCAGAAACCC GAATGCGCGT GGTTACAGCC
TTACAACAGG CAGTTTATGG AGTTACGGAG TTAAATCGCT TAGAGGGGCA TAGTGATATT
GTTTGGGGTG TAGCTTTTAG TCCTGATGGT CAATTATTAG CTTCCGGTAG CACAGACAGA
ACAATCAAAC TTTGGCGACC TGACGGGACA CTACTCCAAA CCCTTGAAGG TCATACTAGC
GCGGTTACCA GTGTCAGTTT TAGTCCTGAT GGTCAGACTA TAGCTTCAAC TAGTCTCGAC
CAAACAGTAC GAATTTGGCG GAAAAACCCA ACAACTGGTG AATTCGCCCC AGAGCCGGCC
CAAAGCCTCA GAAAACACAA AGATTGGGTT TATAGCGCCA ATTTTAGCCC TGATGGAGAA
TTGTTAGCTA CTGCCAGCAG GGATAGAACT ATTAAAATTT GGGATCGGGA TGGTAACTTA
ATCAAAACTC TTAAGGGTCA TCAAGGCTCT GTCAACTGGG TCAGTTTCAG TCCAGACAGT
CAATTTATTG CTTCAGCTAG TGAGGATAAG ACCGTCAAAA TTTGGCGCAG AGACGGTAGC
TTGGTGAAAA CTTTATCTGC ACATCAGGAA GGTGTAACTG TCGTCACTTT TAGCCCTGAT
GGTAAGCTAC TAGCTTCAGC CGACCGAGAC AATGTTATTC AACTATGGCA ATGGGATAGT
AGTAATCACA ATAACCCAGA AGTTGATATC TATAAAACCT TAAAGCAACA TACCAGCACG
GTATGGAGTC TCAGTTTCAG TTCTGACAGT AAACAGTTAG CCTCAGCCAG TGATGACAAT
ACTATCAACC TCTGGAGTCA TACAGGCAAC TTAATTAAGA CTTTTAAGGG ACACAGTGAC
GCTGTTGTCA GTGTAGCTTT CAGTCCAGAT ACTAAAATTC TGGCCTCTGG AAGTTATGAC
AAGAGCGTGA AATTGTGGAG TTTAGAAGCG CCTAGACTAC CGATTCTGCG GGGACATGAG
GATCGAGTTT TGAGTGTTGC TTGGAGTCCT GATGGTCAGG TATTGGCTTC TAGTAGTCGC
GATCGCACTG TAAAACTATG GCGAAGACAG TTGAATAAGG GCAGACTTGA TGCTCATCTT
TACAAAACCT TAGTGGGTCA CACCCAGATG GTTCATAGCG TCAGTATTGA CCCCAAAGGT
GAAATTTTAG CTTCTGCTAG TGAAGACAAA ACCGTTAAAC TTTGGCGACT GGATGGAACT
CTCCTGAAAA CTCTCTCTGG TCATAGTGAT AGCGTAGTCA GTGTTAGTTT CAGCCCTGAT
GGTCATTTAT TAGCATCAGC TAGTAGAGAT CACACAATTA AACTGTGGAA TCGTGATGGA
AGTTTGCTCA AAACTTTAGT GGGGCATGAA GCGCGAGTCA ATAGCGTCAG TTTTAGCCCT
GATGGAGAGG TTTTAGCTTC TGCTAGTGAC GATAAAACAA TCAAGCTTTG GCGGCCAGAT
GGGAGTTTAA TCAAAACCTT TGACCCTCAC GATAGCTGGG TATTAGGTGT CAGTTTTAGC
CCGACTGAAA AATTATTGGC TTCTGCTGGT TGGGACAACA CAGTCAGACT ATGGCGACAA
GATGGGACTT TGTTACAAAC CTTATTAAGA GGATTTAGCG ATAGCGTTAA TGCTGTGAGT
TTTAGTCCCA CTGGTGAAAT ACTTGCTGCT GCTAATTGGG ATAGTACAGT CAAATTGTGG
AGCCGTGAGG GCAAATTGAT TAAAACCCTC AACGGACACG AAGCTCCTGT GTTGAGCGTC
AGTTTTAGCC CCGACGGACA AACGTTAGCT TCAGCTAGCG ATGACAACAC AATTATTTTA
TGGAACTTAC ACCTTGATGA CCTGCTGACT CGTGGTTGTG GCTGGGTCAA TAATTATCTG
AAGCACAACA ACAATGTGGA TGAGCGCGAT CGCTTGCTTT GTGATGATGT CAGCGATCGT
ATCTAA
 
Protein sequence
MTNCELKTAI ANNNEHSLQR LIRAINLSKG EFSLILVRCN YQQLREEMRD NLRDLSKDIN 
IREIYVQSSI SALHTTITSQ LFLDNPSVAS DCLPSALMVF GLESVIALED LLTGINQSRD
IYAASFPFPL ILWLQDEVAS SLSRLAPDFK SWAATTIKFE MPQEDLIALI TKQTESLFSK
VLEVGAEKFI SNDALDLAPK SQHRHEIESA RNDLLRSYNI KLEPGLEASL EFVLGRDKYA
NDQINSALAH YQRSLSLWQQ EEKKESIEKK YLELNYQQRL WQAIILFHLG LCYHRMADLH
QSANSGYWRH ALLWFQQCLD VLEEKKRQDL VAKFILPACE MLYRLQLWED LKKLAQKSLY
LHKSYGNQAQ IAQDYGFLAA VAGANNDWIL AHELANTALD ILEKVTGTRQ HESWYLLLLA
RSQRHLGEYE ESINNLEWAR VVCELQYEPS LYLEILEELR SLYFIERHEY LEAFKLKQEK
IQIEHQYGLR AFIGASQLQA QRYKINSVLE PQSIPFIPEE VAQEISASGR QQDVNRLIER
ITRADYKLTV VHGPSGVGKS SLLKAGLLPA VKNKVIGERI PLPIVLSAYT DWTTSLGRGF
YNVLAPLDIS SSLEFTPAIL LEKFRSATAR NHTIIIIFDQ FEEFFFVSSF QQKLEFYQFF
SECLNIAYLK IIISIREDYL HYLLDFERWD KTQPDSFHDL DVINKNILDK DIRYFLGKFS
REDAVAIIRY LTQTSQYEVK DDLINELVQD LAGETEEVHP IELQIVGAQL QAENITNLEK
YKLCGGTKKL IERWLEEVIR DCGQEHEDFS WKLLYELTDE KGTRPLKTKS DLMLALEKYL
DNQSDFDSRW ELILEILVGS GLILCWREEL GERYQLVHDY LVEPIRQRND YGIIAELEKI
KSEKTQAEVA RKLSQEQLNS VLRRRLREAR AAGVLLAVMG GTIASLWWQA DMQKRTAELQ
TIRAETSETN LQISAIAASS EALFSSNQEF DALLESLRAW QKLKQAKEVR PETRMRVVTA
LQQAVYGVTE LNRLEGHSDI VWGVAFSPDG QLLASGSTDR TIKLWRPDGT LLQTLEGHTS
AVTSVSFSPD GQTIASTSLD QTVRIWRKNP TTGEFAPEPA QSLRKHKDWV YSANFSPDGE
LLATASRDRT IKIWDRDGNL IKTLKGHQGS VNWVSFSPDS QFIASASEDK TVKIWRRDGS
LVKTLSAHQE GVTVVTFSPD GKLLASADRD NVIQLWQWDS SNHNNPEVDI YKTLKQHTST
VWSLSFSSDS KQLASASDDN TINLWSHTGN LIKTFKGHSD AVVSVAFSPD TKILASGSYD
KSVKLWSLEA PRLPILRGHE DRVLSVAWSP DGQVLASSSR DRTVKLWRRQ LNKGRLDAHL
YKTLVGHTQM VHSVSIDPKG EILASASEDK TVKLWRLDGT LLKTLSGHSD SVVSVSFSPD
GHLLASASRD HTIKLWNRDG SLLKTLVGHE ARVNSVSFSP DGEVLASASD DKTIKLWRPD
GSLIKTFDPH DSWVLGVSFS PTEKLLASAG WDNTVRLWRQ DGTLLQTLLR GFSDSVNAVS
FSPTGEILAA ANWDSTVKLW SREGKLIKTL NGHEAPVLSV SFSPDGQTLA SASDDNTIIL
WNLHLDDLLT RGCGWVNNYL KHNNNVDERD RLLCDDVSDR I