Gene Ava_2891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_2891 
Symbol 
ID3681396 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp3590932 
End bp3596076 
Gene Length5145 bp 
Protein Length1714 aa 
Translation table11 
GC content43% 
IMG OID637718236 
ProductWD-40 repeat-containing protein 
Protein accessionYP_323397 
Protein GI75909101 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID[TIGR01543] phage prohead protease, HK97 family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00238858 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGTTGAAG GACACCCAGC AGTAGATATT AAAGTAGCGA ATGAGCGTGC TTTCACAAGT 
TTATGGAGAG CGATCGCTCT TTCTCATGGT AATTTTTCTG TAGCTTTGGT TTACTGTAAT
TATCGAGTCT TGCAGGAGAA AATATTGCAA CGACTAGATG AAATGTTTGC TGAGAATCCC
GTTCAAAAAG TTGTCTTACC TCCTAATACC AGAAGTCTTT ATACAACTCT CCATTTAAAT
TTACTCCCCC AACAGCAACA GCCATCAGCC TTGATGGTTT TGGGTTTAGA ATCAGTCGAG
GAAATTGACG ATTTACTCAG AGCCATTAAT CATATTCGAG ATGAATTTCC CAAACGTCAT
TCATTTCCGA TGATTTTTTG GGTGAATGAA GAAGTCTTGC AAAAAGTCAT CCGTTTAGCG
CCGGATTTTG CCAGTTGGGC GGCGACACCG ATTCGGTTTG AGATGACAAC ACCAGAGTTG
TTGCAATTTC TCCAACAGGA AACCGACTCT TTATTTGCCA GAGTCTTACC CAAGGATATG
GGACAACCCC AACAGCCTCA ATTCGGTGAT GATTACTCTA CCTTAGAGCA GGTATGGGAA
CACAGTAATG AACTGCACTA CGCTATTGCG GAACTCCATG AGAGGGGTAT TACCTTAGAA
CCAGAATTAA ACGCTAGTTT AAAATTTGTG TTTGGCTTAG ATGATTATGT CAGCGATCGC
ATTCATCATG CCTTAAATCA TTTTCAACAA AGTCTGCAAA TTTGGCAACG ATTGGGGGAT
TGGGGATTGG GGACTGGGGA ACAGAGTATA TCTTTTCCTG TTCGTCCTAC TCCTTTATCT
TCCCCAATTC TTAGACAGGG AGTTTTGCAA TTATATATAG GGTTATGTTA TTGCCGTTTA
GCGGAGCAGA ATCAGCTAGA TAATCGTCGT CATTGGGAAA CAGCAAAATT CTATTTTCAA
GAATGTTTGG AGATTTTGCA GGTAGCTGGA AGGCCAGATA TAGCATCGGA ATTTATTGGT
CAACTGGCGG AAGTTTTGGA ACATCTGCAA GGATGGGATG AACTACAAAC CGTTGCAGAG
ACAGCACTGG AGTTACATCA TACCTACGGT AGTCAAATTC AACTGGCTTG CGATTATGGT
TTCCTGGCCC AGGTGGCGTT ACAGCAGTCG CGCTGGGTGC AGGCGAGTAT TTTGGCACAC
GTATCACTAT TAAAATTAAC TGAGGCGCAA AACCATAACG ATAGTGACCG TCATCATTGC
CTATTTCCCT TACTGTTGGC ACAAATATAT TATCTAGTTT TAGCCAAAGC CCAACAAAAC
TTAGGTGAAC CAGCAGTTGC ACAGGAATAT TTAGATAAAG CAGCCAAAGA ATTACCAGCC
GCTTTAGAAA ATAGCACCCA TCAATATGAC GCTCATCGTT ATATTAGAAT GCTGCGGACA
TTGCGATCGC TCTATTTTGA AGCAGGTAGA TATCTAGAAG CTTACCGCAT CCGCCAAAAA
CGCCGCTCTG TAGAACAGCA ATATGGTTTT CGGGCTTTTA TTGGTGCGGG AAGATTGCAA
CCCCAAAGAC AAGCAACCAA CCCCGCTTTG ATGTCACCCT CTGGTAGCAG TACTGTAGCC
TTAGAAATCG CCGCTTCCGG TCGAGAACAA GATATAAATA ACTTAATCGG CAGAATTAGC
CGGGCTGACC AAAAATTGAT CGTGATTCAC GGCCCTTCTG GAGTGGGGAA GAGTTCCACT
GTGACAGCTG GTTTAGTCCC AGCGTTACAA AATCGAGCGA TTGGTGATCA AATTGCCATG
CCCGTAGTAT TGCAAGTATA CACCGATTGG GTACGGGAAT TAGGAAAGGC TTTAACTGAG
GCTGTGGCGC ATATTTCCGG AGATGTCAGC ATTACGCCGG AGATTTTATC CACACCGACA
CCAGCGATGG ATGTGGGCTA TGCTCGATCG ATGGCGATCG CTGATATATT AGGACAATTA
CGCCAAAATG CCAATAACCA CCTGATCACG GTGCTGATTT TCGACCAGTT TGAAGAATTC
TTCTTTGGTT ATAGCGATGG CGTTCCGCCC ACCGTAGGTG ATCGTCAACA AAAGAAAGAA
TTTGATCAAT TTCTCAGTCA ATGTCTGAAT ATATCCTTTG TCAAAATCAT TTTTTCCATC
AGAGAAGATT ATTTACACCG CTTATTAGAA TTTAAACATC TTTCCTACCT GGAAGCAGTT
AATAATAACA TCCTTGATAA ACAAATCCGT TATCAATTAA ATAATTTTTC CCCAGAATAT
GCTAAAGTCA TCATTCAAAA ATTAACAGCA CGTTCTCAAG TCAATTTGGA ACCAGCTTTA
ATTGATGCAG TAGTGGAAGA TTTATCAACT GAATTAGGGG AAGTCCGCCC CATTGAATTG
CAGGTAGTTG GCGCACAAAT CCAAGATGAG CGCATTAATA CCTTAAAACA ATATCAGCAA
TATCGACCAA ATAAACTCAT AGAAAGATAT ATCAAAGAAC TGATCAAAGA GTGTGGCCCT
GACAATGAGC GTGCAGCTTT ACTTGTTTTA TATTTGTTGA CAGATGAAAG CAACAAACGA
CCTTTTAAAA CCCGTGCCGA ATTAGCTATT GAATTGGCAG AATTAGAAGA CCCTGAAAAA
TTAGATTTAG TCTTAGATAT TTTAGTTAAC TCTGGCTTAG TTGTCTTGTT TCCTGATATT
CCCGAACGTT ATCAACTTAT TCATGATTAT TTAGTCGATT TGATTCGCTA TCTCCAACAA
CAGGAATCCA GCTTACAAGC ACAACTTGAC CAACTACGCC ATAAAGTCCA ACAAAGTCAA
ACGGAAATTG CCCGCCTTAA AAGCGAACTC AGCCAAAAAA AGCAATCTAA ACTCACAGAT
ACTCATCTCC AACAAGGTTT GGATTTAGTC ACAGAACTAC GGGAATTACG CAAGCGAGAA
GAACTAACTC AACTAGAAAT TGAACAATTA CGTGCTGAAC TCAAAGAAAA GGAATTAACC
GCCCAACTAG CCGAAAGCCA GAAACAACAA AGGCTCAGCC AAGCCAAGCT CAATCGTTCA
CTAAAAATTG CCCTGGCTGC CTCATGTCTC GCCATCTTGG GGTTAAGTGT TTCTATTATT
ACAGCAGTAG ATAGCGAAAT TAAAACCCTC AGCGTCTCCA GCGAAGCCCT GTTTGCTTCC
CAAAAAGGTC TTGATGCGGT GAAAGAAGGT GTCAAAGCCG CCAGAAAACT GCAACGGGCT
ATTTGGGTAG ACCCTTACAC CAGAGAACAG GTGCAAACAG CACTCTATCA AGCCGTTGTG
GGAGTGAGGG AATACAACCG TTTAGACGGG CATACCGCAG GGGTAAACAG TGCGGTCTTT
AGCCCCGATG GTTCCCTGAT TGCTTCCGCC AGTGCCGATA ATACTATCAA TCTCTGGCGT
AATGATGGCA GCTTAATCAA CACCTTATCC AAACATACTA ATGTAGTAAA TAGTGTCAAC
TTCAGCCCTG ATGGCTTGTT AATTGCCTCT GCTAGCCAAG ACAAAACCGT CAAACTATGG
AACCGAGTCG GTCAACTAGT AACCACCTTA CAAGGGCATA GAGATGTAGT AAATAATGCT
AGTTTTAGCC CCGATGGTTC CCTGATTGCT TCCGCCAGTA GCGACAAGAC GGTAAAACTA
TGGAGTCGAG AAGGGAAATT ACTCAAAACT TTATCAGGCC ATAATGATGC AGTTTTGGGC
ATAGCTTGGA CACCAGATGG TCAAACTCTC GCTTCTGTAG GTGCTGACAA AAACATCAAC
TTTTGGAGTC GGGACGGTCA ACCACTCAAA ACCTGGAAAG GACATGATGA TGCAATCTTG
GGTGTAGCTT GGTCGCCCAA TGGTGAAATA CTCGCCACCG CCAGTTTTGA TAAGACTATT
AAACTGTGGA ATCGCCAAGG TAATTTATTA AAAACCCTAT CAGGACACAC GGCGGGAGTC
ACAGCCGTCA CCTTTAGCCC TAACGGTCAA ACTATCGCTT CCGCCAGTAT CGATGCAACC
CTCAAACTTT GGAGTCCTGG CGGTCTGCTG TTGGGAACCT TAAAGGGTCA TAACAGTTGG
GTAAATAGTG TCAGTTTTAG CCCTGACGGT CGCACCTTCG CCTCTGGTAG CCGAGATAAA
ACGGTGACAC TCTGGCGCTG GGATGAAGTG CTATTACGAA ACCCCAACGG CGATGGTAAC
GACTGGGTGA CAAGTATTAG CTTTAGTCCT GATGGTGAGA CTTTGGCGGC GGCGAGTCGA
GACCAAACTG TAAAAATTTT ATCTCGGCAA GGGAAACTAT TAAACATCTT CAAAGGGCAT
ACAGGGTCAA TTTGGGGCGT AGCATGGTCG CCAAATCAGC AGATGATAGC CTCAGCCAGT
AAAGATAAAA CAGTGAAATT ATGGAATCGA GATGGCAAAT TATTACATAC CTTACAGGGT
CATCAAGATG CAGTCTTAGC TGTAGCCTGG TCGTCGGATA GTCAGGTAAT AGCTTCTGCC
AGTAAAGATA AAATGGTGAA AATCTGGAGT CAAGACGGGC AACTACTCCA CATCTTACAA
GGTCACACCG ATGCAGTCAA TTGGGTCAGC TTTAGCCCTG ATGGTAAGAT ATTAGCCTCC
GTCAGCGATG ATACCACCGT GAAATTATGG AATCGAGACG GGCAACTACT CCACACCCTC
AAGGAACATA GCCGCCGAGT GAACGGGGTA GCATGGTCGC CAGATGGACA AATTGTAGCT
TCTGCCAGCA TCGACGGGAC AGTTAAACTG TGGAATCGAG ACGGTAGCTT GTTAAGAAAT
CTACCGGGTG ACGGTGATAG TTTTATTAGT GTTAGTTTTA GCCCCGACGG TAAGATGTTA
GCGGCTAATA GTGATGATAA AATCAGGCTT TGGAACCAGA AAGGCACATT GCTGATGGTT
TTAAAAGGCG ACAAAGATGA ATTAACCAGT GTTACCTTCA GTCCCGACAG TCAAATTTTG
GCAGCAGGCG GCGGCAATGG TAAAGTGATT TTCCAGAATT TAGCTGATAT TAAACTCGAA
AATTTACTAG TACGCGGGTG TGATTTACTA CAAGATTATT TAAAAACTAA TCTGGATGTG
ACAAAAAGCG ATCGCACCTT ATGCCCCAAT ACTAACAACA GGTGA
 
Protein sequence
MVEGHPAVDI KVANERAFTS LWRAIALSHG NFSVALVYCN YRVLQEKILQ RLDEMFAENP 
VQKVVLPPNT RSLYTTLHLN LLPQQQQPSA LMVLGLESVE EIDDLLRAIN HIRDEFPKRH
SFPMIFWVNE EVLQKVIRLA PDFASWAATP IRFEMTTPEL LQFLQQETDS LFARVLPKDM
GQPQQPQFGD DYSTLEQVWE HSNELHYAIA ELHERGITLE PELNASLKFV FGLDDYVSDR
IHHALNHFQQ SLQIWQRLGD WGLGTGEQSI SFPVRPTPLS SPILRQGVLQ LYIGLCYCRL
AEQNQLDNRR HWETAKFYFQ ECLEILQVAG RPDIASEFIG QLAEVLEHLQ GWDELQTVAE
TALELHHTYG SQIQLACDYG FLAQVALQQS RWVQASILAH VSLLKLTEAQ NHNDSDRHHC
LFPLLLAQIY YLVLAKAQQN LGEPAVAQEY LDKAAKELPA ALENSTHQYD AHRYIRMLRT
LRSLYFEAGR YLEAYRIRQK RRSVEQQYGF RAFIGAGRLQ PQRQATNPAL MSPSGSSTVA
LEIAASGREQ DINNLIGRIS RADQKLIVIH GPSGVGKSST VTAGLVPALQ NRAIGDQIAM
PVVLQVYTDW VRELGKALTE AVAHISGDVS ITPEILSTPT PAMDVGYARS MAIADILGQL
RQNANNHLIT VLIFDQFEEF FFGYSDGVPP TVGDRQQKKE FDQFLSQCLN ISFVKIIFSI
REDYLHRLLE FKHLSYLEAV NNNILDKQIR YQLNNFSPEY AKVIIQKLTA RSQVNLEPAL
IDAVVEDLST ELGEVRPIEL QVVGAQIQDE RINTLKQYQQ YRPNKLIERY IKELIKECGP
DNERAALLVL YLLTDESNKR PFKTRAELAI ELAELEDPEK LDLVLDILVN SGLVVLFPDI
PERYQLIHDY LVDLIRYLQQ QESSLQAQLD QLRHKVQQSQ TEIARLKSEL SQKKQSKLTD
THLQQGLDLV TELRELRKRE ELTQLEIEQL RAELKEKELT AQLAESQKQQ RLSQAKLNRS
LKIALAASCL AILGLSVSII TAVDSEIKTL SVSSEALFAS QKGLDAVKEG VKAARKLQRA
IWVDPYTREQ VQTALYQAVV GVREYNRLDG HTAGVNSAVF SPDGSLIASA SADNTINLWR
NDGSLINTLS KHTNVVNSVN FSPDGLLIAS ASQDKTVKLW NRVGQLVTTL QGHRDVVNNA
SFSPDGSLIA SASSDKTVKL WSREGKLLKT LSGHNDAVLG IAWTPDGQTL ASVGADKNIN
FWSRDGQPLK TWKGHDDAIL GVAWSPNGEI LATASFDKTI KLWNRQGNLL KTLSGHTAGV
TAVTFSPNGQ TIASASIDAT LKLWSPGGLL LGTLKGHNSW VNSVSFSPDG RTFASGSRDK
TVTLWRWDEV LLRNPNGDGN DWVTSISFSP DGETLAAASR DQTVKILSRQ GKLLNIFKGH
TGSIWGVAWS PNQQMIASAS KDKTVKLWNR DGKLLHTLQG HQDAVLAVAW SSDSQVIASA
SKDKMVKIWS QDGQLLHILQ GHTDAVNWVS FSPDGKILAS VSDDTTVKLW NRDGQLLHTL
KEHSRRVNGV AWSPDGQIVA SASIDGTVKL WNRDGSLLRN LPGDGDSFIS VSFSPDGKML
AANSDDKIRL WNQKGTLLMV LKGDKDELTS VTFSPDSQIL AAGGGNGKVI FQNLADIKLE
NLLVRGCDLL QDYLKTNLDV TKSDRTLCPN TNNR