Gene Ava_2971 details


Gene Information

Locus tag: Ava_2971
Symbol:
ID: 3681264
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 3682727
End bp: 3686830
Gene Length: 4104 bp
Protein Length: 1367 aa
Translation table: 11
GC content: 49%
IMG OID: 637718318
Product: WD-40 repeat-containing protein
Protein accession: YP_323477
Protein GI: 75909181
COG category: [R] General function prediction only
COG ID: [COG2319] FOG: WD40 repeat
TIGRFAM ID:
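
The coordinates and lengths listed above can be cross-checked with a few lines of Python. This is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the terminal stop codon. The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information table for Ava_2971.
start_bp, end_bp = 3682727, 3686830
length_bp = gene_length(start_bp, end_bp)

print(length_bp)                  # 4104
print(protein_length(length_bp))  # 1367
```

Both results agree with the Gene Length (4104 bp) and Protein Length (1367 aa) fields of this record.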


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGTATC AGTATCAAGT TGGCGGTAGC CTTCCCGCAG ATGCGCCAAC TTATGTCAAA 
CGACAAGCTG ATGAAGACTT ATATACAGGC TTGAAGGCTG GACAATTTTG TTATGTCCTC
AACTCCCGCC AAATGGGGAA ATCTAGCTTG CGGGTGCAGG TGATGGGGCG ATTGCAGGCG
GAGGGGTTTG CTTGTGCGGC TGTGGATATT ACCGCTATTG GGACGGCTGA GATTACGCCG
GAACAATGGT ATGCAGGGGT GATTGATACC TTGGTAGGAT ACTTTAATTT ATATACTGAT
TTCGATTTAG AAACTTGGTG GAATAACAAC GGTTTACTTT CACCTGTGCA GCGTTTTAGT
AAGTTTATTG AAACGGTTTT ATTGCCAAGA ATTACCGAAA ATATAGTTAT TTTTATTGAT
GAAATTGATA GTGTTTTAAG CCTTGATTTT AATCTCGATG ATTTTTTTGC CGTGATTCGG
GATTGTTATA ACCGAAGGGC GGATCATCCT GAATATCATC GCATCACATT TGCCTTAATT
GGGGTGTCTA CGCCCTCAGA CTTGATTCAA GATAAAGGAC GCACACCTTT TAATATTGGT
AGGGCAATTG ATTTAACAGG TTTTGAGTTA GCCGAAGCTG AACCATTAGC CCAAGGTTTA
GCCGCGTTAG GTAATCCCCA AGAAATCATA GCGGCGGTGT TGGCGTGGAC AGGAGGACAG
CCATTTTTAA CCCAAAAAGT TTGTAATTTA TTAATCGCAG ATTACCTTGT AGAGACGCGA
TTCATCGCGT CTGGTGAAAA CAAAGAGAAA TCATCAATTG AAAATTTAGT TGCAACGGTA
GTCAAAAACC GGATTATTGA AAATTGGGAA GGACAAGACG AACCAGAACA TTTAAAGACA
ATTCGTGACA GAATCATGCG GAGTGGGGAA CAGCGCACCG GGCGGCTGTT GGGGTTATAT
CAGCAGATTT TGCAGCAAAG TGAGCTTGTT GCTGATGATA GTTATGACCA GATGATGCTG
CGGTTGACTG GGTTAGTAGT GCGGCGGGAT GGCAAATTGC GAATTTATAA CCGCATCTAT
GCAGAGGTGT TTCAGCAAGA GTGGTGTGAG AATATTTTGG CTGGGTTGCG TCCATACTCT
GATACGTTTA ATGCTTGGGT GGCTTCTAAT TATCAAGATG AATCGCGCCT GTTACGTGGA
CAAGCATTAC AAGAGGCTTT GGCTTGGCGC ATTGGGAAAA ATTTAACTGA TGTTGATGAC
CGATTTTTAG ATGCTAGTCA GGAATTACAA AAGCGAGAAG TTGAAAAAAG TTTAGCACTG
GCACGCAAAG AACAAGAAAT TTTAACAGCA GCTAATAAAA AAGCTCGGCA AAGAGTTTTT
ATTGCTTCGG TGCTGTTGAT TATTTCTGTT GTGGCGGCGG TAGGTTTGGG AGTGTTGGCG
GGGCGAAGTA ATAAACAATT GGCTGATGCG AGAACCGAAC GGGACAAAAT TGATAGAGAG
AAACAACAGA AAGAACGAGA ACTCGCAACG GCACAGCAGC GAGTTGTTGA TGCTAATAAA
AAGGTGATTG ATGCTAATAA AAATCTCCAG GATGCGACGG CTAATTTAAA ACAGCAACAG
CTAACTGCCA AGCAACAGCT AAATGCAACA AATCAACAAC TGAAACAAGC ACAAGATAAA
GAAAAGCAGG CGCGGGGACA AGTAGAAAAG GCACAAAACG ACTTGCGACA AGCGAGAGAA
CAACAACGAC AGGCTTTGGT GGGGTTAAAA ACTGCGGAAG CACAGCAAAA ACGGGCGCAA
GATAACCTGA AAAAGACAGA AGCAGAACGA GAAATTGCCT TGACAGGTAC ACGGCTGGAA
CGGTCAGGAG TGGCCAATAT TAACCGCTTT GAGTTTAATC AAATAGGTGC TTTGCTGGCA
GCAATGCGGG ATGGTAGAGA GTTAAAAAGT TTAATAGACA AGCAGGGAAT CAAGCAACTT
AAGGACTACC CGGCGGCTAG CCCAGTGTTG GCTTTACAGA CAATTTTAGA TAACGTGCGG
GGTATGACAG TCATGGCTGG GCATGAGAAT TGGGTCAACA GTGCCACCTT TAGTCCAGAT
GGGCAGCGCA TCCTCACTGC ATCATCTGAC AAAACAGCGC GTTTGTGGGA CTTACAGGGT
CGGCAAATCG CTAAGTTCCA GGGGCATGAG AGTTCGGTCA ACAGTGCCAC CTTTAGTCCA
GATGGGCAGC GCATCCTCAC TGCATCATCT GACAAAACAG CGCGTTTGTG GGACTTACAG
GGTCGGCAAA TCGCTAAGTT CCAGGGGCAT GAGAGTTCGG TCATCAGTGC CACCTTTAGC
CCGGATGGGC AGCGCATCCT CACCCTTTCC GGTGATAGGA CAACGCGGTT GTGGGACTTA
CAGGGTCGGC AAATCGCTGA GTTACAGGGG CATGAGGGTT GGGTCAGAAG TGCCACTTTT
AGTCCAGATG GGCAGCGCAT CCTCACCGCC TCCGTTGACG AGACAGCGCG GTTGTGGGAC
TTACAGGGTC GGCAAATCGC TAAGTTCCAG GGGCATAAGA GTTGGCTCTT CAGTGCCACC
TTTAGCCCGG ATGGGCAGCG CATCCTCACT GCATCATCTG ACAAAACAGC GCGGTTATGG
GACTTACAGG GTCGTCAAAT CGCTAAGTTC CAGGGGCATG AGAATTCAGT CATCAGTGCT
ACATTTAGCC CAGATGGACA GCGCATCCTC ACCCTCTCCG TTGACAAGAC AGCGCGGTTA
TGGGACTTAC AGGGTCGGCA AATCGCTGAG TTGCAGGGGC ATGAGGATTG GGTCAACAGT
GCCACATTTA GCCCAGATGG GCAGCGCATC CTCACTGCAT CATCTGACAA GACAGCGCGG
TTGTGGGACT TACAGGGTCG GCAAATCGCT GAGTTGCAGG GGCATGAGGA TTGGGTCAAC
AGTGCCACAT TTAGCCCGGA TGGGCAGCGC ATACTCACCG CCTCTAGAGA CGAGACAGCG
CGGTTGTGGA ACTTACAGGG TTGGCAAATC GCTAAATTCC AGGGGCATGA GAATGTGGTC
AGCAGTGCCA CATTTAGCCC GGATGGACAG CGCATCCTCA CTGCCTCACC TGACAAGACA
GCGCGGTTGT GGGACTTACA GGGTCGGCAA ATTGCTGAGT TGCAGGGGCA TGAGAATGTG
GTCAGCAGTG CCACATTTAG CCCGGATGGA CAGCGCATCC TCACTGCCTC ACCTGACAAG
ACAGCGCGGT TGTGGGACTT ACAGGGTCGG CAAATTGCTG AGTTGCAGGG GCATAAGGGT
TGGCTCTTCA GTGCCATCTT TAGCCCTGAT GGGCAGCGCA TCCTCACCGC CTCCGATGAC
AAAACAGCGC GGCTGTGGGA CTTACAGGGT CGACAAATCG CTGAGTTGGG GCATAAGGGT
TGGCTCTTCA GTGCCACCTT TAGCCCGGAT GGGCAGCGCA TCCTCACCGC TTCAAGTGAC
AGCACAGCGC GGTTGTGGAA CTTACAGGGC AGGGAAATTG CCAAGTTCCA GGGGCATAAG
AATTTGGTCA TCAGTGCCAG TTTTAGCCCG GATGGGCAGC GCATCCTCAC TGCATCATCT
GACAAGACAG CGCGGCTGTG GGAATTACAG GGCAGGGAAA TTGCCAAGTT CCAGGGGCAT
GAGGGTGATG TCATAACTGC CATTTTTAGC CCGGATGGGC AGCGCATCCT CACCGCCTCT
AGAGACAAGA TAGCGCGGTT GTGGGACTTA CAGGGTCGGG AAATTGCCAA GTTCCAAGGG
CATGAGGATT GGGTCAACAG TGCCATTTTT AGCCCGGATG GGCAGCGCAT CCTCACCGCC
TCTAGAGACA AGACAGCGCG GTTGTGGGAC TTACAGGGTC GGGAAATTGC CAAGTTCCAA
GGGCATGAGG ATTGGGTCAA CAGTGCCACC TTTAGCCCGG ATGGGCAGCG CATCCTCACC
GCCTCTAGAG ACAAGACAGC GCGGCTGTGG CAGGTGGAAA GTTTAGAGCA ATTACTGGCA
CGGGGTTGTG GGTGGTTGCG TAACTATTTA ATTTATGCGC CAAATTTGAG TGAGAGTGAT
AAGCAGGTTT GTAAAAAGGA GTGA
 
Protein sequence
MQYQYQVGGS LPADAPTYVK RQADEDLYTG LKAGQFCYVL NSRQMGKSSL RVQVMGRLQA 
EGFACAAVDI TAIGTAEITP EQWYAGVIDT LVGYFNLYTD FDLETWWNNN GLLSPVQRFS
KFIETVLLPR ITENIVIFID EIDSVLSLDF NLDDFFAVIR DCYNRRADHP EYHRITFALI
GVSTPSDLIQ DKGRTPFNIG RAIDLTGFEL AEAEPLAQGL AALGNPQEII AAVLAWTGGQ
PFLTQKVCNL LIADYLVETR FIASGENKEK SSIENLVATV VKNRIIENWE GQDEPEHLKT
IRDRIMRSGE QRTGRLLGLY QQILQQSELV ADDSYDQMML RLTGLVVRRD GKLRIYNRIY
AEVFQQEWCE NILAGLRPYS DTFNAWVASN YQDESRLLRG QALQEALAWR IGKNLTDVDD
RFLDASQELQ KREVEKSLAL ARKEQEILTA ANKKARQRVF IASVLLIISV VAAVGLGVLA
GRSNKQLADA RTERDKIDRE KQQKERELAT AQQRVVDANK KVIDANKNLQ DATANLKQQQ
LTAKQQLNAT NQQLKQAQDK EKQARGQVEK AQNDLRQARE QQRQALVGLK TAEAQQKRAQ
DNLKKTEAER EIALTGTRLE RSGVANINRF EFNQIGALLA AMRDGRELKS LIDKQGIKQL
KDYPAASPVL ALQTILDNVR GMTVMAGHEN WVNSATFSPD GQRILTASSD KTARLWDLQG
RQIAKFQGHE SSVNSATFSP DGQRILTASS DKTARLWDLQ GRQIAKFQGH ESSVISATFS
PDGQRILTLS GDRTTRLWDL QGRQIAELQG HEGWVRSATF SPDGQRILTA SVDETARLWD
LQGRQIAKFQ GHKSWLFSAT FSPDGQRILT ASSDKTARLW DLQGRQIAKF QGHENSVISA
TFSPDGQRIL TLSVDKTARL WDLQGRQIAE LQGHEDWVNS ATFSPDGQRI LTASSDKTAR
LWDLQGRQIA ELQGHEDWVN SATFSPDGQR ILTASRDETA RLWNLQGWQI AKFQGHENVV
SSATFSPDGQ RILTASPDKT ARLWDLQGRQ IAELQGHENV VSSATFSPDG QRILTASPDK
TARLWDLQGR QIAELQGHKG WLFSAIFSPD GQRILTASDD KTARLWDLQG RQIAELGHKG
WLFSATFSPD GQRILTASSD STARLWNLQG REIAKFQGHK NLVISASFSP DGQRILTASS
DKTARLWELQ GREIAKFQGH EGDVITAIFS PDGQRILTAS RDKIARLWDL QGREIAKFQG
HEDWVNSAIF SPDGQRILTA SRDKTARLWD LQGREIAKFQ GHEDWVNSAT FSPDGQRILT
ASRDKTARLW QVESLEQLLA RGCGWLRNYL IYAPNLSESD KQVCKKE