Gene Ava_2620 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_2620 
Symbol 
ID3681903 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp3241015 
End bp3244413 
Gene Length3399 bp 
Protein Length1132 aa 
Translation table11 
GC content46% 
IMG OID637717966 
Productphycobilisome protein 
Protein accessionYP_323129 
Protein GI75908833 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00272718 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGTTA AGGCGAGTGG AGGAAGCTCG GTTGCGCGCC CGCAACTATA TCAAACCCTA 
GCTGTGGCAA CAATCACCCA AGCGGAACAG CAAGACCGCT TTTTAGGCAG GGGTGAACTA
GATGAACTAG CAAGCTATTT TGCATCTGGT GCAAAACGTC TAGAAATTGC CCAACTTCTC
ACAGAAAATT CCGAGATTAT TGTTTCTCGT GCTGCTAACC GGATTTTTGT TGGTGGTTCG
CCAATGGCTT TCTTGGAAAA GCCGAGAGAA CCAGAACTGG CAATGGCTGC TGTTGGTGGT
GGTGGAGATG TCAGAGAGAG CATGAAGTTG GGAACTGTCA CCTATGTGGA AACCCGTGGT
GGATTCCTAG AAAACTTGCG CTCTATCTTT AATACATCTC CCAGTGGTCC AACTCCTCCA
GGGTTTAGAC CAATCAACAT TGCTCGTTAC GGCCCAAGCA ACATGGCCAA GAGCTTGCGG
GACTTGTCCT GGTTCTTGCG CTATGCTACT TATGCGATCG TGGCTGGCGA CCCTAACATC
ATTGTGGTGA ACACAAGAGG TTTGCGGGAA ATTATCGAAA ATGCTTGCTC TGGTGAAGCA
ACCATTGTTG CTTTGCAGGA AATCAAAGCT GCATCACTTT CTTATTTCCG TAAAGATCCA
GAAGCCACAG AGATTGTGTC TCAATACATG GATGTTTTGA TCACAGAATT CAAAGCACCC
ACACCATCTA ATAAGCTGCG TCAACGTCCC TCTGGTGACC AACAAGGCTT ACAACTACCT
CAAATTTACT TCAGTGCGGC TGAAAGAAGA CCCAAGTTTG TGATGAAACC GGGGTTATCA
GCGACTGAGA AAAATGAAGT AGTAAAAGCA GCTTATAGAC AAATCTTTGA GCGCGATATT
ACTCGTGCTT ACAGCTTGTC AATCTCTGAC CTAGAATCCA AAGTTAAGAA CGGCGACATC
TCTATGAAGG AGTTTGTCCG TCGTTTAGCA AAATCTCCTC TTTACCAAAA ACAGTTTTAC
CAACCTTTTA TTAACAGCCG CGTTATCGAA CTAGCTTTCC GTCACATTTT GGGACGGGGG
CCAAGTAGCC GTGAAGAAGT TCAAAAATAT TTCTCAATCA TTTCTAACGG CGGTCTACCA
GCTTTAGTAG ATGCTTTGGT TGATTCGGCA GAATATGGGG ACTACTTTGG AGAAGAGACA
GTACCTTACC TACGTGGTTT AGGTCAAGAA GCTCAAGAAT GTCGTAACTG GGGACCACAG
CAAGACCTGT TTAACTACAG TGCGCCTTTC CGTAAAGTAC CTCAGTTTAT TACCACATTT
GCGGCGTACG ATCGCCCACT ACCAGATCAA CACCCATACG GTTCTGGTAA TGACCCATTA
GAAATTCAGT TTGGGGCAAT TTTCCCGAAA GAAACCCGCA ACCCCAGCAC CAGTCCCGCA
CCTTTTGGTA AGGACACCAG ACGGATCTTG ATTCACCAAG GCCCTGGAAT TAACAACCAA
GTTAGTAACC CCAGCGCACG AGGTATAGCT CCTGGTTCTC TTGGGCCTAA AGTGTTCAAG
TTAGATCAAT TACCTGGAAC TATCGGTAGA AAAGCAGCTA AGGGTGCAAG TGTCAAGTTC
TCCGAAAGCT CAACCCAAGC AGTAATTAAA GCTACTTACT TGCAAGTTTT CGGTCGGGAT
GTGTATGAAG GTCAACGGCT GAAAGTCCAA GAAATTAAGC TGGAAAACGG CGAAATTTCT
GTAAGAGAGT TTGTCAGAGC TTTGGCTAAA TCGGATCTAT TCCGTAAGCT TTACTGGACT
CCTTTCTATG TTTGTAAGGC GATCGAATAT ATCCACCGTC GCTTATTAGG TCGTCCTACC
TACGGTCGTC AAGAAAACAA CAAGTACTTC GATATCGCCT CTAAGAAGGG CTTATATGCT
GTAGTTGATG CCATTCTGGA CAGCCTAGAG TATACCGAAA CCTTCGGCGA AGATACAGTT
CCTTACGAAC GCTATCTAAC TCCGGCTGGT GTAGCACTCA GACAGCTACG TGTTGGTACT
ATCCGGGAAG ATGTGGCGAA TGTTGAAAAA CAAGAAACAC CACGTTTTGT AGAACTGGGT
ACAGTCAAGG AAAATCGGAC TCAACCAGAT ATCGATTTCC GCATCAACCA AGGTGTTACC
AAGCAGCGTG AACAAACCAA GGTGTTCAAG CGGGTAGCGG CTAACAACGA TAAAACTGCG
ACCCAAACCT TGATTAGTGC GGCTTATCGG CAAATTTTTG AGCGTGATAT TGCACCATAC
ATTGCTCAGA ATGAATTTTC AGGCTGGGAA AGCAAACTGG GTAACGGTGA AATCACTGTA
AAAGAATTTA TTGAAGGTTT GGGTTACTCT AACCTCTACC TGAAGGAGTT CTACACACCA
TACCCCAACA CCAAAGTAAT CGAGTTGGGA ACCAAGCACT TCCTCGGTCG CGCACCAATT
GACCAAGCAG AAATCCGCAA GTATAACCAA ATTTTGGCTA CCCAAGGGAT TCGGGCTTTT
ATTAACGCTT TGGTAAATAG CCAGGAGTAC AACGAGGTAT TTGGTGAAGA TACAGTACCT
TACAGACGCT TCCCGACCTT ACCTGCGGCG AACTTCCCCA ATACCCAAAA GCTGTATAAC
CAACTCACCA AACAAAATAA TGATGTGGTT ATCCCCAGCT TTAAGCCTGT ACAAGCGCGG
ATACAGTCTG ATAAGACACC AATTTTAGCG AAGGCGATCG CAGATTTAGC AGCCCAAGCC
AAACAAATCG ACAAGAGCAA GCCTCTGTTC ATTGAATTGG GTCGCTCCTA CAACGACGGT
CGCGGACAGT CTGTAGAAGT GGGTGTGGGT ACAACTCGTC GTAAACCTGC ACGTATTTAT
CGCCTGACCG ATGGTATCGG CCAAGCAGAA AAACAACTGG TAATTAACGC GATTTACCGT
CAGGTATTGG ATGTATTTAG CGGACAAGTA CCAGATTATT ACCGCCGCAC AGAACTAGAT
AGCAAACTGC GGAATGGGGA AATTTCTGTA CGGGAATTCG TGCGGGAAAT AGCTAGTTCC
GAAATCTATC GCAAGCGTTT CTACACACCT TATCCCAACA CCAAGGTAAT TGAATTCTTA
TTCCGTCACC TGTTGGGACG TGCGCCAGCA ACTCAAGGCG AAATTCGTCA ATATAACAAG
CTGCTAGCTG ATCACGGTTT GCGTGCTGCG GTAGAAGCAA TTGTGGATAG TCCAGAATAC
AGCCGCTACT TCGGTGAAGA TGTGGTTCCT TACCCACGCT TCCCATCACT ACCCGCAGGT
AACTACCTCG GTAGCGTCCA AGCAGCAGCT GACTTGGTAA AACAATCTTG GTCTAGCTTA
TCGCCATCCA CCCTTACAGG TAGACCAGGC GATCGCTAA
 
Protein sequence
MSVKASGGSS VARPQLYQTL AVATITQAEQ QDRFLGRGEL DELASYFASG AKRLEIAQLL 
TENSEIIVSR AANRIFVGGS PMAFLEKPRE PELAMAAVGG GGDVRESMKL GTVTYVETRG
GFLENLRSIF NTSPSGPTPP GFRPINIARY GPSNMAKSLR DLSWFLRYAT YAIVAGDPNI
IVVNTRGLRE IIENACSGEA TIVALQEIKA ASLSYFRKDP EATEIVSQYM DVLITEFKAP
TPSNKLRQRP SGDQQGLQLP QIYFSAAERR PKFVMKPGLS ATEKNEVVKA AYRQIFERDI
TRAYSLSISD LESKVKNGDI SMKEFVRRLA KSPLYQKQFY QPFINSRVIE LAFRHILGRG
PSSREEVQKY FSIISNGGLP ALVDALVDSA EYGDYFGEET VPYLRGLGQE AQECRNWGPQ
QDLFNYSAPF RKVPQFITTF AAYDRPLPDQ HPYGSGNDPL EIQFGAIFPK ETRNPSTSPA
PFGKDTRRIL IHQGPGINNQ VSNPSARGIA PGSLGPKVFK LDQLPGTIGR KAAKGASVKF
SESSTQAVIK ATYLQVFGRD VYEGQRLKVQ EIKLENGEIS VREFVRALAK SDLFRKLYWT
PFYVCKAIEY IHRRLLGRPT YGRQENNKYF DIASKKGLYA VVDAILDSLE YTETFGEDTV
PYERYLTPAG VALRQLRVGT IREDVANVEK QETPRFVELG TVKENRTQPD IDFRINQGVT
KQREQTKVFK RVAANNDKTA TQTLISAAYR QIFERDIAPY IAQNEFSGWE SKLGNGEITV
KEFIEGLGYS NLYLKEFYTP YPNTKVIELG TKHFLGRAPI DQAEIRKYNQ ILATQGIRAF
INALVNSQEY NEVFGEDTVP YRRFPTLPAA NFPNTQKLYN QLTKQNNDVV IPSFKPVQAR
IQSDKTPILA KAIADLAAQA KQIDKSKPLF IELGRSYNDG RGQSVEVGVG TTRRKPARIY
RLTDGIGQAE KQLVINAIYR QVLDVFSGQV PDYYRRTELD SKLRNGEISV REFVREIASS
EIYRKRFYTP YPNTKVIEFL FRHLLGRAPA TQGEIRQYNK LLADHGLRAA VEAIVDSPEY
SRYFGEDVVP YPRFPSLPAG NYLGSVQAAA DLVKQSWSSL SPSTLTGRPG DR