Gene Information |
Locus tag | Ava_2620 |
Symbol | |
ID | 3681903 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 3241015 |
End bp | 3244413 |
Gene Length | 3399 bp |
Protein Length | 1132 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 637717966 |
Product | phycobilisome protein |
Protein accession | YP_323129 |
Protein GI | 75908833 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
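The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes 1-based, inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    # Values taken from the Start bp / End bp fields of this record.
    length = gene_length(3241015, 3244413)
    print(length, protein_length(length))  # 3399 1132
```

The result matches the Gene Length (3399 bp) and Protein Length (1132 aa) fields listed for Ava_2620.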
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00272718 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGTTA AGGCGAGTGG AGGAAGCTCG GTTGCGCGCC CGCAACTATA TCAAACCCTA GCTGTGGCAA CAATCACCCA AGCGGAACAG CAAGACCGCT TTTTAGGCAG GGGTGAACTA GATGAACTAG CAAGCTATTT TGCATCTGGT GCAAAACGTC TAGAAATTGC CCAACTTCTC ACAGAAAATT CCGAGATTAT TGTTTCTCGT GCTGCTAACC GGATTTTTGT TGGTGGTTCG CCAATGGCTT TCTTGGAAAA GCCGAGAGAA CCAGAACTGG CAATGGCTGC TGTTGGTGGT GGTGGAGATG TCAGAGAGAG CATGAAGTTG GGAACTGTCA CCTATGTGGA AACCCGTGGT GGATTCCTAG AAAACTTGCG CTCTATCTTT AATACATCTC CCAGTGGTCC AACTCCTCCA GGGTTTAGAC CAATCAACAT TGCTCGTTAC GGCCCAAGCA ACATGGCCAA GAGCTTGCGG GACTTGTCCT GGTTCTTGCG CTATGCTACT TATGCGATCG TGGCTGGCGA CCCTAACATC ATTGTGGTGA ACACAAGAGG TTTGCGGGAA ATTATCGAAA ATGCTTGCTC TGGTGAAGCA ACCATTGTTG CTTTGCAGGA AATCAAAGCT GCATCACTTT CTTATTTCCG TAAAGATCCA GAAGCCACAG AGATTGTGTC TCAATACATG GATGTTTTGA TCACAGAATT CAAAGCACCC ACACCATCTA ATAAGCTGCG TCAACGTCCC TCTGGTGACC AACAAGGCTT ACAACTACCT CAAATTTACT TCAGTGCGGC TGAAAGAAGA CCCAAGTTTG TGATGAAACC GGGGTTATCA GCGACTGAGA AAAATGAAGT AGTAAAAGCA GCTTATAGAC AAATCTTTGA GCGCGATATT ACTCGTGCTT ACAGCTTGTC AATCTCTGAC CTAGAATCCA AAGTTAAGAA CGGCGACATC TCTATGAAGG AGTTTGTCCG TCGTTTAGCA AAATCTCCTC TTTACCAAAA ACAGTTTTAC CAACCTTTTA TTAACAGCCG CGTTATCGAA CTAGCTTTCC GTCACATTTT GGGACGGGGG CCAAGTAGCC GTGAAGAAGT TCAAAAATAT TTCTCAATCA TTTCTAACGG CGGTCTACCA GCTTTAGTAG ATGCTTTGGT TGATTCGGCA GAATATGGGG ACTACTTTGG AGAAGAGACA GTACCTTACC TACGTGGTTT AGGTCAAGAA GCTCAAGAAT GTCGTAACTG GGGACCACAG CAAGACCTGT TTAACTACAG TGCGCCTTTC CGTAAAGTAC CTCAGTTTAT TACCACATTT GCGGCGTACG ATCGCCCACT ACCAGATCAA CACCCATACG GTTCTGGTAA TGACCCATTA GAAATTCAGT TTGGGGCAAT TTTCCCGAAA GAAACCCGCA ACCCCAGCAC CAGTCCCGCA CCTTTTGGTA AGGACACCAG ACGGATCTTG ATTCACCAAG GCCCTGGAAT TAACAACCAA GTTAGTAACC CCAGCGCACG AGGTATAGCT CCTGGTTCTC TTGGGCCTAA AGTGTTCAAG TTAGATCAAT TACCTGGAAC TATCGGTAGA AAAGCAGCTA AGGGTGCAAG TGTCAAGTTC TCCGAAAGCT CAACCCAAGC AGTAATTAAA GCTACTTACT TGCAAGTTTT CGGTCGGGAT GTGTATGAAG GTCAACGGCT GAAAGTCCAA GAAATTAAGC TGGAAAACGG CGAAATTTCT GTAAGAGAGT TTGTCAGAGC TTTGGCTAAA TCGGATCTAT TCCGTAAGCT TTACTGGACT 
CCTTTCTATG TTTGTAAGGC GATCGAATAT ATCCACCGTC GCTTATTAGG TCGTCCTACC TACGGTCGTC AAGAAAACAA CAAGTACTTC GATATCGCCT CTAAGAAGGG CTTATATGCT GTAGTTGATG CCATTCTGGA CAGCCTAGAG TATACCGAAA CCTTCGGCGA AGATACAGTT CCTTACGAAC GCTATCTAAC TCCGGCTGGT GTAGCACTCA GACAGCTACG TGTTGGTACT ATCCGGGAAG ATGTGGCGAA TGTTGAAAAA CAAGAAACAC CACGTTTTGT AGAACTGGGT ACAGTCAAGG AAAATCGGAC TCAACCAGAT ATCGATTTCC GCATCAACCA AGGTGTTACC AAGCAGCGTG AACAAACCAA GGTGTTCAAG CGGGTAGCGG CTAACAACGA TAAAACTGCG ACCCAAACCT TGATTAGTGC GGCTTATCGG CAAATTTTTG AGCGTGATAT TGCACCATAC ATTGCTCAGA ATGAATTTTC AGGCTGGGAA AGCAAACTGG GTAACGGTGA AATCACTGTA AAAGAATTTA TTGAAGGTTT GGGTTACTCT AACCTCTACC TGAAGGAGTT CTACACACCA TACCCCAACA CCAAAGTAAT CGAGTTGGGA ACCAAGCACT TCCTCGGTCG CGCACCAATT GACCAAGCAG AAATCCGCAA GTATAACCAA ATTTTGGCTA CCCAAGGGAT TCGGGCTTTT ATTAACGCTT TGGTAAATAG CCAGGAGTAC AACGAGGTAT TTGGTGAAGA TACAGTACCT TACAGACGCT TCCCGACCTT ACCTGCGGCG AACTTCCCCA ATACCCAAAA GCTGTATAAC CAACTCACCA AACAAAATAA TGATGTGGTT ATCCCCAGCT TTAAGCCTGT ACAAGCGCGG ATACAGTCTG ATAAGACACC AATTTTAGCG AAGGCGATCG CAGATTTAGC AGCCCAAGCC AAACAAATCG ACAAGAGCAA GCCTCTGTTC ATTGAATTGG GTCGCTCCTA CAACGACGGT CGCGGACAGT CTGTAGAAGT GGGTGTGGGT ACAACTCGTC GTAAACCTGC ACGTATTTAT CGCCTGACCG ATGGTATCGG CCAAGCAGAA AAACAACTGG TAATTAACGC GATTTACCGT CAGGTATTGG ATGTATTTAG CGGACAAGTA CCAGATTATT ACCGCCGCAC AGAACTAGAT AGCAAACTGC GGAATGGGGA AATTTCTGTA CGGGAATTCG TGCGGGAAAT AGCTAGTTCC GAAATCTATC GCAAGCGTTT CTACACACCT TATCCCAACA CCAAGGTAAT TGAATTCTTA TTCCGTCACC TGTTGGGACG TGCGCCAGCA ACTCAAGGCG AAATTCGTCA ATATAACAAG CTGCTAGCTG ATCACGGTTT GCGTGCTGCG GTAGAAGCAA TTGTGGATAG TCCAGAATAC AGCCGCTACT TCGGTGAAGA TGTGGTTCCT TACCCACGCT TCCCATCACT ACCCGCAGGT AACTACCTCG GTAGCGTCCA AGCAGCAGCT GACTTGGTAA AACAATCTTG GTCTAGCTTA TCGCCATCCA CCCTTACAGG TAGACCAGGC GATCGCTAA
|
Protein sequence | MSVKASGGSS VARPQLYQTL AVATITQAEQ QDRFLGRGEL DELASYFASG AKRLEIAQLL TENSEIIVSR AANRIFVGGS PMAFLEKPRE PELAMAAVGG GGDVRESMKL GTVTYVETRG GFLENLRSIF NTSPSGPTPP GFRPINIARY GPSNMAKSLR DLSWFLRYAT YAIVAGDPNI IVVNTRGLRE IIENACSGEA TIVALQEIKA ASLSYFRKDP EATEIVSQYM DVLITEFKAP TPSNKLRQRP SGDQQGLQLP QIYFSAAERR PKFVMKPGLS ATEKNEVVKA AYRQIFERDI TRAYSLSISD LESKVKNGDI SMKEFVRRLA KSPLYQKQFY QPFINSRVIE LAFRHILGRG PSSREEVQKY FSIISNGGLP ALVDALVDSA EYGDYFGEET VPYLRGLGQE AQECRNWGPQ QDLFNYSAPF RKVPQFITTF AAYDRPLPDQ HPYGSGNDPL EIQFGAIFPK ETRNPSTSPA PFGKDTRRIL IHQGPGINNQ VSNPSARGIA PGSLGPKVFK LDQLPGTIGR KAAKGASVKF SESSTQAVIK ATYLQVFGRD VYEGQRLKVQ EIKLENGEIS VREFVRALAK SDLFRKLYWT PFYVCKAIEY IHRRLLGRPT YGRQENNKYF DIASKKGLYA VVDAILDSLE YTETFGEDTV PYERYLTPAG VALRQLRVGT IREDVANVEK QETPRFVELG TVKENRTQPD IDFRINQGVT KQREQTKVFK RVAANNDKTA TQTLISAAYR QIFERDIAPY IAQNEFSGWE SKLGNGEITV KEFIEGLGYS NLYLKEFYTP YPNTKVIELG TKHFLGRAPI DQAEIRKYNQ ILATQGIRAF INALVNSQEY NEVFGEDTVP YRRFPTLPAA NFPNTQKLYN QLTKQNNDVV IPSFKPVQAR IQSDKTPILA KAIADLAAQA KQIDKSKPLF IELGRSYNDG RGQSVEVGVG TTRRKPARIY RLTDGIGQAE KQLVINAIYR QVLDVFSGQV PDYYRRTELD SKLRNGEISV REFVREIASS EIYRKRFYTP YPNTKVIEFL FRHLLGRAPA TQGEIRQYNK LLADHGLRAA VEAIVDSPEY SRYFGEDVVP YPRFPSLPAG NYLGSVQAAA DLVKQSWSSL SPSTLTGRPG DR
|
| |
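The sequences above are printed in space-separated blocks of ten characters, so they need to be normalized before any analysis. The following is a minimal sketch of that cleanup, with a GC-fraction helper and a basic start/stop codon check. The helper names are hypothetical, the stop codon set is the assumed one for translation table 11, and the demo string is only the first two blocks of the gene sequence, not the full gene.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumed stop codons for translation table 11


def normalize(seq: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a normalized nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Rough check: ATG start, length divisible by 3, recognized stop codon."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


if __name__ == "__main__":
    # First two ten-base blocks of the gene sequence in this record.
    prefix = normalize("ATGAGTGTTA AGGCGAGTGG")
    print(prefix, gc_fraction(prefix))
```

Applying `gc_fraction` to the full normalized gene sequence should be comparable to the GC content field in the record, allowing for rounding.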