Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_4521 |
Symbol | |
ID | 3680183 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 5666172 |
End bp | 5668901 |
Gene Length | 2730 bp |
Protein Length | 909 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 637719877 |
Product | transposase |
Protein accession | YP_325014 |
Protein GI | 75910718 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | [TIGR01766] transposase, IS605 OrfB family, central region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGTATG GCTGCCAACA AATACTATTA AACCCTGATA ATGATCTTCA TGCAATCTTA GAGTTTTTGT GTGGTGAAGC CACAAAACTC TCTAATTGCG GGACTTACTA TGCGCGTCAA CTTTACTTTA AGACGGGTAA AATTCCTAGC AAATTTGATT TAAATAATGA GCTATCTAAC AATATTCATT TTGCGGCAAT GTATTCTCAA GCAGCACAAC AATGTTTGAT GGGTGTAGCG GAGTCATTCA AATCATTCAT GGGACTGCTA AAAGGGATAA AAAATAGTAC TGTAACGCAA AAACCAAAAC TTCCAGGATA TCGAGATGGC GGGTTAAGTT TGGTCACATA TCCGGCTCAA GCTATAAAAC TGAAACCACA AGGTTTACGT TTTCCGTTGG GTAGCAAGGT TAAAGCATGG TTTGGAATAG CCGAATTTTA CTTATCTATG CCCTCGAATC TTGACCATAA GCAAATTAGA GAGTATCGGA TTCTGCCTAG AAATGGTAAA TTCTATCTTG AACTTGTTTA CAAACTTCCG ACCATTAAAT CTGATGTGGA TTTCGGTAAA TGTCTTGGTG TAGATCCAGG ACTCAATAAC TGGTTAACCT GTGTTAGCAA TATTGGCACA TCTTTAATTG TAGATGGACT GCACCTCAAG AGCTTGAATC AATGGTATAA CAAACGAGTC TCAGTTCTTA ACGAAAACCA GCCGCAAGGC TTCTGGTCTA AGCAGTTAGC TGCTATCACT GAAAAGCGAA ATCGACAAGT TCGAGATGCA GTTAATAAAA CTGCTCGAAT AATATTAAAT CACTGCCTTG AAAATCGTAT TGGAACAATC GTTTTTGGGT GGAATGAAGG TCAGCGTCAA AACATTAATC TTGGCAATAA GACTAATCAA ACATTTGTAC AAATTCCTAC TGCTAGGTTG AAAGATAGAA TCGCTCAACT CTGCGAACAG TACGGACTGA GGTTTGAGGA AACTGAAGAA AGCTACACAA GTAAGGCAAG TTTTTTAGAC TCTGATTTGC TACCTACATT CGGCGAAAAA CCTGAAGGGT GGCAACCATC TGGGAAAAGA GTAAAACGCG GTCTGTACGC TTGTGCAAGT GGTTTAAAAA TAAATGCCGA TGCTAACGGT GCAGCAAAGA TAGGATCAAT ACGAGAAACT CTGCTGGAAC TCAAACTCAT GTCCCTAACT GAAGAAATTC TCTCCCAACT ACCAGGAGAC GTTCTAGGAA ATTTACGCCG TGCTGATGAT GTCCTCAAAT CTATTCGAGA AAACACTGCA CCAACACCCT CAGTAGTTAA AGAAAGCCCA GCCACCTTAG AGACTGTAAA CTGGGATGTA ATTATTTGTG GTGGTACATT AGGCATTTTA ATTGGTTGTG CCTTAGCTGT ACGGGGATTG CGGGTGGCGC TACTTGAGAG AGGTACTTTG CAGGGACGGG AACAAGAGTG GAATATCTCC CGTAAAGAGT TAGAAGTCTT TGTGGAGTTG AATCTGCTGA CACCAGAGGA GTTAAAGAAA GCGATCGCCA CTGAATATAA TCCAGCCAGA GTCCAATTTA AAGATGGTGC AGAAGTTTGG GTAAAGGATG TCTTAAATAT TGGCGTAGAT CCAGTTTATT TACTAGCTAC CTTAAAACAG AGATTTTTAG ATGCTGGTGG TCAGTTATTT GAACATACAC CTTTTAGTGA AGTCGTCATT CATCCAGATG GGGTAATGGT CAATCAGCAA TTTACAGCCA AGTTATTGAT AGATGCAATG GGACACCTTT CTCCCATTAG CAAACAAGCA CGCCAAGGCA AAAAACCAGA TGCACTTTGT CTGGTGGTGG GTAGTTGCGC TCAAGGTTTT AGTGAAAATT CTGCCGGTGA TTTAATTTTA TCTTTTACAT CTTTGCAAAA CCAATGTCAG TATTTTTGGG AAGCCTTTCC TGCTAGAGAT GGTAGAACAA CATATTTATT CACCTACATG GATGCTCATC CCCAACGCTT GAGCTTAGAA GACTTGTTTG GAGAATACTT GGGTCTCTTA CCAGAATACC AAGGTGTAGA ATTACAGCAG TTGAAATTTC AAAGAGCCTT GTTTGGGTTT TTTCCGAGCG ATCGCCAAAG TCCATTAAAA ACACCCTGGA ACCGCATCCT ACCAGTAGGA GACAGTAGCG GTAATCAATC ACCCCTAAGT TTTGGCGGTT TTGGCGCAAT GCTACGTCAC CTGCAACGTT TAACATTGGG TACCCAGGAA GCCTTGCAAA CTGAGCAATT ATCAGCCACA GCACTAGCAT TACTGCAACC ATATCAACCA AGCCTCAGTG TTACTTGGTT ATTTCAAAAA GCCATGAGCG TTGGTGTTAA TCAAAATATT GCTCCAGAGC AAATTAACCA ACTACTATCA ACGGTGTTTC AAGAAATGGC ACAACTCGGA ACACCCGTAT TAAAGCCCTT TTTACAAGAT ATAGTCCAAT TTTCAGCACT GACACAAACA CTAGCAAAGA CTGGCCTATC TCATCCAGTA TTAGTTGCTA AAATAATTCC CCAAGTAGGT TTAGTCAATC TATTAGATTG GCTAGTGCAT TACACCAACT TAGGCATTTA TACTGCGCTG TTTGCACTAA GTCCAACCCT AGAAATATGG ATTAAGAATT TTCCACCTAC GCAACAATAC TATTGGCATC GTTTAATGGA TGCTTGGAAA TTCGGTTCTG GCGGTGATTA TAACGCTTAA
|
Protein sequence | MPYGCQQILL NPDNDLHAIL EFLCGEATKL SNCGTYYARQ LYFKTGKIPS KFDLNNELSN NIHFAAMYSQ AAQQCLMGVA ESFKSFMGLL KGIKNSTVTQ KPKLPGYRDG GLSLVTYPAQ AIKLKPQGLR FPLGSKVKAW FGIAEFYLSM PSNLDHKQIR EYRILPRNGK FYLELVYKLP TIKSDVDFGK CLGVDPGLNN WLTCVSNIGT SLIVDGLHLK SLNQWYNKRV SVLNENQPQG FWSKQLAAIT EKRNRQVRDA VNKTARIILN HCLENRIGTI VFGWNEGQRQ NINLGNKTNQ TFVQIPTARL KDRIAQLCEQ YGLRFEETEE SYTSKASFLD SDLLPTFGEK PEGWQPSGKR VKRGLYACAS GLKINADANG AAKIGSIRET LLELKLMSLT EEILSQLPGD VLGNLRRADD VLKSIRENTA PTPSVVKESP ATLETVNWDV IICGGTLGIL IGCALAVRGL RVALLERGTL QGREQEWNIS RKELEVFVEL NLLTPEELKK AIATEYNPAR VQFKDGAEVW VKDVLNIGVD PVYLLATLKQ RFLDAGGQLF EHTPFSEVVI HPDGVMVNQQ FTAKLLIDAM GHLSPISKQA RQGKKPDALC LVVGSCAQGF SENSAGDLIL SFTSLQNQCQ YFWEAFPARD GRTTYLFTYM DAHPQRLSLE DLFGEYLGLL PEYQGVELQQ LKFQRALFGF FPSDRQSPLK TPWNRILPVG DSSGNQSPLS FGGFGAMLRH LQRLTLGTQE ALQTEQLSAT ALALLQPYQP SLSVTWLFQK AMSVGVNQNI APEQINQLLS TVFQEMAQLG TPVLKPFLQD IVQFSALTQT LAKTGLSHPV LVAKIIPQVG LVNLLDWLVH YTNLGIYTAL FALSPTLEIW IKNFPPTQQY YWHRLMDAWK FGSGGDYNA
|
| |