Gene Ava_4521 details


Gene Information

Locus tag: Ava_4521
Symbol:
ID: 3680183
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 5666172
End bp: 5668901
Gene Length: 2730 bp
Protein Length: 909 aa
Translation table: 11
GC content: 41%
IMG OID: 637719877
Product: transposase
Protein accession: YP_325014
Protein GI: 75910718
COG category: [L] Replication, recombination and repair
COG ID: [COG0675] Transposase and inactivated derivatives
TIGRFAM ID: [TIGR01766] transposase, IS605 OrfB family, central region
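
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(5666172, 5668901)
print(length)                  # 2730
print(protein_length(length))  # 909
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of this record.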


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGTATG GCTGCCAACA AATACTATTA AACCCTGATA ATGATCTTCA TGCAATCTTA 
GAGTTTTTGT GTGGTGAAGC CACAAAACTC TCTAATTGCG GGACTTACTA TGCGCGTCAA
CTTTACTTTA AGACGGGTAA AATTCCTAGC AAATTTGATT TAAATAATGA GCTATCTAAC
AATATTCATT TTGCGGCAAT GTATTCTCAA GCAGCACAAC AATGTTTGAT GGGTGTAGCG
GAGTCATTCA AATCATTCAT GGGACTGCTA AAAGGGATAA AAAATAGTAC TGTAACGCAA
AAACCAAAAC TTCCAGGATA TCGAGATGGC GGGTTAAGTT TGGTCACATA TCCGGCTCAA
GCTATAAAAC TGAAACCACA AGGTTTACGT TTTCCGTTGG GTAGCAAGGT TAAAGCATGG
TTTGGAATAG CCGAATTTTA CTTATCTATG CCCTCGAATC TTGACCATAA GCAAATTAGA
GAGTATCGGA TTCTGCCTAG AAATGGTAAA TTCTATCTTG AACTTGTTTA CAAACTTCCG
ACCATTAAAT CTGATGTGGA TTTCGGTAAA TGTCTTGGTG TAGATCCAGG ACTCAATAAC
TGGTTAACCT GTGTTAGCAA TATTGGCACA TCTTTAATTG TAGATGGACT GCACCTCAAG
AGCTTGAATC AATGGTATAA CAAACGAGTC TCAGTTCTTA ACGAAAACCA GCCGCAAGGC
TTCTGGTCTA AGCAGTTAGC TGCTATCACT GAAAAGCGAA ATCGACAAGT TCGAGATGCA
GTTAATAAAA CTGCTCGAAT AATATTAAAT CACTGCCTTG AAAATCGTAT TGGAACAATC
GTTTTTGGGT GGAATGAAGG TCAGCGTCAA AACATTAATC TTGGCAATAA GACTAATCAA
ACATTTGTAC AAATTCCTAC TGCTAGGTTG AAAGATAGAA TCGCTCAACT CTGCGAACAG
TACGGACTGA GGTTTGAGGA AACTGAAGAA AGCTACACAA GTAAGGCAAG TTTTTTAGAC
TCTGATTTGC TACCTACATT CGGCGAAAAA CCTGAAGGGT GGCAACCATC TGGGAAAAGA
GTAAAACGCG GTCTGTACGC TTGTGCAAGT GGTTTAAAAA TAAATGCCGA TGCTAACGGT
GCAGCAAAGA TAGGATCAAT ACGAGAAACT CTGCTGGAAC TCAAACTCAT GTCCCTAACT
GAAGAAATTC TCTCCCAACT ACCAGGAGAC GTTCTAGGAA ATTTACGCCG TGCTGATGAT
GTCCTCAAAT CTATTCGAGA AAACACTGCA CCAACACCCT CAGTAGTTAA AGAAAGCCCA
GCCACCTTAG AGACTGTAAA CTGGGATGTA ATTATTTGTG GTGGTACATT AGGCATTTTA
ATTGGTTGTG CCTTAGCTGT ACGGGGATTG CGGGTGGCGC TACTTGAGAG AGGTACTTTG
CAGGGACGGG AACAAGAGTG GAATATCTCC CGTAAAGAGT TAGAAGTCTT TGTGGAGTTG
AATCTGCTGA CACCAGAGGA GTTAAAGAAA GCGATCGCCA CTGAATATAA TCCAGCCAGA
GTCCAATTTA AAGATGGTGC AGAAGTTTGG GTAAAGGATG TCTTAAATAT TGGCGTAGAT
CCAGTTTATT TACTAGCTAC CTTAAAACAG AGATTTTTAG ATGCTGGTGG TCAGTTATTT
GAACATACAC CTTTTAGTGA AGTCGTCATT CATCCAGATG GGGTAATGGT CAATCAGCAA
TTTACAGCCA AGTTATTGAT AGATGCAATG GGACACCTTT CTCCCATTAG CAAACAAGCA
CGCCAAGGCA AAAAACCAGA TGCACTTTGT CTGGTGGTGG GTAGTTGCGC TCAAGGTTTT
AGTGAAAATT CTGCCGGTGA TTTAATTTTA TCTTTTACAT CTTTGCAAAA CCAATGTCAG
TATTTTTGGG AAGCCTTTCC TGCTAGAGAT GGTAGAACAA CATATTTATT CACCTACATG
GATGCTCATC CCCAACGCTT GAGCTTAGAA GACTTGTTTG GAGAATACTT GGGTCTCTTA
CCAGAATACC AAGGTGTAGA ATTACAGCAG TTGAAATTTC AAAGAGCCTT GTTTGGGTTT
TTTCCGAGCG ATCGCCAAAG TCCATTAAAA ACACCCTGGA ACCGCATCCT ACCAGTAGGA
GACAGTAGCG GTAATCAATC ACCCCTAAGT TTTGGCGGTT TTGGCGCAAT GCTACGTCAC
CTGCAACGTT TAACATTGGG TACCCAGGAA GCCTTGCAAA CTGAGCAATT ATCAGCCACA
GCACTAGCAT TACTGCAACC ATATCAACCA AGCCTCAGTG TTACTTGGTT ATTTCAAAAA
GCCATGAGCG TTGGTGTTAA TCAAAATATT GCTCCAGAGC AAATTAACCA ACTACTATCA
ACGGTGTTTC AAGAAATGGC ACAACTCGGA ACACCCGTAT TAAAGCCCTT TTTACAAGAT
ATAGTCCAAT TTTCAGCACT GACACAAACA CTAGCAAAGA CTGGCCTATC TCATCCAGTA
TTAGTTGCTA AAATAATTCC CCAAGTAGGT TTAGTCAATC TATTAGATTG GCTAGTGCAT
TACACCAACT TAGGCATTTA TACTGCGCTG TTTGCACTAA GTCCAACCCT AGAAATATGG
ATTAAGAATT TTCCACCTAC GCAACAATAC TATTGGCATC GTTTAATGGA TGCTTGGAAA
TTCGGTTCTG GCGGTGATTA TAACGCTTAA
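
The gene sequence is laid out in space-separated blocks of 10 bases, so it has to be joined before any calculation such as GC content. The sketch below shows one way to do that. The helper names are hypothetical, and the short string used in the example is only the first line of the sequence, so its GC fraction is not expected to match the whole-gene value reported in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence only; paste the full sequence to
# compare against the GC content field of the record.
first_line = "ATGCCGTATG GCTGCCAACA AATACTATTA AACCCTGATA ATGATCTTCA TGCAATCTTA"
print(len(clean_sequence(first_line)))
print(f"{gc_content(first_line):.1%}")
```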
 
Protein sequence
MPYGCQQILL NPDNDLHAIL EFLCGEATKL SNCGTYYARQ LYFKTGKIPS KFDLNNELSN 
NIHFAAMYSQ AAQQCLMGVA ESFKSFMGLL KGIKNSTVTQ KPKLPGYRDG GLSLVTYPAQ
AIKLKPQGLR FPLGSKVKAW FGIAEFYLSM PSNLDHKQIR EYRILPRNGK FYLELVYKLP
TIKSDVDFGK CLGVDPGLNN WLTCVSNIGT SLIVDGLHLK SLNQWYNKRV SVLNENQPQG
FWSKQLAAIT EKRNRQVRDA VNKTARIILN HCLENRIGTI VFGWNEGQRQ NINLGNKTNQ
TFVQIPTARL KDRIAQLCEQ YGLRFEETEE SYTSKASFLD SDLLPTFGEK PEGWQPSGKR
VKRGLYACAS GLKINADANG AAKIGSIRET LLELKLMSLT EEILSQLPGD VLGNLRRADD
VLKSIRENTA PTPSVVKESP ATLETVNWDV IICGGTLGIL IGCALAVRGL RVALLERGTL
QGREQEWNIS RKELEVFVEL NLLTPEELKK AIATEYNPAR VQFKDGAEVW VKDVLNIGVD
PVYLLATLKQ RFLDAGGQLF EHTPFSEVVI HPDGVMVNQQ FTAKLLIDAM GHLSPISKQA
RQGKKPDALC LVVGSCAQGF SENSAGDLIL SFTSLQNQCQ YFWEAFPARD GRTTYLFTYM
DAHPQRLSLE DLFGEYLGLL PEYQGVELQQ LKFQRALFGF FPSDRQSPLK TPWNRILPVG
DSSGNQSPLS FGGFGAMLRH LQRLTLGTQE ALQTEQLSAT ALALLQPYQP SLSVTWLFQK
AMSVGVNQNI APEQINQLLS TVFQEMAQLG TPVLKPFLQD IVQFSALTQT LAKTGLSHPV
LVAKIIPQVG LVNLLDWLVH YTNLGIYTAL FALSPTLEIW IKNFPPTQQY YWHRLMDAWK
FGSGGDYNA
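
The protein sequence can be reproduced from the gene sequence with translation table 11. The following sketch builds the codon table by hand rather than relying on a library. It assumes the usual NCBI ordering of bases (T, C, A, G) and uses the amino-acid string that tables 1 and 11 share; table 11 differs from the standard code only in the set of permitted start codons, which does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG ('*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a block-formatted CDS, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence.
print(translate("ATGCCGTATG GCTGCCAACA AATACTATTA"))  # MPYGCQQILL
```

The output matches the first 10 residues of the protein sequence above. Passing the full gene sequence should yield the complete protein, provided the sequence contains only A, C, G and T.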