Gene Ava_3784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_3784 
Symbol 
ID3678812 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp4716879 
End bp4719065 
Gene Length2187 bp 
Protein Length728 aa 
Translation table11 
GC content46% 
IMG OID637719135 
ProductDNA topoisomerase I 
Protein accessionYP_324284 
Protein GI75909988 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAAAC GCTTGCTAGT GGTTGAATCT CCGGGTAAAG TGAAAAAGCT CAGTCAGATT 
TTGGGTTCTG AGTGGATTGT CCGTGCTTCT TGTGGACACA TTCGGGAACT GAGTGATGAG
GGTGAGGATG CTTTGGGGTT CAGCATGGAT GGTGGTAGTG TGCAGTGTCA CTACGTACCG
CGTGACCAAC GTTCAAAAGA AACTATCCAG CAGTTAAAGA GTGCGGCGAA GCAGGTGAGT
GAAGTTGTGT TAGCGACTGA CCCGGATAGG GAAGGCGAAA CCATTGCTTG GCATCTCAAG
GAAGTACTGG GATTGAGAGA ACCGAAACGA GTTGTCTATA CAGAGATTAC AGCCTCGGCG
GTACAGAGTG CGATCGCTCA TCCACGCAAA CTGGACTTAA ATTTAGTCGG GGCGGGATTG
TGTCGAGATT GTTTAGATAA GTTGGTCGGG TATAAGGGTA GTCCTCTGGT TTGGGCGTTA
AATAATGGGG CAAAGAGTGT GGGGAGAGTG CAGAGTTCTA CACTCCACCT GATTTGTCAA
CGGGAGAGAG AAATTCAAAG TTTTGTACCT CAAGATTATT GGAGTGTTTG GGTTGATTAT
GTTGAAGGTT TCCGCGCTTT CTATAAAGGT GCAGTCAACA CTCGCTCAGA CTCTCCAGAA
ACAGAAGTAG AAATTCATGA TGATGCGGCG GGAAATAGCA CAACTGCGCC TGAGTCTACC
CGCGTGCTGG CGGAAGCAGA AGCTAATCGT TTAGTAGAAG AAGCACGCCG TCATCCCCAC
CAAATTGTGC AGTATGAGGG AAAAATTGCC AACCGTCAAC CACCCCCACC TTTTATTACT
TCCACCTTGC AACAAGCGGC TGGTTCTAAG TTGAAGTTCG CGCCAGAGAA GACGATGCAG
CTAGCCCAAA AGTTGTATGA GGCTGGGTTG ATTACATATA TGCGGACAGA CTCGGTAATG
TTGAGTCCAG AGTTTTGTGA AAGTGCGCGT CAGTGGTTGG AGCAAAATGA CCCCCAGAAT
GTACCGAAGC AGGTTGCAAG ACATCGCAGT AGTAAGACAG CCCAACAAGG ACACGAAGCG
ATTCGCCCAA CTGATGTGTT TCGTCCTTCC GCCCAGTTGC GAATAGAACT ATCCACAGAT
GAGTTTAACT TGTATGTGAT GATTTGGAAA CGGGCGATCG CTTCCCAATG TCGTCCAGCC
CAACTACGAA AAACGGTTGT AATTACCCAG TCTGGTCAAA TTCTCTGGCA AGCAAGAGGA
CAAGTAGTAG AATTTCTCGG TTACACCCGT TACTGGCCTA ACCTCAGCAA GGATACCCTC
TTACCCACCT TGCAACAAGG GCAAATGGTC ACTTTAGAAA ATGCTGGTCA CGAAAAGAAA
CAAACCCAAC CCCCACCACG CTACAGCGAA CCCAAGTTAG TGCAACTGAT GGAACGTAAA
GGTATTGGTC GTCCCAGTAC CTATTCTCCC ACCATTGCCA CCCTGAAGAA ACGGGGTTAT
GTGGAATTGA CCAAAGACAA CTTACACCCA ACAAATCTAG GGTTAGAAGT TGATACTTTC
TTACAAAAAG CCCTACCGGA TTTACTAGAA GCGGAATTTA CTGCCAAAAT GGAGAATGCC
CTAGATGCGA TCGCTGAAGG TAAACAACCT TGGCAAATAT ACTTAACCAC TTGGAATCAG
AATTATTTTG CCCCTGCTTT AGCCAAAGCC AAAACCATAG CTGTCGATTC AGGTAATCCG
ACAAAAACCT TTCCCCCGCG TCAATACGAT ACTAGCCGGA CTCGTTGCCC TGATTGTAAT
AACTTTCTCA GCAAAATTCC CAGCAGTAAA CTGAAGAAAA AATACTTCCT CAAATGCACT
AGCGGTTGTG AAAACACAGT CTTATTCTGG AGTGAATTTA GTAAGACTTG GCAAGCGCCC
CGAACAAAAG ATGACAAGAT GGCAGAAAAT GGGCAAAAGA ATCATCTCTC ACCTGCAACG
AAACTTCCTA CTCAGTTGAC AGCATACCCC TGTCCAGTAT GTAAAAGACC TTTGGAGGAG
TATACTTACA CCAAAGATGG ACAGAAAAAA ACTATGCTGC GTTGCTCTGC ATCCTCATCT
CGCACTGATA AAAAACATCA AGATGTGGCT TATTTCCACA CTGCTAAAGG TTGGTGGAGT
CCTAAGTTTG GGGAAATAAA TAAATGA
 
Protein sequence
MPKRLLVVES PGKVKKLSQI LGSEWIVRAS CGHIRELSDE GEDALGFSMD GGSVQCHYVP 
RDQRSKETIQ QLKSAAKQVS EVVLATDPDR EGETIAWHLK EVLGLREPKR VVYTEITASA
VQSAIAHPRK LDLNLVGAGL CRDCLDKLVG YKGSPLVWAL NNGAKSVGRV QSSTLHLICQ
REREIQSFVP QDYWSVWVDY VEGFRAFYKG AVNTRSDSPE TEVEIHDDAA GNSTTAPEST
RVLAEAEANR LVEEARRHPH QIVQYEGKIA NRQPPPPFIT STLQQAAGSK LKFAPEKTMQ
LAQKLYEAGL ITYMRTDSVM LSPEFCESAR QWLEQNDPQN VPKQVARHRS SKTAQQGHEA
IRPTDVFRPS AQLRIELSTD EFNLYVMIWK RAIASQCRPA QLRKTVVITQ SGQILWQARG
QVVEFLGYTR YWPNLSKDTL LPTLQQGQMV TLENAGHEKK QTQPPPRYSE PKLVQLMERK
GIGRPSTYSP TIATLKKRGY VELTKDNLHP TNLGLEVDTF LQKALPDLLE AEFTAKMENA
LDAIAEGKQP WQIYLTTWNQ NYFAPALAKA KTIAVDSGNP TKTFPPRQYD TSRTRCPDCN
NFLSKIPSSK LKKKYFLKCT SGCENTVLFW SEFSKTWQAP RTKDDKMAEN GQKNHLSPAT
KLPTQLTAYP CPVCKRPLEE YTYTKDGQKK TMLRCSASSS RTDKKHQDVA YFHTAKGWWS
PKFGEINK