Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_3784 |
Symbol | |
ID | 3678812 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 4716879 |
End bp | 4719065 |
Gene Length | 2187 bp |
Protein Length | 728 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 637719135 |
Product | DNA topoisomerase I |
Protein accession | YP_324284 |
Protein GI | 75909988 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGAAAC GCTTGCTAGT GGTTGAATCT CCGGGTAAAG TGAAAAAGCT CAGTCAGATT TTGGGTTCTG AGTGGATTGT CCGTGCTTCT TGTGGACACA TTCGGGAACT GAGTGATGAG GGTGAGGATG CTTTGGGGTT CAGCATGGAT GGTGGTAGTG TGCAGTGTCA CTACGTACCG CGTGACCAAC GTTCAAAAGA AACTATCCAG CAGTTAAAGA GTGCGGCGAA GCAGGTGAGT GAAGTTGTGT TAGCGACTGA CCCGGATAGG GAAGGCGAAA CCATTGCTTG GCATCTCAAG GAAGTACTGG GATTGAGAGA ACCGAAACGA GTTGTCTATA CAGAGATTAC AGCCTCGGCG GTACAGAGTG CGATCGCTCA TCCACGCAAA CTGGACTTAA ATTTAGTCGG GGCGGGATTG TGTCGAGATT GTTTAGATAA GTTGGTCGGG TATAAGGGTA GTCCTCTGGT TTGGGCGTTA AATAATGGGG CAAAGAGTGT GGGGAGAGTG CAGAGTTCTA CACTCCACCT GATTTGTCAA CGGGAGAGAG AAATTCAAAG TTTTGTACCT CAAGATTATT GGAGTGTTTG GGTTGATTAT GTTGAAGGTT TCCGCGCTTT CTATAAAGGT GCAGTCAACA CTCGCTCAGA CTCTCCAGAA ACAGAAGTAG AAATTCATGA TGATGCGGCG GGAAATAGCA CAACTGCGCC TGAGTCTACC CGCGTGCTGG CGGAAGCAGA AGCTAATCGT TTAGTAGAAG AAGCACGCCG TCATCCCCAC CAAATTGTGC AGTATGAGGG AAAAATTGCC AACCGTCAAC CACCCCCACC TTTTATTACT TCCACCTTGC AACAAGCGGC TGGTTCTAAG TTGAAGTTCG CGCCAGAGAA GACGATGCAG CTAGCCCAAA AGTTGTATGA GGCTGGGTTG ATTACATATA TGCGGACAGA CTCGGTAATG TTGAGTCCAG AGTTTTGTGA AAGTGCGCGT CAGTGGTTGG AGCAAAATGA CCCCCAGAAT GTACCGAAGC AGGTTGCAAG ACATCGCAGT AGTAAGACAG CCCAACAAGG ACACGAAGCG ATTCGCCCAA CTGATGTGTT TCGTCCTTCC GCCCAGTTGC GAATAGAACT ATCCACAGAT GAGTTTAACT TGTATGTGAT GATTTGGAAA CGGGCGATCG CTTCCCAATG TCGTCCAGCC CAACTACGAA AAACGGTTGT AATTACCCAG TCTGGTCAAA TTCTCTGGCA AGCAAGAGGA CAAGTAGTAG AATTTCTCGG TTACACCCGT TACTGGCCTA ACCTCAGCAA GGATACCCTC TTACCCACCT TGCAACAAGG GCAAATGGTC ACTTTAGAAA ATGCTGGTCA CGAAAAGAAA CAAACCCAAC CCCCACCACG CTACAGCGAA CCCAAGTTAG TGCAACTGAT GGAACGTAAA GGTATTGGTC GTCCCAGTAC CTATTCTCCC ACCATTGCCA CCCTGAAGAA ACGGGGTTAT GTGGAATTGA CCAAAGACAA CTTACACCCA ACAAATCTAG GGTTAGAAGT TGATACTTTC TTACAAAAAG CCCTACCGGA TTTACTAGAA GCGGAATTTA CTGCCAAAAT GGAGAATGCC CTAGATGCGA TCGCTGAAGG TAAACAACCT TGGCAAATAT ACTTAACCAC TTGGAATCAG AATTATTTTG CCCCTGCTTT AGCCAAAGCC AAAACCATAG CTGTCGATTC AGGTAATCCG ACAAAAACCT TTCCCCCGCG TCAATACGAT ACTAGCCGGA CTCGTTGCCC TGATTGTAAT AACTTTCTCA GCAAAATTCC CAGCAGTAAA CTGAAGAAAA AATACTTCCT CAAATGCACT AGCGGTTGTG AAAACACAGT CTTATTCTGG AGTGAATTTA GTAAGACTTG GCAAGCGCCC CGAACAAAAG ATGACAAGAT GGCAGAAAAT GGGCAAAAGA ATCATCTCTC ACCTGCAACG AAACTTCCTA CTCAGTTGAC AGCATACCCC TGTCCAGTAT GTAAAAGACC TTTGGAGGAG TATACTTACA CCAAAGATGG ACAGAAAAAA ACTATGCTGC GTTGCTCTGC ATCCTCATCT CGCACTGATA AAAAACATCA AGATGTGGCT TATTTCCACA CTGCTAAAGG TTGGTGGAGT CCTAAGTTTG GGGAAATAAA TAAATGA
|
Protein sequence | MPKRLLVVES PGKVKKLSQI LGSEWIVRAS CGHIRELSDE GEDALGFSMD GGSVQCHYVP RDQRSKETIQ QLKSAAKQVS EVVLATDPDR EGETIAWHLK EVLGLREPKR VVYTEITASA VQSAIAHPRK LDLNLVGAGL CRDCLDKLVG YKGSPLVWAL NNGAKSVGRV QSSTLHLICQ REREIQSFVP QDYWSVWVDY VEGFRAFYKG AVNTRSDSPE TEVEIHDDAA GNSTTAPEST RVLAEAEANR LVEEARRHPH QIVQYEGKIA NRQPPPPFIT STLQQAAGSK LKFAPEKTMQ LAQKLYEAGL ITYMRTDSVM LSPEFCESAR QWLEQNDPQN VPKQVARHRS SKTAQQGHEA IRPTDVFRPS AQLRIELSTD EFNLYVMIWK RAIASQCRPA QLRKTVVITQ SGQILWQARG QVVEFLGYTR YWPNLSKDTL LPTLQQGQMV TLENAGHEKK QTQPPPRYSE PKLVQLMERK GIGRPSTYSP TIATLKKRGY VELTKDNLHP TNLGLEVDTF LQKALPDLLE AEFTAKMENA LDAIAEGKQP WQIYLTTWNQ NYFAPALAKA KTIAVDSGNP TKTFPPRQYD TSRTRCPDCN NFLSKIPSSK LKKKYFLKCT SGCENTVLFW SEFSKTWQAP RTKDDKMAEN GQKNHLSPAT KLPTQLTAYP CPVCKRPLEE YTYTKDGQKK TMLRCSASSS RTDKKHQDVA YFHTAKGWWS PKFGEINK
|
| |