Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_14470 |
Symbol | topA |
ID | 7760383 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 1427388 |
End bp | 1429994 |
Gene Length | 2607 bp |
Protein Length | 868 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643804345 |
Product | DNA topoisomerase I |
Protein accession | YP_002798638 |
Protein GI | 226943565 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTAAAT CGCTGGTCAT CGTGGAATCC CCGGCCAAGG CCAAGACCAT CAACAAGTAT CTGGGCAGCC AGTACGTGGT GAAGTCGAGC ATCGGCCATA TCCGTGACCT GCCCACCAGC GGCTCGGCCA GTACCGCCAA GGAGCCGGCC AAGCGCGGCA AGGGCGTGGC CGAAGGGCCG GCGTTGTCGC CCAGGGACAA GGCCAAGCGT CAGTTGTTCG CGCGCATGGG GGTCGATCCC GAGCACGGCT GGCAGGCCCA TTACGAAATC CTGCCGGGCA AGGAAAAGGT GGTCGACGAG TTGCGCCGCC TGGCCCGGGA AGCCGACACC ATCTATCTCG CCACCGACCT GGACCGCGAA GGGGAGGCCA TCGCCTGGCA CCTGCGCGAA TCCATCGGCG GCGACGAGGA GCGCTACAAG CGCGTGGTGT TCAACGAAAT CACCAAGAAG GCGATCCAGG AGGCTTTCTC CCAGCCGGGC GACCTGGATA TCAACCGGGT CAACGCGCAG CAGGCGCGGC GCTTCCTCGA CCGGGTGGTC GGCTACATGG TTTCGCCGCT GCTCTGGCAG AAGATCGCTC GCGGCCTGTC CGCCGGCCGC GTGCAGTCGG TGGCGGTGAA GCTGATCGTC GAGCGCGAGC GGGAGATCCG CGCCTTCATC CCGGAGGAGT TCTGGGAGGT GCATGCCGAC CTGGGCACCG CCCGCGGCGA CAAGGTGCGC TTCGAGGTGG CCAGGGAGCA GGGCGAGGTC TTCCGCCCGC TCAACGAGGC CCAGGCCATG GCCGCGCTGG AGAAGCTCAA GGCCTCCAGC TACCAGGTGC TCAAGCGCGA GGACAAGCCG ACCCGCAGCA AGCCCTCGGC GCCCTTCATC ACCTCGACCT TGCAGCAGGC GGCGAGCAAC CGCCTCGGCT TCTCGGTGAA GAAGACCATG ATGATGGCTC AGCGCCTGTA CGAGGCCGGT TACATCACCT ACATGCGGAC CGACTCGACC AACCTCTCCG CCGATGCCCT GGAGATGGCG CGCGGCTTCA TCGACAGCGA GTTCGGCGGG AAGTACCTGC CGGCCAAGCC CAATGTCTAC ACCAGCAAGG AAGGCGCCCA GGAGGCTCAC GAGGCGATCC GTCCCTCCGA CGTCAACCTG CGGCCGAACC AGTTGGCCGG CATGGAGCGC GACGCCGAGC GCCTCTACGA GCTGATCTGG CGCCAGTTCG TCGCCTGCCA GATGCCGCCG GCCGAATACC TGTCGACCAA CGTCAGCGTC CAGGCCGGCG ACTTCGAGCT GCGCGCCAAG GGCCGCATCC TCAAGTTCGA CGGCTATACC CGCGTGTTGC CGCAATTGGC CAAGCCCGGC GAGGACGACG TGCTGCCGGA GATGAGCGAG GGCGAACTGC TCGATCTGCT CAAGCTCGAT CCCAGCCAGC ATTTCACCAA GCCGCCAGCG CGCTACAGCG AGGCCAGCCT GGTCAAGGAG ATGGAAAAGC GCGGGATCGG CCGGCCGTCC ACCTACGCGG CGATCATCTC GACCATCCAG GAGCGCGGCT ACGTGACCCT GCAGAATCGC CGCTTCCATT CGGAGAAGAT GGGCGAGATC GTCACCGAGC GGCTCGGCGA GAGCTTCGCC AACCTGATGG ACTACGGCTT CACCGCTAGC ATGGAGGAGC ATCTGGACGA CGTCGCCCAG GGCGAGCGCG ACTGGAAGAA CCTGCTCGAC GAGTTCTACG GCGATTTCCG CAGGAAGCTG GAGGCGGCCG AATCCAGCGA GGCCGGCATG CGCGCCAACC AGCCGACCCT GACCGACATT CCCTGCCGCG AATGCGGCCG GCCGATGATG ATCCGCACCG CCTCGACCGG CGTGTTCCTC GGCTGCTCGG GTTACAACCT GCCGCCCAAG GAACGCTGCA AAGCGACCGT CAACCTGATC CCGGGCGACG AGATCGCCGC GGACGACGAG GGCGAATCCG AATCCCTGCT GCTGCGCCAC AAGCGTCGCT GCCCGAAGTG CGGCACGGCG ATGGACGCCT ATCTGCTCGA CGAGCGGCAC AAGCTGCACA TCTGCGGCAA CAATCCGGAC TGCCCCGGCT ACGAGATCGA GGAGGGCCAG TACCGCATCA AGGGCTACGA GGGGCCGACC CTGGAGTGCG ACAAGTGCGG TAGCGAGATG CAGCTCAAGA CCGGCCGCTT CGGCAAGTTC TTCGGCTGTA CCAACGCCGC CTGCAAGAAC ACCCGCAAGC TGCTGAAGAA CGGCGAGCCG GCGCCGCCGA AGATGGATGC GGTGAAAATG CCGGAGCTGC GCTGCGAGAA GGTCGACGAT GTCTACGTGC TGCGCGACGG CGCTTCCGGC CTGTTCCTCG CCGCCAGCCA GTTCCCGAAG AACCGCGAGA CCCGTGCGCC GCTGGTCCTG GAACTGTTGC CGCATCGGGA CGAAATCGAC CCGAAGTACC ACTTCCTGCT GGAAGCCCCT AGCCACGACC CGGAAGGGCG CCCAGCGGTG ATCCGCTTCA GCCGCAAGAC CAAGGAGCAA TACGTGCAGA GCGAGGTCGA GGGCAAGCCC AGCGGCTGGC GCGCCTTCCA TCGGGACGGC CGCTGGGTGG TCGAGGACAA GCACTGA
|
Protein sequence | MGKSLVIVES PAKAKTINKY LGSQYVVKSS IGHIRDLPTS GSASTAKEPA KRGKGVAEGP ALSPRDKAKR QLFARMGVDP EHGWQAHYEI LPGKEKVVDE LRRLAREADT IYLATDLDRE GEAIAWHLRE SIGGDEERYK RVVFNEITKK AIQEAFSQPG DLDINRVNAQ QARRFLDRVV GYMVSPLLWQ KIARGLSAGR VQSVAVKLIV EREREIRAFI PEEFWEVHAD LGTARGDKVR FEVAREQGEV FRPLNEAQAM AALEKLKASS YQVLKREDKP TRSKPSAPFI TSTLQQAASN RLGFSVKKTM MMAQRLYEAG YITYMRTDST NLSADALEMA RGFIDSEFGG KYLPAKPNVY TSKEGAQEAH EAIRPSDVNL RPNQLAGMER DAERLYELIW RQFVACQMPP AEYLSTNVSV QAGDFELRAK GRILKFDGYT RVLPQLAKPG EDDVLPEMSE GELLDLLKLD PSQHFTKPPA RYSEASLVKE MEKRGIGRPS TYAAIISTIQ ERGYVTLQNR RFHSEKMGEI VTERLGESFA NLMDYGFTAS MEEHLDDVAQ GERDWKNLLD EFYGDFRRKL EAAESSEAGM RANQPTLTDI PCRECGRPMM IRTASTGVFL GCSGYNLPPK ERCKATVNLI PGDEIAADDE GESESLLLRH KRRCPKCGTA MDAYLLDERH KLHICGNNPD CPGYEIEEGQ YRIKGYEGPT LECDKCGSEM QLKTGRFGKF FGCTNAACKN TRKLLKNGEP APPKMDAVKM PELRCEKVDD VYVLRDGASG LFLAASQFPK NRETRAPLVL ELLPHRDEID PKYHFLLEAP SHDPEGRPAV IRFSRKTKEQ YVQSEVEGKP SGWRAFHRDG RWVVEDKH
|
| |