Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_1645 |
Symbol | |
ID | 6975061 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 1830787 |
End bp | 1833561 |
Gene Length | 2775 bp |
Protein Length | 924 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643391180 |
Product | DNA topoisomerase I |
Protein accession | YP_002276037 |
Protein GI | 209543808 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.247282 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGACG TCGTCGTGGT CGAGTCGCCT GCCAAGGCGA AGACGATCAA CAAGTATCTG GGGGACGGAT TCACGGTTCT TGCCTCGTTC GGCCATGTCC GCGACCTGCC GCCGAAGGAT GGCAGCGTCC GCCCCGACGA GAATTTCGCC ATGGACTGGC AGGCCGACGA GCGCGGCAGC CGGCAGATGG CCGCGATCGC CAAGGCGTTG CGCGGGGCGC GGCATCTGTA CCTGGCCACT GACCCGGATC GCGAGGGCGA GGCGATTTCC TGGCATGTCC GCGCGATGCT GGAGGAAAAG AACCTGCTGA AGGGGGTGGA CGTCCAGCGC GTCACCTTCA ACGAGATCAC CAAGAGCGCC ATCCGCGCCG CGATGGCCCA GCCGCGCGAC CTGGACCAGC CGCTGATCGA GGCCTATCTG GCCCGGCGCG CGCTGGATTA CCTGGTGGGC TTCACCCTGT CGCCCGTGCT GTGGCGCAAG CTGCCGGGGT CGCGCAGCGC CGGCCGGGTA CAGTCGGTCG CCCTGCGCCT GATCTGCGAG CGCGAGGCGG AGATCGAGAG CTTCCGCGCC CGCGAATACT GGACGGTGGC CGCGCAGTTC ACCACGCCCG GCGGCGCGGC CTTCACCGCG CGGCTGACGC ACCTGGCCGG CCGCAAGCTG GACCAGTTCG ACCTGCATGA CGAAGCCGGG GCCATGGCGG CCAAGGCGGC CGTCGAGGCC GGGCGCTTCG CCGTGCAGTC GGTGGAACGG CGCAAGGTCC GCCGCAACCC GCCGCCGCCC TTCACGACCT CGACCATGCA GCAGGAAGCG TCGCGCAAGC TGGGCATGGG CGCGCAGGGC ACCATGCGCA CCGCCCAGCA GCTGTACGAA GGCATCGACC TGGGCGGCGA GACGGTCGGT CTGATCACCT ATATGCGAAC CGATGGCGTG CAGATGGCCG GCGAGGCCAT CGCCGCCATC CGCGGCCATA TCGGCGAGAG CTTCGGCGCG CCCTACGTGC CCGAGAAGGC CCGGATCTAT TCCACCAAGG CGAAGAACGC CCAGGAAGCG CACGAGGCGA TCCGCCCCAC CGATGTCAGC CGCACCCCGG CGCAGATGGC CCGCTACCTG AATGACGAGC AGCGGCGGCT GTACGAACTG ATCTGGAAAC GGTCGGTCGC CAGCCAGATG CAGTCGGCCG AACTGGACCA GGTGATCGTC GAGATCGCGG ATGCCGGAGG GGCCGCCACC CTGCGCGCCA CCGGGTCGAT GATCGCCTTC GACGGGTTCC TGAAGCTGTA CAGCGAAGGC CGGGACGACG CCGCGCCGAA GGACGAGCAG GACGACGACA GCCGCATGCT GCCGCCGATG CGCGAGCGCG ACGCGCTGAA AACCGGCGAG GTCGCGGCCG ACCAGCATTT CACCCAGCCG CCGCCGCGCT TCTCCGAAGC GTCGCTGGTC AAGAAGATGG AAGAGATCGG GATCGGCCGG CCGTCGACCT ATGCCTCGAT CCTGACGGTG CTGCGCGACC GCAATTACGT GCGGCTGGAT GCCCGCCGCT TCGTCCCCGA GGACCGGGGG CGGCTGGTCA CCGCGTTCCT GACCTCGTTC TTCGAACGCT ATGTGGACAC GCAGTTCACG GCGGGGCTGG AAGAGCAGCT GGACGACATA TCCGGTGGGC GGGCCGACTG GCGCGACGTG ATGTCGGCCT TCTGGCAGGA TTTTTCCCGC GCGGTGGACC AGACGAAGGA TCTGAAGATC TCCGACGTCA TCAGCGCGCT GGATGCCGAC CTGGCGCCGC ATTTCTTCCC CGCGCACCTC GACGGCAGCG ATCCGCGCGT CTGCACCGCC TGCGGCACCG GGCGGCTGGG GCTGAAGCTG GGGCGGTACG GCGCCTTCAT CGGCTGTTCC AACTATCCGA CCTGCCAGTT CACCCGCCGC CTGGTGGTGG ACCCCAAGGA GGACGGCGAG GCCGACACGC TGAAGGACGG CATGCGCCTG CTGGGCCAGA CGCCCGGCGG CGAGGATGTG ACCGTGCGGC GCGGCCCGTG GGGCCTGTAC GTCCAGCAGG GCGAACCCGA CCCCGAGGAC AAGAAGGCCA AGCCCCGGCG CGCCACCATT CCGCGCGGGA TCGAAGGCGA CAAGATCACG CTGGACCAGG CGCTGGGCCT GCTCTCGCTG CCGCGGGTCG TCGGCATCCA TCCGGAAACC GGCGAGCAGA TCGAGGCCGG GCTTGGCCGC TTCGGGCCAT ACGTGAAGAT GGGCGCGGTC TATGGATCGC TGGACAAGGA TGACGACATC CTGACGGTCG GGCTGAACCG GGCGGTGGAC GTGCTGGCCC GCAAGCTGGC CTCGGTCCGC ACCATCGCGC CGCACCCCAA GGATGGCGAG CCGGTGATCG TCCGCAAGGG ACGGTTCGGA CCGTATATCC AGCATGGCAC GATGGTGGTG AACGTGCCCC GGGGCGAGGC CATGGAGGAC GTGACCCTGG ACCAGGCGGT GGCGCTGCTG GCCGAAAAGG GCAAGCCGCT GAAGCCCAAG GGCAAGGCCG CGGCGAAGAA GGCCCCCGCC CGGAAGACGG CCGCCAGGAA GGCGCCGGCC AAGACCGCGG CGAAGAAAGC CGCCCCGGAC GGGGATGCCG ATACGCAGGC CAGGGCCGCG AAACCCCCGG CGCGCAAGGC CGCCGCCCGC AAGACCCCGG CCGGCAAGGC AACGGGCAAG ACCGCGAAGG GCAAGGCCGA ACCCGGCGAG GGTGCCGGGC CCCGCACGCG GCGCACGGCG ACCGAGGCCG GCTGA
|
Protein sequence | MTDVVVVESP AKAKTINKYL GDGFTVLASF GHVRDLPPKD GSVRPDENFA MDWQADERGS RQMAAIAKAL RGARHLYLAT DPDREGEAIS WHVRAMLEEK NLLKGVDVQR VTFNEITKSA IRAAMAQPRD LDQPLIEAYL ARRALDYLVG FTLSPVLWRK LPGSRSAGRV QSVALRLICE REAEIESFRA REYWTVAAQF TTPGGAAFTA RLTHLAGRKL DQFDLHDEAG AMAAKAAVEA GRFAVQSVER RKVRRNPPPP FTTSTMQQEA SRKLGMGAQG TMRTAQQLYE GIDLGGETVG LITYMRTDGV QMAGEAIAAI RGHIGESFGA PYVPEKARIY STKAKNAQEA HEAIRPTDVS RTPAQMARYL NDEQRRLYEL IWKRSVASQM QSAELDQVIV EIADAGGAAT LRATGSMIAF DGFLKLYSEG RDDAAPKDEQ DDDSRMLPPM RERDALKTGE VAADQHFTQP PPRFSEASLV KKMEEIGIGR PSTYASILTV LRDRNYVRLD ARRFVPEDRG RLVTAFLTSF FERYVDTQFT AGLEEQLDDI SGGRADWRDV MSAFWQDFSR AVDQTKDLKI SDVISALDAD LAPHFFPAHL DGSDPRVCTA CGTGRLGLKL GRYGAFIGCS NYPTCQFTRR LVVDPKEDGE ADTLKDGMRL LGQTPGGEDV TVRRGPWGLY VQQGEPDPED KKAKPRRATI PRGIEGDKIT LDQALGLLSL PRVVGIHPET GEQIEAGLGR FGPYVKMGAV YGSLDKDDDI LTVGLNRAVD VLARKLASVR TIAPHPKDGE PVIVRKGRFG PYIQHGTMVV NVPRGEAMED VTLDQAVALL AEKGKPLKPK GKAAAKKAPA RKTAARKAPA KTAAKKAAPD GDADTQARAA KPPARKAAAR KTPAGKATGK TAKGKAEPGE GAGPRTRRTA TEAG
|
| |