Gene Information |
Locus tag | GM21_1529 |
Symbol | |
ID | 8136858 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1792492 |
End bp | 1794831 |
Gene Length | 2340 bp |
Protein Length | 779 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644869141 |
Product | DNA topoisomerase I |
Protein accession | YP_003021343 |
Protein GI | 253700154 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
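
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (an assumption, although it matches the listed Gene Length), and that the final codon is a stop codon that adds no amino acid. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 1792492
end_bp = 1794831

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases; the last codon is assumed to be
# the stop codon, which does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 2340 0 779
```

Under these assumptions the result agrees with the listed Gene Length (2340 bp) and Protein Length (779 aa).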
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 6.23259e-34 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGTCTCAGA ATCTCGTCAT CGTTGAGTCT CCCGCCAAGG CGAAGACCAT TGAAAAATTC CTCGGCCACG ACTACAAGGT CCTGGCCTCG TTCGGTCACG TGCGCGCCCT CCCCAGCAAG CAGGGGTCGG TGGACACGGA GCACGACTTC GAGCCGAAGT ACGCCGTCCT CCCCGAGAGC AAAAAGCACA TCGACGCCAT CAAGAAGGAG ATGAAGGGAA TCTCCTCGCT CCTCTTGGCG ACAGACCCCG ACCGCGAAGG GGAGGCGATC TCCTGGCACC TTTTGGCCGC GCTCGGCCTG GACGGGAAGA AGAAGCTTCC GTTCGAGATC AAGCGGGTGG TCTTCCACGA GATCACCAAG GACGCCATCG TGCATGCGGT GGAAAATCCC CGCGACATCG CCCTTGACCT GGTGGATGCC CAGCAGGCCC GCTCCATCCT CGATTACCTG GTCGGCTTCA CCCTCTCTCC TTTTCTTTGG AAGAAGATCC GCTACGGACT TTCCGCGGGG CGGGTGCAGT CGGTCGCACT GAGGCTCATC TGCGAGCGCG AGAAGGAGAT CCAGGCCTTC AAGGAGCAGG AGTACTGGAC CATCGCCGCG AAGCTCGAGA CCGCCAAGAA GCAGGGGCTC ACGGCCACGC TGGTGGAGGC CGAGGGGAAG AAGCTAGGCA AGTTCGACAT CCCCGACCAG GAAACGGCCT ACCGGCACTT CGCCAAGCTA GGGGGGAAGG AATTCCCGCC GCAGGAAGGG GACGAGTCGG GAAAACCGCT GACGGTCGAG CACCCGGCGC ACCCCGAGTA CCGGGTCGAC AAGGTCACCA AGAGCGAGAG AAAGCGCCAG CCTTCTCCCC CTTTCACCAC CTCCACCTTG CAGCAGGAGG CGGCGCGCAA ACTCGGCTTC TCCGCCAAGA AGACCATGTC CACGGCCCAA AAGCTCTACG AGGGGATCGA CGTCGGCGAA GGGGCCGTAG GTCTCATCAC CTACATGCGT ACCGACTCCG TCGCCCTTTC CAACGTGGCG CTGGAAGAGG CAAAGTCGGT GATCACCTCG CTCTACGGCA AGGAATACGC GCTGGAGAAG CCGCGCTTCT TCAAGAACAA GTCCAAAAAC GCCCAGGAAG CGCACGAGGC GGTCCGCCCG ACCTACATCG CCAAGACCCC GGTCGAGCTG AAGAAGTTCC TGAGCAGCGA CCAGTTCAAG CTCTACGACC TGATCTGGAA GAGGACTGTC GCCTGCCAAA TGGCCGAGGC GCTCCTGGAC CAGACCTCGG TCGATATCGG CGCGGGAGAA GGGTTCCGCT TCCGGGTCGC AGGCACCGTG ATCCGCTTCG CCGGCTTCAT GAAGCTCTAC ATCGAAGGGG TCGACGACGA GGCGGAGGAC AAGGATAAGG AAGGGCTGCT CCCGCCTTTG GCCGAGGGGG ACATCCTCAA GCTGCAGCAG CTTCTCCCCG AGCGGCACTT CACCCAGCCC CCGCCGCGCT ACACCGAGGC GAGTCTGGTG AAGACCCTGG AGGAGTACGG CATCGGGCGC CCGTCCACCT ACGCCTCCAT CATGAACACC CTTACCGAGA GAAAGTACGC GCGCCTGGAC AAGAAGCGCT TCTTCCCCGA AGACGTAGGG ATGGTGGTCT CCGACCTTTT GACCAACCAC TTCACCCAGT ACGTCGACTA CAACTTCACG GCGAACCTGG AGGAAGAGCT CGACATGGTC TCGCGCGGCG AGAAGCAGTG GCGCCCGCTC CTGCACGAGT TCTGGGGTCC CTTCATCAAC CTCTTGAAGC TGAAGGAAGG GGAGGTGAAC 
AAGTCGGATT TAACCACCGA GGCGACCAAC GAGGTCTGCC CCGAGTGCGG CAAGCCCCTG GTGGTGAAGC TCGGCAAGTT CGGCAAGTTC TACGCCTGCA CCGGTTATCC CGAATGCCGT TACATCCGAC CGTTGGACAA GGAGACGGGC GAGGTGGTCG AGCCCGTGGT TTCCGAGGAA ATCTGCGACA AGTGCGGCAG CCACATGCTG ATCAAGGACG GGCGTTTCGG CAAGTATCTG GCCTGTTCCG CCTACCCCAA CTGCAAGAAC ATCCAGCCGC TGGTGAAGCC CAAGGGTACC GGGATCACCT GCGCCGAATG CGGCAAGGGG GAGCTGATCG AGAAGAAGTC CCGTTTCGGC AAGCTCTTCT ACTCCTGTAA CCGCTACCCC GAGTGCAAGT TCGCCCTGTG GGATCTCCCG GTGCAGCAGC CCTGTCCCAA GTGCGGCTTC CCGCTCCTCA TCAAGAAGGT CTACAAGCGC GAGGGGGAAT TCCTCAAGTG CCCGAAGGAA GGGTGCGACT ACAAGAGCAA CCAGTCCTAG
|
Protein sequence | MSQNLVIVES PAKAKTIEKF LGHDYKVLAS FGHVRALPSK QGSVDTEHDF EPKYAVLPES KKHIDAIKKE MKGISSLLLA TDPDREGEAI SWHLLAALGL DGKKKLPFEI KRVVFHEITK DAIVHAVENP RDIALDLVDA QQARSILDYL VGFTLSPFLW KKIRYGLSAG RVQSVALRLI CEREKEIQAF KEQEYWTIAA KLETAKKQGL TATLVEAEGK KLGKFDIPDQ ETAYRHFAKL GGKEFPPQEG DESGKPLTVE HPAHPEYRVD KVTKSERKRQ PSPPFTTSTL QQEAARKLGF SAKKTMSTAQ KLYEGIDVGE GAVGLITYMR TDSVALSNVA LEEAKSVITS LYGKEYALEK PRFFKNKSKN AQEAHEAVRP TYIAKTPVEL KKFLSSDQFK LYDLIWKRTV ACQMAEALLD QTSVDIGAGE GFRFRVAGTV IRFAGFMKLY IEGVDDEAED KDKEGLLPPL AEGDILKLQQ LLPERHFTQP PPRYTEASLV KTLEEYGIGR PSTYASIMNT LTERKYARLD KKRFFPEDVG MVVSDLLTNH FTQYVDYNFT ANLEEELDMV SRGEKQWRPL LHEFWGPFIN LLKLKEGEVN KSDLTTEATN EVCPECGKPL VVKLGKFGKF YACTGYPECR YIRPLDKETG EVVEPVVSEE ICDKCGSHML IKDGRFGKYL ACSAYPNCKN IQPLVKPKGT GITCAECGKG ELIEKKSRFG KLFYSCNRYP ECKFALWDLP VQQPCPKCGF PLLIKKVYKR EGEFLKCPKE GCDYKSNQS
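
The GC content field reported in the Gene Information table can in principle be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: `gc_content` is a hypothetical helper that ignores the spaces separating the 10-base blocks and counts only G and C against the total number of characters.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    # Drop the spaces and line breaks between 10-base blocks.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First three 10-base blocks of the gene sequence above, spaces included.
fragment = "ATGTCTCAGA ATCTCGTCAT CGTTGAGTCT"
print(round(gc_content(fragment), 1))
```

Passing the full gene sequence instead of the fragment would give a figure to compare against the rounded value listed in the table.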