Gene GM21_1529 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1529 
Symbol 
ID8136858 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1792492 
End bp1794831 
Gene Length2340 bp 
Protein Length779 aa 
Translation table11 
GC content61% 
IMG OID644869141 
ProductDNA topoisomerase I 
Protein accessionYP_003021343 
Protein GI253700154 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value6.23259e-34 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGTCTCAGA ATCTCGTCAT CGTTGAGTCT CCCGCCAAGG CGAAGACCAT TGAAAAATTC 
CTCGGCCACG ACTACAAGGT CCTGGCCTCG TTCGGTCACG TGCGCGCCCT CCCCAGCAAG
CAGGGGTCGG TGGACACGGA GCACGACTTC GAGCCGAAGT ACGCCGTCCT CCCCGAGAGC
AAAAAGCACA TCGACGCCAT CAAGAAGGAG ATGAAGGGAA TCTCCTCGCT CCTCTTGGCG
ACAGACCCCG ACCGCGAAGG GGAGGCGATC TCCTGGCACC TTTTGGCCGC GCTCGGCCTG
GACGGGAAGA AGAAGCTTCC GTTCGAGATC AAGCGGGTGG TCTTCCACGA GATCACCAAG
GACGCCATCG TGCATGCGGT GGAAAATCCC CGCGACATCG CCCTTGACCT GGTGGATGCC
CAGCAGGCCC GCTCCATCCT CGATTACCTG GTCGGCTTCA CCCTCTCTCC TTTTCTTTGG
AAGAAGATCC GCTACGGACT TTCCGCGGGG CGGGTGCAGT CGGTCGCACT GAGGCTCATC
TGCGAGCGCG AGAAGGAGAT CCAGGCCTTC AAGGAGCAGG AGTACTGGAC CATCGCCGCG
AAGCTCGAGA CCGCCAAGAA GCAGGGGCTC ACGGCCACGC TGGTGGAGGC CGAGGGGAAG
AAGCTAGGCA AGTTCGACAT CCCCGACCAG GAAACGGCCT ACCGGCACTT CGCCAAGCTA
GGGGGGAAGG AATTCCCGCC GCAGGAAGGG GACGAGTCGG GAAAACCGCT GACGGTCGAG
CACCCGGCGC ACCCCGAGTA CCGGGTCGAC AAGGTCACCA AGAGCGAGAG AAAGCGCCAG
CCTTCTCCCC CTTTCACCAC CTCCACCTTG CAGCAGGAGG CGGCGCGCAA ACTCGGCTTC
TCCGCCAAGA AGACCATGTC CACGGCCCAA AAGCTCTACG AGGGGATCGA CGTCGGCGAA
GGGGCCGTAG GTCTCATCAC CTACATGCGT ACCGACTCCG TCGCCCTTTC CAACGTGGCG
CTGGAAGAGG CAAAGTCGGT GATCACCTCG CTCTACGGCA AGGAATACGC GCTGGAGAAG
CCGCGCTTCT TCAAGAACAA GTCCAAAAAC GCCCAGGAAG CGCACGAGGC GGTCCGCCCG
ACCTACATCG CCAAGACCCC GGTCGAGCTG AAGAAGTTCC TGAGCAGCGA CCAGTTCAAG
CTCTACGACC TGATCTGGAA GAGGACTGTC GCCTGCCAAA TGGCCGAGGC GCTCCTGGAC
CAGACCTCGG TCGATATCGG CGCGGGAGAA GGGTTCCGCT TCCGGGTCGC AGGCACCGTG
ATCCGCTTCG CCGGCTTCAT GAAGCTCTAC ATCGAAGGGG TCGACGACGA GGCGGAGGAC
AAGGATAAGG AAGGGCTGCT CCCGCCTTTG GCCGAGGGGG ACATCCTCAA GCTGCAGCAG
CTTCTCCCCG AGCGGCACTT CACCCAGCCC CCGCCGCGCT ACACCGAGGC GAGTCTGGTG
AAGACCCTGG AGGAGTACGG CATCGGGCGC CCGTCCACCT ACGCCTCCAT CATGAACACC
CTTACCGAGA GAAAGTACGC GCGCCTGGAC AAGAAGCGCT TCTTCCCCGA AGACGTAGGG
ATGGTGGTCT CCGACCTTTT GACCAACCAC TTCACCCAGT ACGTCGACTA CAACTTCACG
GCGAACCTGG AGGAAGAGCT CGACATGGTC TCGCGCGGCG AGAAGCAGTG GCGCCCGCTC
CTGCACGAGT TCTGGGGTCC CTTCATCAAC CTCTTGAAGC TGAAGGAAGG GGAGGTGAAC
AAGTCGGATT TAACCACCGA GGCGACCAAC GAGGTCTGCC CCGAGTGCGG CAAGCCCCTG
GTGGTGAAGC TCGGCAAGTT CGGCAAGTTC TACGCCTGCA CCGGTTATCC CGAATGCCGT
TACATCCGAC CGTTGGACAA GGAGACGGGC GAGGTGGTCG AGCCCGTGGT TTCCGAGGAA
ATCTGCGACA AGTGCGGCAG CCACATGCTG ATCAAGGACG GGCGTTTCGG CAAGTATCTG
GCCTGTTCCG CCTACCCCAA CTGCAAGAAC ATCCAGCCGC TGGTGAAGCC CAAGGGTACC
GGGATCACCT GCGCCGAATG CGGCAAGGGG GAGCTGATCG AGAAGAAGTC CCGTTTCGGC
AAGCTCTTCT ACTCCTGTAA CCGCTACCCC GAGTGCAAGT TCGCCCTGTG GGATCTCCCG
GTGCAGCAGC CCTGTCCCAA GTGCGGCTTC CCGCTCCTCA TCAAGAAGGT CTACAAGCGC
GAGGGGGAAT TCCTCAAGTG CCCGAAGGAA GGGTGCGACT ACAAGAGCAA CCAGTCCTAG
 
Protein sequence
MSQNLVIVES PAKAKTIEKF LGHDYKVLAS FGHVRALPSK QGSVDTEHDF EPKYAVLPES 
KKHIDAIKKE MKGISSLLLA TDPDREGEAI SWHLLAALGL DGKKKLPFEI KRVVFHEITK
DAIVHAVENP RDIALDLVDA QQARSILDYL VGFTLSPFLW KKIRYGLSAG RVQSVALRLI
CEREKEIQAF KEQEYWTIAA KLETAKKQGL TATLVEAEGK KLGKFDIPDQ ETAYRHFAKL
GGKEFPPQEG DESGKPLTVE HPAHPEYRVD KVTKSERKRQ PSPPFTTSTL QQEAARKLGF
SAKKTMSTAQ KLYEGIDVGE GAVGLITYMR TDSVALSNVA LEEAKSVITS LYGKEYALEK
PRFFKNKSKN AQEAHEAVRP TYIAKTPVEL KKFLSSDQFK LYDLIWKRTV ACQMAEALLD
QTSVDIGAGE GFRFRVAGTV IRFAGFMKLY IEGVDDEAED KDKEGLLPPL AEGDILKLQQ
LLPERHFTQP PPRYTEASLV KTLEEYGIGR PSTYASIMNT LTERKYARLD KKRFFPEDVG
MVVSDLLTNH FTQYVDYNFT ANLEEELDMV SRGEKQWRPL LHEFWGPFIN LLKLKEGEVN
KSDLTTEATN EVCPECGKPL VVKLGKFGKF YACTGYPECR YIRPLDKETG EVVEPVVSEE
ICDKCGSHML IKDGRFGKYL ACSAYPNCKN IQPLVKPKGT GITCAECGKG ELIEKKSRFG
KLFYSCNRYP ECKFALWDLP VQQPCPKCGF PLLIKKVYKR EGEFLKCPKE GCDYKSNQS