Gene Cagg_1347 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1347 
Symbol 
ID7268639 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1667890 
End bp1670232 
Gene Length2343 bp 
Protein Length780 aa 
Translation table11 
GC content57% 
IMG OID643566190 
ProductDNA topoisomerase I 
Protein accessionYP_002462690 
Protein GI219848257 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA
[COG0551] Zn-finger domain associated with topoisomerase type I 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000873362 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000469192 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGGGTGAAA AAGTTGTGAT TGTTGAGTCG CCGGCCAAAG CGCGGACGAT TCAGAAATAT 
CTGGGTAAAG GCTATAAAGT GACCTCAAGT ATGGGTCACG TGCGCGATTT GCCGAAAAAT
GGGCTGGCGA TCGATATCGA GCACGATTTT GCTCCTTCTT ATGAAATTGT GAAGCCCAAG
GTAGTGAGTG AGCTGAACCA GGCTGTGCGC AATGCTGACG CGATCTATTT GGCAACTGAC
CCCGACCGTG AGGGCGAGGC GATTGCGTGG CATATTACCC AAGCGGTGAA GACGCCGAAA
AAGACGCCGA TCTATCGCGT CGTTTTTCAA GAGATTACGC GCAATGCCGT TCAGCAGGCG
TTGCAGCAGC CACGTCAGAT CAACCAAAAT CTGGTTGACG CGCAACAAGC ACGGCGCGTG
CTCGACCGTT TGGTCGGTTA CCAGCTTAGT CCATTGTTGT GGGATAAGGT CAAGCGCGGG
CTGAGCGCCG GACGGGTGCA ATCGGTGGCG GTGCGGCTGA TCGTCGAGCG TGAGCGTGAA
ATTGAGAACT TTAAGCCGCA AGAGTATTGG ACAATCGAGG CTGATCTGCT GAAAGAGGCC
GGTATCGCGC CGCGTGATCT GTTTCGGGCG ACGCTCATCG AGCGCGACGG TAAGAAGCTT
GAGAAATTCT CGATTGAACG CCGTGAGCAA GCTGAGGCGA TTGTCGCCGA TCTACAAGGT
GCGGCGTATA CCGTCCTTAA AGTGACCCGT CGCGATAAGC GGCGATCACC ACCACCACCG
TTTACCACCA GCACCTTACA ACAAGAGGCT GCCCGTAAGT TGGGTTTCAG CGCGAAGAAG
ACGATGATGC TGGCTCAGCG TCTCTACGAA GGTGTTGATA TTGGTGGTGA GGAGGGGATG
GTCGGTCTCA TCACCTATAT GCGTACCGAT AGTGTGCAGG TGGCGGCAGA AGCCCAAGCT
GAGGCGCGTG AGGTGATCGA TCGGCGGTTT GGGCGTGAGT ATCTGCCCGA CCAGCCGCCG
GTCTACAAGA TCAAGGCGAA AGGCGCGCAA GAGGCTCACG AGGCAATCCG GCCTACCAGC
AGTGCCCGTA CTCCTGAGCA GTTGAGCGAA CGGCTGGAGC GCGATCTGTG GCGGCTCTAC
GATCTGATTT GGAAGCGATT TATCGCTTCG CAGATGGCTC CGGCCATTTT CGACAGCACC
ACCGTTGATA TTGCTGCCCA ACCGAGTGTG GCCGGTGCGC CACCCTACTT GTTCCGTGCT
ACCGGCTCGG TGCTCAAGTT CCCCGGCTTC CTTGCCGTTT ACAACGTGAG CCTTGATGAG
GGCGAGGAAG ATGAAGACAG TGAGCGTCGC TTGCCGCCAC TGGTCGAGGG CGAAAACCTC
CAGTTAGTTG AGCTGTTGCC GGTGCAGCAC TTCACCGAAC CGCCGCCGCG CTACACCGAG
GCCAGTCTGG TGAAAGAACT CGAACGTCTT GGGATTGGGC GTCCGAGTAC CTACGCAACG
ATTCTTTCGA CCATCCAGGA ACGCGAGTAC GTCGAGATGG TCGATAAGAA ACTGATTCCG
ACGATGCTTG GCCGGATTGT GACCGACTTG TTGGTTGAGC ATTTCGGCAA CATCGTCGAT
TACGACTTTA CGTCGTCGCT TGAACAGCAG CTTGACGATA TTGCCGAAGG CTCGAAGCAG
TGGGTGCCGG TGCTGCGCGA ATTCTATGGC CCCTTCCGCT CGACGCTGGA AACAGCTCAA
CGCCAGATGC GCAATGTCAA GCGCGAAGAG ATTATCACCG ATCTCGATTG CCCGAAGTGC
GGCAAAGGGA AGCTGGTGAT CAAGTTTGGC CGCAACGGCG AGTTTCTGGC CTGTTCGCGC
TACAACCGGG AAGGCGAGGG TGATTCGTGC GATTTCACCG GCGATTTTCA CCGCGATGAA
AATGGCAATA TTGTGCTCGA TCAGGCCAGC GCGCCAGAGA CGAGCGATGT CTTGTGTAAT
GTCTGTGGGC GGCCAATGGT GATCAAGAAG AGCCGTTTCG GCCCCTTCCT CGGCTGTTCG
GGATACCCTG AATGCACCAA CACCCGCCGG ATTGGCCGCG ACGGCAAGCC GGTTCCACTC
CCCGAACCAA CCGGCGTTAC CTGCCCGAAG TGCGGTGAAG GGGAGTTACT ACGTCGACGC
GGCAAGTTTG GCCGTCCGTT CTACGGCTGC TCGCGCTACC CCAAGTGCGA CTACATCACC
AACTCGCTTG ACGAAGCGCA GGCAGGAGTG GCGGTCGAAG CTGCGCCAGC GCTGCCTCCT
ACCGTTGAGA AACCGGCGGC ACCAGCCCGC AAATCTAGTG GCAAGACCCG CAAATCGGCG
TAA
 
Protein sequence
MGEKVVIVES PAKARTIQKY LGKGYKVTSS MGHVRDLPKN GLAIDIEHDF APSYEIVKPK 
VVSELNQAVR NADAIYLATD PDREGEAIAW HITQAVKTPK KTPIYRVVFQ EITRNAVQQA
LQQPRQINQN LVDAQQARRV LDRLVGYQLS PLLWDKVKRG LSAGRVQSVA VRLIVERERE
IENFKPQEYW TIEADLLKEA GIAPRDLFRA TLIERDGKKL EKFSIERREQ AEAIVADLQG
AAYTVLKVTR RDKRRSPPPP FTTSTLQQEA ARKLGFSAKK TMMLAQRLYE GVDIGGEEGM
VGLITYMRTD SVQVAAEAQA EAREVIDRRF GREYLPDQPP VYKIKAKGAQ EAHEAIRPTS
SARTPEQLSE RLERDLWRLY DLIWKRFIAS QMAPAIFDST TVDIAAQPSV AGAPPYLFRA
TGSVLKFPGF LAVYNVSLDE GEEDEDSERR LPPLVEGENL QLVELLPVQH FTEPPPRYTE
ASLVKELERL GIGRPSTYAT ILSTIQEREY VEMVDKKLIP TMLGRIVTDL LVEHFGNIVD
YDFTSSLEQQ LDDIAEGSKQ WVPVLREFYG PFRSTLETAQ RQMRNVKREE IITDLDCPKC
GKGKLVIKFG RNGEFLACSR YNREGEGDSC DFTGDFHRDE NGNIVLDQAS APETSDVLCN
VCGRPMVIKK SRFGPFLGCS GYPECTNTRR IGRDGKPVPL PEPTGVTCPK CGEGELLRRR
GKFGRPFYGC SRYPKCDYIT NSLDEAQAGV AVEAAPALPP TVEKPAAPAR KSSGKTRKSA