Gene Clim_2433 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_2433 
Symbol 
ID6355904 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp2664406 
End bp2666802 
Gene Length2397 bp 
Protein Length798 aa 
Translation table11 
GC content52% 
IMG OID642670023 
ProductDNA topoisomerase I 
Protein accessionYP_001944433 
Protein GI189347904 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA
[COG0551] Zn-finger domain associated with topoisomerase type I 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.336729 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCATCAA AAACAGAAGC TCTTTCAGCC AGAAACAGAA CCCTTATTGT CGTTGAATCT 
CCTTCAAAGG CAAAAACCAT CAACAAATAT CTGGGCGACC GCTACACGGT TTTCGCGTCG
GTGGGGCACA TCAAAGATCT GCCGAAACGG GAAATCGGCC TTGATTTCGA TCATAACTAC
GAACCCCGCT ATGAGGTCAT TGCCGGAAAG GAAAAAGTTG TCCGACAGTT GAAAAAACTT
GCCGGAGAGG CCGACTCGGT ACTGATCGCT ACTGACCCTG ACCGCGAAGG CGAAGCTATA
GCCTGGCATA TTTCCAACGA AATCGAAGCT GCACGAAAAC CGGTTTTCAG GGTTTTGTTC
AATGAAATTA CGAAAAATGC GATTCTTGCA GCAATAAGCG AACCTCGCCA GATCGATTAC
CGTCTGGTGC GATCCCAGCA GACCCGTCAG GGACTCGACA AGATCGTGGG ATACAAGATC
AGTCCTTTCC TGTGGAATGT CGTGCTGCGA GGCCTCTCCG CAGGCAGGGT ACAATCGGTT
GCGCTCAGAC TTATCTGCGA ACGGGAAGAG GAGATCAACC GGTTCGAAAT TCAGGAGTAC
TGGACGGTTT CCGCCGACTT TGCAACGGCA AAAGGAGAAA TATTCAAGGC TAAACTGGTC
AAGGTTGACG GAGGCAAACC GGAACTCTCC AGTCAGGAGC AGGCCGAAGC CGCAGCTTCG
CTTGTAAGAA ACCGCCTCTA TGCTGTCGGG GACATTACGG CTAAAGCCCA GCAGCGCAAA
GCTCCGCTGC CATTTACCAC CTCGCTGCTC CAGCAGGCTG CTTCAAACCA GCTCGGGTTC
GGATCACAGA AAACCATGCG TATTGCCCAG CAGCTTTACG AAGGCATAGA TCTCGGAAGC
GAAGGGGCAA CAGGCCTGAT TACTTACATG CGTACCGATT CGACCCGTAT CGGCTCCGAA
GCCGTTGCGG AAGCCCATAA ATATATACGT GCAGTATTCG GTTCCGAATA TACCGGTTAC
GGCAGTTCGG CGAAATCCCC GAAAAATGCC CAGGACGCCC ACGAAGCTAT CAGGCCAACA
TCTATCGAAA GAAAGCCCGA AGCGATGAAA CCCTATCTTT CCGCAGACCA GTATCGCCTC
TATGAGCTTA TCTGGAAACG GTTTCTTGCC GCCATGATGG CCCCGGCGAA AATCGAACAG
ACAAAGGTTG ATGTCGAAGA TCACGAACGC CGCATTGTGT TCCGGGCGAA CGGAAGCCGC
GTGCTTTTCC CCGGCTTCAT GCGGGTGTAT GACGATCAGC AGGAGCTTGA GTATGAAGCC
CGTACATCGA CAAAGGAGGA TGTCGAAAAA GAACAGACCG TCAAACTTCC TGATACGCTT
GCCGCACGGG ACAGCCTGAA TCTTGACGAA ATCGAACAGA AACAGAGTTT TACCCGTCCT
CCTGCCCGTT ACAGTGAAGC TACGCTGGTA AAAGATCTCG ACAACTACGG CATCGGACGC
CCTTCGACCT ATGCCTCGAT ATTTTCCACC CTGCAGGATC GTCGCTATGT TGTATTGCAG
AAAAAAAAAA TAGCGCCTAC CGATCTCGGA AAAGATGTCT CACAGATTCT TGTGGCTAAT
TTTCCGGACA TCTTCAATAT CCGCTTTACC GCGTTCATGG AGGATGAACT GGACAAAGTG
GCGGCAGGTG ATGACGAATA TGAAAAGGTG CTCGACAGCT TTTACCGACC CCTGGAAACC
GCACTCAGCC TGAGAAAGAA TGACCCGCTC ATTCCACAGA ACATGAATGC GGAAACCTGC
GACAAGTGCG GTGAGGGCCG AATGATCATC AAATGGACGG CCAGCGGAAA ATTCCTTGGC
TGCTCACGAT ACCCTGCCTG CAAAAACATC AAACCGATCA GCTCGACAAA GGCAAAACCG
AAAGAAACCG GCATAAAATG CCCCTCCTGC AGTGACGGCC AGATGCTCCT CCGCGACGGG
CGGCTCGGCC CGTTCCTTGC ATGTTCAGGC TATCCCAAAT GCAACACCCT GCTCAATCTC
AGCAAACAGC GCCATATCGA ACCGCTTAAA ACTCCGCCGG TTCAGACCGA TCTCCCCTGT
CCGAAATGCG GAGCGCCTCT GTATCTCAGA AGCGGTAAAC GGGGGTTATG GCTGGGTTGC
TCAAAATTTC CTAAATGCCG GGGACGCCTT GCATGGAACT CGCTTGACCC TGCTCTTCAG
CTGCACTGGG AAGGGGTTAT GGCAGAACAC CGCAAAGCCC ATCCGGATGT CGTGCTGACC
ATGACCGATG GGCGACCGGT TCCAATGACT CTTCCTGTAG ACGACATAAT GGCAAGAGCA
GAGGAGAACG GTCTGATTGC ACCTGTAACC GAAGAGAATG AGGGCGTCAC AGTATGA
 
Protein sequence
MASKTEALSA RNRTLIVVES PSKAKTINKY LGDRYTVFAS VGHIKDLPKR EIGLDFDHNY 
EPRYEVIAGK EKVVRQLKKL AGEADSVLIA TDPDREGEAI AWHISNEIEA ARKPVFRVLF
NEITKNAILA AISEPRQIDY RLVRSQQTRQ GLDKIVGYKI SPFLWNVVLR GLSAGRVQSV
ALRLICEREE EINRFEIQEY WTVSADFATA KGEIFKAKLV KVDGGKPELS SQEQAEAAAS
LVRNRLYAVG DITAKAQQRK APLPFTTSLL QQAASNQLGF GSQKTMRIAQ QLYEGIDLGS
EGATGLITYM RTDSTRIGSE AVAEAHKYIR AVFGSEYTGY GSSAKSPKNA QDAHEAIRPT
SIERKPEAMK PYLSADQYRL YELIWKRFLA AMMAPAKIEQ TKVDVEDHER RIVFRANGSR
VLFPGFMRVY DDQQELEYEA RTSTKEDVEK EQTVKLPDTL AARDSLNLDE IEQKQSFTRP
PARYSEATLV KDLDNYGIGR PSTYASIFST LQDRRYVVLQ KKKIAPTDLG KDVSQILVAN
FPDIFNIRFT AFMEDELDKV AAGDDEYEKV LDSFYRPLET ALSLRKNDPL IPQNMNAETC
DKCGEGRMII KWTASGKFLG CSRYPACKNI KPISSTKAKP KETGIKCPSC SDGQMLLRDG
RLGPFLACSG YPKCNTLLNL SKQRHIEPLK TPPVQTDLPC PKCGAPLYLR SGKRGLWLGC
SKFPKCRGRL AWNSLDPALQ LHWEGVMAEH RKAHPDVVLT MTDGRPVPMT LPVDDIMARA
EENGLIAPVT EENEGVTV