Gene Ccel_0922 details

Gene Information

Locus tag: Ccel_0922
Symbol:
ID: 7309760
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 1098275
End bp: 1100443
Gene Length: 2169 bp
Protein Length: 722 aa
Translation table: 11
GC content: 41%
IMG OID: 643607854
Product: DNA topoisomerase III
Protein accession: YP_002505269
Protein GI: 220928360
COG category: [L] Replication, recombination and repair
COG ID: [COG0550] Topoisomerase IA
TIGRFAM ID: [TIGR01056] DNA topoisomerase III, bacteria and conjugative plasmid
            [TIGR01057] DNA topoisomerase I, archaeal
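
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(1098275, 1100443)
print(length)                     # 2169
print(protein_length_aa(length))  # 722
```

Both values agree with the Gene Length and Protein Length fields listed for this record.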


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTTTTA CATTAATATT AGCAGAAAAA CCTTCGGTAG CAAGAGATTT GGCAAAAGTC 
CTGAACTGCG GGCAAAATGC CAATGGCTAT ATAATGGGAA AAAAATATAT CGTAACCTGG
GCCTTGGGAC ATCTTGTTAC CCTTGCAGAC CCTGAAGCTT ACGGCAACAA GTACAAGACA
TGGAATCTGG AAGACCTGCC AATGCTGCCA AACAAAATGG AACTGGTAGT AATAAAGCAG
ACTGCAAAAC AGTACGGAGT AGTCAGAGGT CTTTTGAACA GGGCCGATGT GGACGAGCTT
GTAATAGCCA CTGATTCCGG GCGTGAAGGT GAGCTTGTAG CCAGATGGAT TATAATGAAG
GCAGGATTTA AAAAGCCGAT AAAGCGTCTA TGGATTTCAT CCCAGACGGA CAAGGCCATA
AAGGAAGGGT TTGCAAAATT AAGGCCTTCC AAGGAATACG ACAACCTATA CTATTCGGCA
CAGAGCAGGG CAGAAGCCGA CTGGCTCATT GGTCTTAATG TCACAAGGGC ATTGACCTGC
AAATACAATG CTCAGCTATC AGCAGGGAGA GTACAGACAC CTACACTTGC CATGATTGTA
GAAAGAGAAG AGGAAATTCG TAAATTCAGG CCCAAGGACT ATTGGACAAT TTCCGCACAG
TTTAACGGAT TTACGGTACA GTGGCAGGAT AGCAGAAACA ATCAGACCAG AACTTTCAAC
AAAGAAGAGG CGGACGGAAT AGTAGCTAAA ATAACAGGAC AGATGGGTGA AGTAGTTGAG
GTAAAAAAAG AAACTAAGAA AGAATTGCCT CCATTGGCTT ACGACCTGAC AGAGCTGCAA
AGAGATGCAA ACAAAAAATT CTCATATTCA GCAAAACAAA CCCTTAACAT AATGCAGCGT
CTTTATGAAT CACACAAGCT GGTTACGTAT CCAAGAACAG ATTCAAGGTA CATAACGGAT
GACATTGTAC CTACCTTGAA CGAACGCTTG AAAAGCATAG CGGTGGGCCC TTATGCAAAG
CTTGTACAGG GAGTTATGAG AAACAAACCA AGTGTTACAA AAAGGTTTGT GGATAACAGC
AAGGTTACAG ACCACCATGC CATAATACCA ACAGAACAAT TTGTGGACTT ATTCTCACTG
AATTCAGAAG AGAGAAATAT ATATGACCTT ATAGTAAAAA GATTTATTGC AGTACTAAGC
AAGCCGTTTG AATATGAGCA GACTACAGTA AAGCTTGATA TAGCAGGAGA AAGTTTTTAT
GCAAAAGGGA AAATTGTCAA ATCATCGGGC TGGAAAACAG TCTATGACGG TTTTGGAAAA
CTTGATGAAG ACGATGAAGA TGATAATGAC CAGTCACTGC CGGATATCCA GAAGGGCCAT
AAGGCAAAAG TTGTCGGACC TAAATCAATA AACGGAAAAA CAAAGCCACC CGCAAGGTAT
ACTGAGGCCA CTTTACTTTC TGCAATGGAG CATCCGGGGA AATTTGTTGA CAACAAAGCG
TTAAAGGAAG CCTTAGAAAG CACCAGCGGA CTTGGCACAC CAGCTACAAG AGCAGATATA
ATAGAAAAGC TTTTTAATAC GTTTTATATA GAAAGAAAGG GTAAGGAAAT TTACCCTACC
TCAAAGGGAA CTCAACTTAT TTCACTGGTT CCTACTGATT TGAAATCTCC TGAACTTACT
GCAAAGTGGG AGCAGCAGCT TACCCTTATA AGTAAAGGTA AAGTCAATTC AAATGTGTTT
GTAAATGATA TGAAAAAATA TGCAAGAAAA TTAGTGGGAG CGGTTATAGC AAGTTCGGAA
CAGTTCAAAC ATGATAACGT AACCCGGGAA AAGTGTCCCG AATGTGGGAA ATACCTTCTG
GAAGTAAACG GTAAAAAAGG CAAAATGCAT ATCTGTCCTG ACAGAGAATG TGGCTACAGA
AAGTCTGTAA CCGTAATATC AAATGCCAGA TGCCCTGAAT GTCACAAGAA AATGGAAATC
AGAGGAGAGG GAGAAAATAA GTCATTTTAC TGTTCATGCG GTTATAGGGA GAAGTTGGAC
GCTTTTAAGA AACGTAAAGG TCAACAGGTT GATAAAAGGG AGGTTGCCAA ATTCATGAGG
CAGCAGGAAA AGGACGAAAA CATAAACTCT GCACTTGCTG ATGCATTGGC AAAGTGGAAA
AATAATTGA
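
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any computation such as the GC content reported in the record. The sketch below shows one possible way to do that. The helper names are hypothetical, and only the first row of the sequence is used as a demonstration, so the printed percentage describes that row and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence only, as a short demonstration.
first_row = "ATGGGTTTTA CATTAATATT AGCAGAAAAA CCTTCGGTAG CAAGAGATTT GGCAAAAGTC"
print(len(clean_sequence(first_row)))   # 60
print(f"{gc_fraction(first_row):.0%}")  # 35%
```

Passing the complete gene sequence to `gc_fraction` gives a value that can be compared against the GC content field of the record.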
 
Protein sequence
MGFTLILAEK PSVARDLAKV LNCGQNANGY IMGKKYIVTW ALGHLVTLAD PEAYGNKYKT 
WNLEDLPMLP NKMELVVIKQ TAKQYGVVRG LLNRADVDEL VIATDSGREG ELVARWIIMK
AGFKKPIKRL WISSQTDKAI KEGFAKLRPS KEYDNLYYSA QSRAEADWLI GLNVTRALTC
KYNAQLSAGR VQTPTLAMIV EREEEIRKFR PKDYWTISAQ FNGFTVQWQD SRNNQTRTFN
KEEADGIVAK ITGQMGEVVE VKKETKKELP PLAYDLTELQ RDANKKFSYS AKQTLNIMQR
LYESHKLVTY PRTDSRYITD DIVPTLNERL KSIAVGPYAK LVQGVMRNKP SVTKRFVDNS
KVTDHHAIIP TEQFVDLFSL NSEERNIYDL IVKRFIAVLS KPFEYEQTTV KLDIAGESFY
AKGKIVKSSG WKTVYDGFGK LDEDDEDDND QSLPDIQKGH KAKVVGPKSI NGKTKPPARY
TEATLLSAME HPGKFVDNKA LKEALESTSG LGTPATRADI IEKLFNTFYI ERKGKEIYPT
SKGTQLISLV PTDLKSPELT AKWEQQLTLI SKGKVNSNVF VNDMKKYARK LVGAVIASSE
QFKHDNVTRE KCPECGKYLL EVNGKKGKMH ICPDRECGYR KSVTVISNAR CPECHKKMEI
RGEGENKSFY CSCGYREKLD AFKKRKGQQV DKREVAKFMR QQEKDENINS ALADALAKWK
NN
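
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The following is a minimal sketch of such a translation, not the tool that produced this record. It relies on the fact that table 11 uses the same codon-to-amino-acid assignments as the standard genetic code, and it does not handle the alternative start codons that table 11 permits, which is sufficient here because the gene begins with ATG. All names in the code are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in TCAG order for each position.
# Table 11 shares these assignments with the standard code ("*" marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row and final nine bases of the gene sequence shown in this record.
print(translate("ATGGGTTTTA CATTAATATT AGCAGAAAAA CCTTCGGTAG CAAGAGATTT GGCAAAAGTC"))
# MGFTLILAEKPSVARDLAKV
print(translate("AATAATTGA"))  # NN
```

The two outputs match the first twenty and the last two residues of the protein sequence listed above.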