Gene Information |
Locus tag | Cthe_1638 |
Symbol | |
ID | 4809333 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1964406 |
End bp | 1965659 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640107053 |
Product | ParB-like nuclease |
Protein accession | YP_001038054 |
Protein GI | 125974144 |
COG category | [K] Transcription; [L] Replication, recombination and repair |
COG ID | [COG0863] DNA modification methylase; [COG1475] Predicted transcriptional regulators |
TIGRFAM ID | |
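The coordinate and length fields above agree with one another, which a short calculation can confirm. The following is a minimal sketch in Python, not part of the original record: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1964406, 1965659)
print(length_bp)                  # 1254
print(protein_length(length_bp))  # 417
```

Both results match the Gene Length (1254 bp) and Protein Length (417 aa) rows.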
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0822558 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGATATAC TGAAAATACC AACAGAAAAA CTAAAACCAT CTAAATATAA TCCGCGGAAA GATTTAAAGC CTGGTGACCC TGAATATGAA AAATTACGTC GGTCTATTGA AGAGTTTGGA TATGTAGAGC CGGTTATATG GAATAAACGC ACCGGGAACA TTGTCGGCGG ACATCAGCGT TATAAAGTAC TTACAGCTTT GGGGTATAAG GAGATCGACT GTGTTGTAGT TGATTTGGAT GAACAGCGGG AAAAGGCGCT CAATGTTGCA CTGAATAAAA TCAGTGGCGA GTTTGATATT CCGCTTTTGA CCGATCTGCT TATGGATTTA AATGAAGATG GTTTTGACGT TTCTCTTACC GGGTTTGATG CTGCGGAAAT TGATGAGTTG TTCCGTGATA AAACAACCGC TAATGTCAAA GAGGATAATT TCGATACAGA AAAGGCAATT GCAGAGATTG AAACTCCGGT CACTAAAAAG GGCGACATAT GGGTGCTTGG CAGCCACCGT CTGATGTGCG GTGATAGCAC CATGCTTTCA GATGTGCAAA AGCTGATGAA CGGACAAAAG GCGAGATTTG TTTTCACCGA CCCACCCTGG AATGTTGATT ACGGTTCAGA TACCAGGCAT CCAAGCTGGA AGCCAAGACA AATTCTAAAT GACAATATGA GCACCGAAGA ATTCGGCGCT TTTTTATTGC GCGCTTTTAA ATGCATGAAA GAGGTTTCTG AAGCCGGATG CATGACCTAT ATAGTAATGA GTGCTCAGGA ATGGGGCAGT TTGATGAACG TCATGCGGGA GGCAGGGTAT CACTGGTCGA GCACAATTAT ATGGAAAAAA GACAGCTTGG TACTGTCAAG AAAGGACTAT CATACCCAGT ACGAGCCGAT CTGGTACGGT TGGCTTGAAG GAACACGCCT TTGCCCGCTT AAAGACCGTA AACAGTCAGA TGTTTGGGAG ATACCCCGTC CTAAAGTATC GGAGGAGCAC CCTACCATGA AGCCGGTATC GCTTGTAGCA AAGGCAATGC TCAATAGTTC CCATATTGGA GATTTAACTC TTGACCTGTT CGGTGGTTCT GGTACGACAA TGATTGCGGC ACAGCAGACC GGGCGGGTTT GTTTTATGAT GGAGCTTGAC CCGAAATACT GCGATGTGAT TGTAAAGCGC TATGTTTCAC AATTTGGCGC AGATTCAGTA TTCTTGGTAA CAGGTAGTGA AAAAATACCT TACGCGGAAA CACAGATTGA TTAA
Protein sequence | MDILKIPTEK LKPSKYNPRK DLKPGDPEYE KLRRSIEEFG YVEPVIWNKR TGNIVGGHQR YKVLTALGYK EIDCVVVDLD EQREKALNVA LNKISGEFDI PLLTDLLMDL NEDGFDVSLT GFDAAEIDEL FRDKTTANVK EDNFDTEKAI AEIETPVTKK GDIWVLGSHR LMCGDSTMLS DVQKLMNGQK ARFVFTDPPW NVDYGSDTRH PSWKPRQILN DNMSTEEFGA FLLRAFKCMK EVSEAGCMTY IVMSAQEWGS LMNVMREAGY HWSSTIIWKK DSLVLSRKDY HTQYEPIWYG WLEGTRLCPL KDRKQSDVWE IPRPKVSEEH PTMKPVSLVA KAMLNSSHIG DLTLDLFGGS GTTMIAAQQT GRVCFMMELD PKYCDVIVKR YVSQFGADSV FLVTGSEKIP YAETQID
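The gene sequence listed above starts with ATG and ends with TAA, so it appears to be given in coding orientation even though the gene lies on the minus strand. The relationship between the two sequences can be sketched in Python as follows. This is an illustrative sketch only: the helper names are hypothetical, the codon table is built from the standard NCBI amino acid ordering (translation table 11 uses the same codon assignments as the standard code and differs only in permitted start codons), and the input is assumed to be an unambiguous DNA string that may contain spaces between 10-base blocks.

```python
from itertools import product

# Standard NCBI amino acid string for codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace between sequence blocks and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from the first base, stopping at a stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence from the record.
print(translate("ATGGATATAC TGAAAATACC"))  # MDILKI
```

The output matches the first six residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the Protein sequence and GC content rows to be checked in the same way.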