Gene Cthe_1638 details


Gene Information

Locus tag: Cthe_1638
Symbol:
ID: 4809333
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1964406
End bp: 1965659
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 44%
IMG OID: 640107053
Product: ParB-like nuclease
Protein accession: YP_001038054
Protein GI: 125974144
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG0863] DNA modification methylase; [COG1475] Predicted transcriptional regulators
TIGRFAM ID:
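
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1964406
end_bp = 1965659

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1254 417
```

Under these assumptions the computed values agree with the listed Gene Length (1254 bp) and Protein Length (417 aa).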


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0822558
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATATAC TGAAAATACC AACAGAAAAA CTAAAACCAT CTAAATATAA TCCGCGGAAA 
GATTTAAAGC CTGGTGACCC TGAATATGAA AAATTACGTC GGTCTATTGA AGAGTTTGGA
TATGTAGAGC CGGTTATATG GAATAAACGC ACCGGGAACA TTGTCGGCGG ACATCAGCGT
TATAAAGTAC TTACAGCTTT GGGGTATAAG GAGATCGACT GTGTTGTAGT TGATTTGGAT
GAACAGCGGG AAAAGGCGCT CAATGTTGCA CTGAATAAAA TCAGTGGCGA GTTTGATATT
CCGCTTTTGA CCGATCTGCT TATGGATTTA AATGAAGATG GTTTTGACGT TTCTCTTACC
GGGTTTGATG CTGCGGAAAT TGATGAGTTG TTCCGTGATA AAACAACCGC TAATGTCAAA
GAGGATAATT TCGATACAGA AAAGGCAATT GCAGAGATTG AAACTCCGGT CACTAAAAAG
GGCGACATAT GGGTGCTTGG CAGCCACCGT CTGATGTGCG GTGATAGCAC CATGCTTTCA
GATGTGCAAA AGCTGATGAA CGGACAAAAG GCGAGATTTG TTTTCACCGA CCCACCCTGG
AATGTTGATT ACGGTTCAGA TACCAGGCAT CCAAGCTGGA AGCCAAGACA AATTCTAAAT
GACAATATGA GCACCGAAGA ATTCGGCGCT TTTTTATTGC GCGCTTTTAA ATGCATGAAA
GAGGTTTCTG AAGCCGGATG CATGACCTAT ATAGTAATGA GTGCTCAGGA ATGGGGCAGT
TTGATGAACG TCATGCGGGA GGCAGGGTAT CACTGGTCGA GCACAATTAT ATGGAAAAAA
GACAGCTTGG TACTGTCAAG AAAGGACTAT CATACCCAGT ACGAGCCGAT CTGGTACGGT
TGGCTTGAAG GAACACGCCT TTGCCCGCTT AAAGACCGTA AACAGTCAGA TGTTTGGGAG
ATACCCCGTC CTAAAGTATC GGAGGAGCAC CCTACCATGA AGCCGGTATC GCTTGTAGCA
AAGGCAATGC TCAATAGTTC CCATATTGGA GATTTAACTC TTGACCTGTT CGGTGGTTCT
GGTACGACAA TGATTGCGGC ACAGCAGACC GGGCGGGTTT GTTTTATGAT GGAGCTTGAC
CCGAAATACT GCGATGTGAT TGTAAAGCGC TATGTTTCAC AATTTGGCGC AGATTCAGTA
TTCTTGGTAA CAGGTAGTGA AAAAATACCT TACGCGGAAA CACAGATTGA TTAA
 
Protein sequence
MDILKIPTEK LKPSKYNPRK DLKPGDPEYE KLRRSIEEFG YVEPVIWNKR TGNIVGGHQR 
YKVLTALGYK EIDCVVVDLD EQREKALNVA LNKISGEFDI PLLTDLLMDL NEDGFDVSLT
GFDAAEIDEL FRDKTTANVK EDNFDTEKAI AEIETPVTKK GDIWVLGSHR LMCGDSTMLS
DVQKLMNGQK ARFVFTDPPW NVDYGSDTRH PSWKPRQILN DNMSTEEFGA FLLRAFKCMK
EVSEAGCMTY IVMSAQEWGS LMNVMREAGY HWSSTIIWKK DSLVLSRKDY HTQYEPIWYG
WLEGTRLCPL KDRKQSDVWE IPRPKVSEEH PTMKPVSLVA KAMLNSSHIG DLTLDLFGGS
GTTMIAAQQT GRVCFMMELD PKYCDVIVKR YVSQFGADSV FLVTGSEKIP YAETQID
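
The gene sequence and protein sequence above are related through translation table 11. The following is a rough sketch of that translation, not the method used to produce this record. It assumes the amino-acid assignments of table 11 match the standard code (the two tables differ in which codons may act as start codons), and it does not handle alternative start codons, which is harmless here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical helpers.

```python
from itertools import product

# Codon table in the usual TCAG ordering (first, second, third base).
# Assumption: table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence in blocks of 10.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGGATATAC TGAAAATACC AACAGAAAAA CTAAAACCAT CTAAATATAA TCCGCGGAAA"
print(translate(first_line))  # MDILKIPTEKLKPSKYNPRK
```

The printed residues match the first 20 residues of the protein sequence listed above (MDILKIPTEK LKPSKYNPRK).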