Gene Cthe_1955 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1955 
Symbol 
ID4810738 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2327899 
End bp2330055 
Gene Length2157 bp 
Protein Length718 aa 
Translation table11 
GC content45% 
IMG OID640107371 
ProductRNA binding S1 
Protein accessionYP_001038366 
Protein GI125974456 
COG category[K] Transcription 
COG ID[COG2183] Transcriptional accessory protein 
TIGRFAM ID[TIGR00426] competence protein ComEA helix-hairpin-helix repeat region 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGATA TTATTTCCAC ACTTGTAAAA GAGTTTAATC TCAAGCCTTT CCAGGTGGAA 
AACACCGTAA AACTTATTGA CAGCGGCAAT ACCATTCCCT TTATTGCAAG GTACAGGAAA
GAAATAACGG GAGAATTGAA CGATCAGGTG CTAAGGCAGC TTCATGAAAG ACTGATTTAC
CTGAGGAACC TTGAGGCAAG AAAAGAAGAA GTTCGGCGCC TGATTGACGA GCAGGGAAAG
CTTACCGCGG AAATTACGGC ATCTCTTGAA AAAGCGACCA CCCTCCGGGA GGTTGAAGAT
ATATACAGGC CTTTCAGGCC AAAAAGAAGA ACCAGGGCGA CTGTTGCAAA GGAAAAAGGA
CTTGAACCTT TGGCCGAAAT TATTATGGCC CAGGAACTTA AAACCGGCAG TATTGAAGAC
ATAGCCAAGC CTTTTATAAA TCCTGAGAAG GAAGTCAATA CCGTCGAGGA TGCCTTAAAC
GGAGCAATGG ACATAATAGC CGAAGACATT TCGGACAATC CGCATATCAG AAGTATTGTG
CGGGACGTGT TTATGAAACA AGGAATGATT GTGTCCAAAA AGAAGAAGGA TGAAGATTCG
GTATACAGAA TGTACTATGA TTTCTCGGAA CCGGTGGCAA AAATAGCCGG TCACAGGGTT
CTTGCGATAA ACAGGGGAGA AAAGGAAGAG TTTTTGCAAG TCAAAATTGA AGTTCCTGAA
GAAACGCTTA TGGAGCAGCT TAAGGCGAAA CTTGTAAAGA GGCCTCCTTC CATAACGTCG
GAATATGTAG AAAAGGCGTT GGCGGACTCT TATGAGCGCC TTATTTTTCC TTCGGTTGAG
AGGGAAGTAA GGAATGAACT TACGGAGAAT GCCGAGGAAC AGGCGATAAA GGTCTTTGCG
ACCAATCTTA AAAATCTTTT GCTCCAGCCT CCTGTGAAAG GAAAAACCGT TTTGGGGCTT
GACCCTGCAT ACAGGACGGG CTGCAAAATT GCAGTGGTGG ATGAGACGGG AAAAGTACTT
GACACTGCCG TAATATATCC GACACCTCCC CAGAACAAGG TTGAGGAAGC AAAAGAGATT
ATGAAGCGGC TTATTGAGAA ACACGGTGTT GATATAATAT CAATAGGCAA CGGGACTGCT
TCGAGGGAGT CTGAAATATT TGTCGCCGAG CTTTTGAAGG AGATAGACAG AAAAGTTTAC
TATATGGTGG TAAGCGAAGC GGGAGCTTCG GTTTATTCCG CTTCGAAGCT TGGGGCGGAG
GAATTTCCCG ACTTTGACGT GGCTTTAAGA AGTGCTGTGT CCATAGCCAG AAGGCTTCAG
GACCCATTGG CGGAGCTGGT TAAAATAGAT CCCAAATCCA TAGGCGTGGG CCAGTACCAG
CACGACATGA ATCAAAAGCG GCTGAGTGAG ACTTTGCAGG GCGTGGTTGA AGATTGTGTA
AACAGCGTGG GCGTTGACCT GAATACGGCC TCACCGTCTC TTTTGTCTTA CATCTCGGGA
ATAAACTCCG TAATTGCAAA AAATATTGTG GAATACAGGG AAACCAACGG AAAGTTTAAA
AGAAGAGAAG AACTCAAAAA AGTTAAGAAA CTAGGTGACA AAACTTTCGA GCAATGTGCC
GGCTTTCTTA GGATACCTGA CGGAGACAAT GTTCTTGACA ATACTTCCGT ACATCCGGAG
TCTTATGAGG CGGCCAAAAA GCTTCTTGAT ATTATGGGAT ACAGCCTTGA AGATGTGAAG
AACAGAAAAC TTGATGGACT TGTGGAAAAA GTGGAGAAAA TGGGTATGGA AAAAGTTGCC
AGGGAGATTG GTGTCGGAGT GCCGACTTTG AAAGATATTA TAAAAGAGCT TTTAAAGCCT
GGACGCGACC CCAGGGATGA GCTTCCGAAA CCGATGCTTC TTACCGACGT GCTGCATTTG
GAGGATTTGA GGCCGGGCAT GATATTGACC GGAACCGTAA GGAATGTTGC CGACTTTGGT
GCCTTTGTGG ATGTGGGAGT GCACCAGGAC GGGCTGGTTC ACATATCCGA GCTTAGCGAC
AAGTATGTAA AAAGTCCCAT GGATGTGGTG TCGGTGGGGG ATATAGTGAA GGTCAGAATT
TTGGATGTTG ATGTTGAAAG AAAAAGAATA TCCATGAGCA TGAAGGGTGT CAATTAA
 
Protein sequence
MSDIISTLVK EFNLKPFQVE NTVKLIDSGN TIPFIARYRK EITGELNDQV LRQLHERLIY 
LRNLEARKEE VRRLIDEQGK LTAEITASLE KATTLREVED IYRPFRPKRR TRATVAKEKG
LEPLAEIIMA QELKTGSIED IAKPFINPEK EVNTVEDALN GAMDIIAEDI SDNPHIRSIV
RDVFMKQGMI VSKKKKDEDS VYRMYYDFSE PVAKIAGHRV LAINRGEKEE FLQVKIEVPE
ETLMEQLKAK LVKRPPSITS EYVEKALADS YERLIFPSVE REVRNELTEN AEEQAIKVFA
TNLKNLLLQP PVKGKTVLGL DPAYRTGCKI AVVDETGKVL DTAVIYPTPP QNKVEEAKEI
MKRLIEKHGV DIISIGNGTA SRESEIFVAE LLKEIDRKVY YMVVSEAGAS VYSASKLGAE
EFPDFDVALR SAVSIARRLQ DPLAELVKID PKSIGVGQYQ HDMNQKRLSE TLQGVVEDCV
NSVGVDLNTA SPSLLSYISG INSVIAKNIV EYRETNGKFK RREELKKVKK LGDKTFEQCA
GFLRIPDGDN VLDNTSVHPE SYEAAKKLLD IMGYSLEDVK NRKLDGLVEK VEKMGMEKVA
REIGVGVPTL KDIIKELLKP GRDPRDELPK PMLLTDVLHL EDLRPGMILT GTVRNVADFG
AFVDVGVHQD GLVHISELSD KYVKSPMDVV SVGDIVKVRI LDVDVERKRI SMSMKGVN