Gene Information |
Locus tag | Cthe_1230 |
Symbol | |
ID | 4809922 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1472107 |
End bp | 1472997 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640106653 |
Product | helix-hairpin-helix repeat-containing competence protein ComEA |
Protein accession | YP_001037655 |
Protein GI | 125973745 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1555] DNA uptake protein and related DNA-binding proteins |
TIGRFAM ID | [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region; [TIGR01259] comEA protein |
|
|
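The length fields in the table above follow from the coordinates. A minimal sketch of that arithmetic, assuming Start bp and End bp are 1-based inclusive positions (an assumption, though it is consistent with the reported 891 bp) and that the CDS ends in a single stop codon; the function names are illustrative only:

```python
# Values copied from the Gene Information table.
START_BP = 1472107
END_BP = 1472997


def gene_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates (assumption)."""
    return end - start + 1


def protein_length(gene_len_bp, has_stop_codon=True):
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 891
print(protein_length(length_bp))  # 296
```

Both results agree with the Gene Length (891 bp) and Protein Length (296 aa) rows.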
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.890666 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCTGAGGG ATTTTTTTAA TCAGGAAGTA AGTATGAAAA AAGGAATAGT TGGGCTCATG ATATTGGGAT TGATTGTAAC TACTTCGGTT ACGGGATTTC TTCTTGCAAA TGACGGTGAG GATATAATTA TAAGCAAGGC GAAAGCCGGT CAATATACCG TGGAAGCGGA AAACGGTGAG GAAAAGACAA CGGAAAAATT GGTTCAGGAG AAAGAAGAAG CTGCAGATGA AATAAAGGTG TATGTTGTCG GTGAGGTTAA CAAACCCGGT GTGGTTACAC TGAAAAAAGG TCAGATAATA CAAGATGCCA TTGAACTTGC CGGAGGTCCC ACAGAGGATG CGGATATTGA GAATATAAAT TTGGCTTATG AGCTCCGGGA GAATGTTATG ATAAGGGTAA TGTCCAAAAG TGAGACTACA GGGCAGGATA TTGGTGAAGA AGGCGATATG CAGGTTGCTG CGGGCAATAC TGAAAATAAA AGTACGGCTG CAGGAAGCAA TTCAACCAAG AATAGTCAGT CGAAGAATGT TTCCGGCGGA ACAAGCAAAA ACAGTTCAGG CTCAAATGAT GCAAAAAGCA ATTCGGGAAA AAGCACCAAC AATGGGGGTG TTTCAGGAAT AGCCGTTACA AAGGACAGCG GCGGGGCGGT AGTCGGAGAA AATGCAAGTA GTAGTGAAAA CAGCAAGACT GCAAATTCAA AAATCAATAT AAATACTGCA ACTGTGGAGG AACTTGATTC TCTCCCGGGA ATCGGGCCGG CCATTGCGGC CAAAATAGTG GCTTATCGCG AGCAGAACGG CAAATTTAAA TCAATAGAAG ACATAATGAA TGTCTCAGGA ATAGGCCAGA GTAAATTCAA CAATATCAAG GACTTTATTA CGGTAAACTG A
|
Protein sequence | MLRDFFNQEV SMKKGIVGLM ILGLIVTTSV TGFLLANDGE DIIISKAKAG QYTVEAENGE EKTTEKLVQE KEEAADEIKV YVVGEVNKPG VVTLKKGQII QDAIELAGGP TEDADIENIN LAYELRENVM IRVMSKSETT GQDIGEEGDM QVAAGNTENK STAAGSNSTK NSQSKNVSGG TSKNSSGSND AKSNSGKSTN NGGVSGIAVT KDSGGAVVGE NASSSENSKT ANSKININTA TVEELDSLPG IGPAIAAKIV AYREQNGKFK SIEDIMNVSG IGQSKFNNIK DFITVN
|
| |
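The gene sequence as displayed starts with TTG and ends with TGA, while the protein starts with M. Under translation table 11, TTG is one of the alternative initiation codons and is read as methionine at the start of a CDS. The following is a minimal sketch of how the Sequence and GC content rows could be checked in plain Python. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is written out by hand rather than taken from any library:

```python
# Amino-acid assignments for translation table 11, with codons
# enumerated in T, C, A, G order at each of the three positions.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons allowed by table 11; at position 0 they encode M.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq):
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        codon = s[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # alternative start codon read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("TTGCTGAGGG ATTTTTTTAA TCAGGAAGTA"))  # MLRDFFNQEV
```

Passing the full Gene sequence value to `translate` and `gc_content` would allow a comparison against the Protein sequence and GC content rows. No reverse complement is applied here, on the assumption that the record already shows the coding strand even though the gene lies on the minus strand.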