Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_3212 |
Symbol | |
ID | 4809514 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3804783 |
End bp | 3806114 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640108646 |
Product | hypothetical protein |
Protein accession | YP_001039600 |
Protein GI | 125975690 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1604] Uncharacterized protein predicted to be involved in DNA repair (RAMP superfamily) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTTATA AAAACAGATT GCTTAAAACA ATAAGATACA ATGTTAAAGC AACAACGATT TCACCTTTGG CAATTAGAGA TGACGAAGAT AATCTAAAAA TAGATCAATT GACAGGCAAA GTTTATATAC CCGGTTCATC TGTTGCCGGT GCTTTTAGAA ACTATTATGA AAATTATATT GATAAGAATT CCAATGAGAA TTTTAATGAA TTGTTTGGCG GCCAAAAAAC AGGAATGAGT CAGATTGTTT TTTATGACGG TTATCTTGTT AATGATTATG TGAAAGAGAT GATTTCGTCA AGACCGGGGA TAAAGATGGA TCTTAAAAGA ATGACAGTTG AAGTTTCCGA TGATGCAAAA AAATCCGGTA AGAAATTTAA AAGGCGTTTT TTAAATGAAG GATTAACTTT TGAGTTTGTT TTTGAGCTGA ACAATTATGA AGATGATGCC GGAAAATTTG AAGAAAAGCA AAGAAAATTC GAAGAGTTGC TAAAAGCTTT CTCAATAGGG GATATTTCAT TAGGAAGTAA CAAAATGATT GGCTATGGGA GATTTAGAGT GGACTCAATT TCAAAAAGCG TGTTTGACTT TACAAATATT AATGATTTGT TGAAGTATAT GTTGATGGAG ACTGATAGCA CTGAGATAAC TCAAGATATA TTAGGCAGGG AGCAAGAGAC TTCAAAAGTT CGTTTTAAAA TAAAAGGAAA AACTGTTACC CCTCTATTGG TGAAAGACGA AACAGTTCGT TTATCAAACG AGTCGGACGG CATTAACATA AAAGACAGTA GAGGCAACTA TATTATTCCC GGAAGTTCAA TTAAAGGTGT TATAAGGTCA CGGGCGGAGA GACTGCACAG AACCTTTCCC TGCATTGGTG AGGAAATTTT AACAAATATT TTTGGTATAG AATCAAAAAA GGATGATGAT GGACATATTT CAAGACTGAG TTGCTTTGAT GCGGTAGTTA AAAATCCCAA CAAAGGCATA TACAACAAGA TAAAGATTGA CTATTTTACA GGAGGAGTCA TGCAAGGAGC ATTGATGAAT GATGAGGTTG TAATGGGAGA TGTTGAGATA GAGTGTACCT TTAATACATC AGGATTAAAT GATTACAAAA GAGAGATTGG GCTTTTGCTT TTGGTGTTAA GGGATTTGTG CAAAGAAGAT TTAAGTATAG GAAGCGGTTA TGCTGTAGGA AGAGGATATA TCAAGGCAGA AACTTTGGAA TTGTATGACG GTGAAAAATT AGTTCTTGAT TTTAAGTCAC CAAATAGAGA GGTATTGAAA AGATTTGATT CCTATATATC GAGCTTGATG AATGTGGGGT GA
|
Protein sequence | MIYKNRLLKT IRYNVKATTI SPLAIRDDED NLKIDQLTGK VYIPGSSVAG AFRNYYENYI DKNSNENFNE LFGGQKTGMS QIVFYDGYLV NDYVKEMISS RPGIKMDLKR MTVEVSDDAK KSGKKFKRRF LNEGLTFEFV FELNNYEDDA GKFEEKQRKF EELLKAFSIG DISLGSNKMI GYGRFRVDSI SKSVFDFTNI NDLLKYMLME TDSTEITQDI LGREQETSKV RFKIKGKTVT PLLVKDETVR LSNESDGINI KDSRGNYIIP GSSIKGVIRS RAERLHRTFP CIGEEILTNI FGIESKKDDD GHISRLSCFD AVVKNPNKGI YNKIKIDYFT GGVMQGALMN DEVVMGDVEI ECTFNTSGLN DYKREIGLLL LVLRDLCKED LSIGSGYAVG RGYIKAETLE LYDGEKLVLD FKSPNREVLK RFDSYISSLM NVG
|
| |