Gene Cthe_3212 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_3212 
Symbol 
ID4809514 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3804783 
End bp3806114 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content34% 
IMG OID640108646 
Producthypothetical protein 
Protein accessionYP_001039600 
Protein GI125975690 
COG category[L] Replication, recombination and repair 
COG ID[COG1604] Uncharacterized protein predicted to be involved in DNA repair (RAMP superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTTATA AAAACAGATT GCTTAAAACA ATAAGATACA ATGTTAAAGC AACAACGATT 
TCACCTTTGG CAATTAGAGA TGACGAAGAT AATCTAAAAA TAGATCAATT GACAGGCAAA
GTTTATATAC CCGGTTCATC TGTTGCCGGT GCTTTTAGAA ACTATTATGA AAATTATATT
GATAAGAATT CCAATGAGAA TTTTAATGAA TTGTTTGGCG GCCAAAAAAC AGGAATGAGT
CAGATTGTTT TTTATGACGG TTATCTTGTT AATGATTATG TGAAAGAGAT GATTTCGTCA
AGACCGGGGA TAAAGATGGA TCTTAAAAGA ATGACAGTTG AAGTTTCCGA TGATGCAAAA
AAATCCGGTA AGAAATTTAA AAGGCGTTTT TTAAATGAAG GATTAACTTT TGAGTTTGTT
TTTGAGCTGA ACAATTATGA AGATGATGCC GGAAAATTTG AAGAAAAGCA AAGAAAATTC
GAAGAGTTGC TAAAAGCTTT CTCAATAGGG GATATTTCAT TAGGAAGTAA CAAAATGATT
GGCTATGGGA GATTTAGAGT GGACTCAATT TCAAAAAGCG TGTTTGACTT TACAAATATT
AATGATTTGT TGAAGTATAT GTTGATGGAG ACTGATAGCA CTGAGATAAC TCAAGATATA
TTAGGCAGGG AGCAAGAGAC TTCAAAAGTT CGTTTTAAAA TAAAAGGAAA AACTGTTACC
CCTCTATTGG TGAAAGACGA AACAGTTCGT TTATCAAACG AGTCGGACGG CATTAACATA
AAAGACAGTA GAGGCAACTA TATTATTCCC GGAAGTTCAA TTAAAGGTGT TATAAGGTCA
CGGGCGGAGA GACTGCACAG AACCTTTCCC TGCATTGGTG AGGAAATTTT AACAAATATT
TTTGGTATAG AATCAAAAAA GGATGATGAT GGACATATTT CAAGACTGAG TTGCTTTGAT
GCGGTAGTTA AAAATCCCAA CAAAGGCATA TACAACAAGA TAAAGATTGA CTATTTTACA
GGAGGAGTCA TGCAAGGAGC ATTGATGAAT GATGAGGTTG TAATGGGAGA TGTTGAGATA
GAGTGTACCT TTAATACATC AGGATTAAAT GATTACAAAA GAGAGATTGG GCTTTTGCTT
TTGGTGTTAA GGGATTTGTG CAAAGAAGAT TTAAGTATAG GAAGCGGTTA TGCTGTAGGA
AGAGGATATA TCAAGGCAGA AACTTTGGAA TTGTATGACG GTGAAAAATT AGTTCTTGAT
TTTAAGTCAC CAAATAGAGA GGTATTGAAA AGATTTGATT CCTATATATC GAGCTTGATG
AATGTGGGGT GA
 
Protein sequence
MIYKNRLLKT IRYNVKATTI SPLAIRDDED NLKIDQLTGK VYIPGSSVAG AFRNYYENYI 
DKNSNENFNE LFGGQKTGMS QIVFYDGYLV NDYVKEMISS RPGIKMDLKR MTVEVSDDAK
KSGKKFKRRF LNEGLTFEFV FELNNYEDDA GKFEEKQRKF EELLKAFSIG DISLGSNKMI
GYGRFRVDSI SKSVFDFTNI NDLLKYMLME TDSTEITQDI LGREQETSKV RFKIKGKTVT
PLLVKDETVR LSNESDGINI KDSRGNYIIP GSSIKGVIRS RAERLHRTFP CIGEEILTNI
FGIESKKDDD GHISRLSCFD AVVKNPNKGI YNKIKIDYFT GGVMQGALMN DEVVMGDVEI
ECTFNTSGLN DYKREIGLLL LVLRDLCKED LSIGSGYAVG RGYIKAETLE LYDGEKLVLD
FKSPNREVLK RFDSYISSLM NVG