Gene Cthe_1958 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1958 
SymboluvsE 
ID4810741 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2333189 
End bp2334154 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content40% 
IMG OID640107374 
Productputative UV damage endonuclease 
Protein accessionYP_001038369 
Protein GI125974459 
COG category[L] Replication, recombination and repair 
COG ID[COG4294] UV damage repair endonuclease 
TIGRFAM ID[TIGR00629] UV damage endonuclease UvdE 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGGTAA GACTTGGCTA TGTTGCAATG ACTATGAACT TAAAAGACAG TTCGCCTTCC 
GGGACCGTAA CCGTAAAAAC CCTTGCAGGA TTGAACGAGG ATGAAACAAG GCTTAACAAA
TTACGGAAGG TTACCGCGGC AAATCTGCAC AACACACTAA GGATACTAAG GTATAACAAA
GCATACAACA TCAGTGTGTA CCGTTTTACA TCAAAACTTG TGCCTTTGGC CACACATCCC
ATGGTATCAC ACTGGGACTA TGTCGGGGAG TTTAGAAATG AGTTTAAGAG TCTCGGTGAC
TTTGTAAAGG AAAACAATTT CAGGGTAAGC GCCCATCCGG ACCACTACAC ATTGCTCAAC
TCACCTTCAA AAGAGGTTTT TGAAGCTTCG GTGCGGGACC TTGATTATCA CGTAAAACTG
TTTGAAGCCA TGGGTCTTGA AGATTACAAA TACAAGCTGG TAATGCACAT AGGCGGGCTT
TACAAAAACA AACAAACGTC TGTTGAAAGG TTTAAAGAAA ACTATGCCAT CCTCCCTGAC
AGAATCAGAA AAAGGCTGAT CTTTGAGAAT GACGACAAGA TATATACTGC CAGGGATGTG
CTTGACATCT GCAAGGAGCT GAAGGTTCCA ATGGTGCTGG ATGTGCATCA TCACAACTGT
GTAAACAATG GAGAAAGACT TGAAGATATG CTTAAGGAAA TATTTGACAC ATGGAAAGAT
GAGTATTTTC CGCCTAAAAT ACACTTTTCA ACGCCAAAAA GCAAAGAAAA CTTCAGAAGC
CATGCCGATG AAATAGATGT AAATGAGTTT TACCGGTTTT TACAGATTGC AAAAAAGTTA
AAAAAGGATT TTGATGTTAT GATTGAGGCA AAGAACAAGG ATAATGCATT ATTTAAACTT
TCCAGGGAAT TGAAAACCTT TGATGACATA AGATGGATAG ATGAAGGACA TTTTGAAATC
CGGTAA
 
Protein sequence
MLVRLGYVAM TMNLKDSSPS GTVTVKTLAG LNEDETRLNK LRKVTAANLH NTLRILRYNK 
AYNISVYRFT SKLVPLATHP MVSHWDYVGE FRNEFKSLGD FVKENNFRVS AHPDHYTLLN
SPSKEVFEAS VRDLDYHVKL FEAMGLEDYK YKLVMHIGGL YKNKQTSVER FKENYAILPD
RIRKRLIFEN DDKIYTARDV LDICKELKVP MVLDVHHHNC VNNGERLEDM LKEIFDTWKD
EYFPPKIHFS TPKSKENFRS HADEIDVNEF YRFLQIAKKL KKDFDVMIEA KNKDNALFKL
SRELKTFDDI RWIDEGHFEI R