Gene Cthe_1294 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1294 
Symbol 
ID4809546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1571358 
End bp1572785 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content39% 
IMG OID640106717 
Productrecombinase 
Protein accessionYP_001037719 
Protein GI125973809 
COG category[L] Replication, recombination and repair 
COG ID[COG1961] Site-specific recombinases, DNA invertase Pin homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAAAAA ACGGACTGGG TGAAATTAAA AGCATTTATA TTGACGATAA TGTTTCCGGA 
TCCGGCTTTG AGAGAAGGGG TATCTACGAG CTTAAACGTG ATGTGATGGA AGGAAAAATA
AATCTTCTGG TTCTGAAAGA TTTGTCCAGG CTTGGGAGAA ACAATGCAAA AACTCTTTTA
TTTCTTGATT TTTTAGAGGA GAACGGAGTA AGGGTCGTTA CCTTCGACGG AAGGTATGAC
AGCCTGAAAG ACAACGATAT TGTAGGTATC GAGACGTGGT TCAACGAGCG CTACATCCTG
GATATTTCAA GAAAAATCCG GGCCAATTTA AGATTCAAGA TACAAAAAGG GGAATATATA
GGGCATGCTC CCTTTGGATA TGAAAAGTCG CCTTATGAGA AGAACCGGCT GATTGTAAAC
GAGGAAGAGG CGGGTATTGT AAGAAGAATA TACAGCCTTT ACAAGGAAGG ATATGGATAC
AGCTATATCG CAGGCATATT GAACAGCGAA GGTGTAAAAT CGCCTTCCAA TGGCCGCTGG
AATCCCACAG CCATAAGGAG GATTCTTTTA AACAGGGTTT ATACCGGGGA TACGGTACAG
GGAGTAAGTG AGAAAATCAG TTTTAAAAGC AAGAAAACCA GACGTCTGCC AAAGGAACGA
TGGGTGATTA CTGAAAATAC CCATGAACCC ATTGTTTCCA AAGAGGAATA TGAAGAAATT
CAAAGAATAA GGAATAATAA AAATGCAAGA CCGGTTCCCC ACAAAGGGAT TATACACGTG
TTTCGCGGCA GCATATTTTG TGGGGGCTGC GGAAGCGTTA TGTTTGCAAG AAAGCGGCAT
AACAGGCCTA TGAGCTATAT TTGCAGCAGT TATGCCAAAG AAGGAAGGAC GGCGTGTACA
AGCCACAGTA TTCGAGAAAA GGATTTGTGT GAGGTTGTTT TGGATGATGT AGCAAAGCTT
TTGGATGATG AAAACATGGT AAACAAAATA CTGCAAAAGA TTGACTTGGC CGGAGCGGCG
GAGGATTATC AAACTTTGAG GGAAAAACTT TTAAAACAGA TGGAAGCAAA ACAAAAGCAG
CAGGAGATTC TTTATCAGGA TAGACTTGAA AACAAGATAT CCGAAAGTCT TTTTTTAAGG
ATGAATAACA GGCTTGAAAA CAGAATTTCT GAAATAAAAC GGGAAATTGC CCAATTGGAT
TTACGAAAGT CAGAGTTTGT GACTTCAGAA GAAAAGATTG CGAAACTGAA GAATTATATT
ACAAATAATG GAATTACCAA TGAAATAGTT AAAATTGTTA TAAACAGGAT AATAGTTTTT
GACAAAGGCG ATAATTATTT GGAAGAAAAG TGGAATCTGA ATCTTTCGCA AAAGGAAAAA
AGGTATATCG AATTATATGG TGCGGTCTTA ATTGAATACG GTTTTTAG
 
Protein sequence
MKKNGLGEIK SIYIDDNVSG SGFERRGIYE LKRDVMEGKI NLLVLKDLSR LGRNNAKTLL 
FLDFLEENGV RVVTFDGRYD SLKDNDIVGI ETWFNERYIL DISRKIRANL RFKIQKGEYI
GHAPFGYEKS PYEKNRLIVN EEEAGIVRRI YSLYKEGYGY SYIAGILNSE GVKSPSNGRW
NPTAIRRILL NRVYTGDTVQ GVSEKISFKS KKTRRLPKER WVITENTHEP IVSKEEYEEI
QRIRNNKNAR PVPHKGIIHV FRGSIFCGGC GSVMFARKRH NRPMSYICSS YAKEGRTACT
SHSIREKDLC EVVLDDVAKL LDDENMVNKI LQKIDLAGAA EDYQTLREKL LKQMEAKQKQ
QEILYQDRLE NKISESLFLR MNNRLENRIS EIKREIAQLD LRKSEFVTSE EKIAKLKNYI
TNNGITNEIV KIVINRIIVF DKGDNYLEEK WNLNLSQKEK RYIELYGAVL IEYGF