Gene Cthe_1700 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1700 
Symbol 
ID4808875 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2022140 
End bp2023468 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content34% 
IMG OID640107113 
Productrecombinase 
Protein accessionYP_001038114 
Protein GI125974204 
COG category[L] Replication, recombination and repair 
COG ID[COG1961] Site-specific recombinases, DNA invertase Pin homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000219867 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAGCTT ATTGCAGAGT ATCAACAGAC CAATTAGAAC AGTTATCAAG CTATGAAGCA 
CAGGTAGCTT ATTATACATC TTATATAACA AATCATCCAG ATTATGAATT CGGAGGAATT
TACGCAGATG AGGGGATTTC AGGGACAAAC ACTAAAAAGA GAGAACAGTT CAATAAAATG
ATAGAAGACT GCAAAGCAGG GAAAATAGAT ATGATAATAA CCAAGTCTAT ATCAAGGTTT
GCAAGAAATA CACTTGATAC ATTAAACTAC GTAAGACAGC TTAAAGAATT AGGTATTGGA
GTAATATTTG AAAAGGAAAA TATTAATACT TTAGATTCAA AGGGAGAAGT ACTGCTTACA
ATCCTCAGCT CCCTTGCCCA AGATGAATCA AGAAGTATAA GTGAAAATTC TACATGGGGC
ATAAGGAGAA GGTTTGAACA GGGAAAGCTT CATATAAATC ATACAAAGTT TTTAGGCTAT
GATAAAGATG AAGAAGGAAA TCTTGTGATA AATGAAAAGC AGGCTAAAAT TGTAAGAAGA
ATATATAAGG ATTACCTTGA CGGTAAAGGT GCAAACAGAA TTGCAAGGGA ACTTGAAGAA
GAAGGTGTTC CTAACTGGAA TGGAAAACCT AAGTGGTATG AAAGCAGTAT AAGAAAGATT
TTAAGTAATG AAAAATATAA AGGAGATGCA CTTCTCCAAA AGACATATAC TGTTGATTTT
CTAACCAAAA AAAGAGCAGT AAACAATGGC GAAGTTCCAA TGTATTATGT AGAAGAAAGC
CATCCTGCAA TTATAGATAA AGAAATTTGG GAAGCGGTAC AGCTAGAGAT GGAGAGAAGA
AGAGCTTTTG CTGAAAAATA TAACATCAGT AAGCTTGATT ATGCCACAGT AGATAATCCC
TTTGCAGGAA GAGTTATCTG CGGACACTGC GGCAGTGCCT TTGGAAGAAA GGTATGGAAT
TCTACTGATG AAAGGCTAAG AAGAGTGGTT TGGAGATGTA ATAAAAAATA TGAAGTAAAG
GGAAAAAAGA GCTGTGAGAA TAAGCATATA GATGACAAGG TTTTATATCA TGCCTTTGTA
AATACATTTA ATGCTATGGT GGAGAATAAG GAATACTTTA TGGAGAAGTG GAAGGAAGGA
CTTAAAAGTG ATAACTTGCT TAAAAGATAT AAGGCAAGGC AGTTTATTGA AATTTTAAAG
GATGCAAAGA TAGTAGAGGA GTTTGATGTT GATATGTATT TTAGAATAAT AGAGAAAATG
ACAGTATTTG ATGGAAAAAA GATAATAGTG AGTTTGCTTG ATAGCACGGA GATTGAAGTT
GCAATTTAA
 
Protein sequence
MGAYCRVSTD QLEQLSSYEA QVAYYTSYIT NHPDYEFGGI YADEGISGTN TKKREQFNKM 
IEDCKAGKID MIITKSISRF ARNTLDTLNY VRQLKELGIG VIFEKENINT LDSKGEVLLT
ILSSLAQDES RSISENSTWG IRRRFEQGKL HINHTKFLGY DKDEEGNLVI NEKQAKIVRR
IYKDYLDGKG ANRIARELEE EGVPNWNGKP KWYESSIRKI LSNEKYKGDA LLQKTYTVDF
LTKKRAVNNG EVPMYYVEES HPAIIDKEIW EAVQLEMERR RAFAEKYNIS KLDYATVDNP
FAGRVICGHC GSAFGRKVWN STDERLRRVV WRCNKKYEVK GKKSCENKHI DDKVLYHAFV
NTFNAMVENK EYFMEKWKEG LKSDNLLKRY KARQFIEILK DAKIVEEFDV DMYFRIIEKM
TVFDGKKIIV SLLDSTEIEV AI