Gene Cthe_2868 details


Gene Information

Locus tag: Cthe_2868
Symbol:
ID: 4809148
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3387793
End bp: 3389049
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 45%
IMG OID: 640108287
Product: transposase IS116/IS110/IS902
Protein accession: YP_001039259
Protein GI: 125975349
COG category: [L] Replication, recombination and repair
COG ID: [COG3547] Transposase and inactivated derivatives
TIGRFAM ID:
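
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Neither assumption is stated in the record, but both are consistent with the reported values. The variable names are illustrative only.

```python
# Values copied from the gene record above.
start_bp = 3387793
end_bp = 3389049

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not
# counted as a residue in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 1257 bp
print(protein_length, "aa")  # 418 aa
```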


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.666774
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGTGGAGA CATTGGAAAA GGAGAGGATA GATATTTATC ATAAACGAAA CACCTGCTTT 
GTTGGAATTG ATATGCACAA GGACGCACAT TGTGCAGTTG TAATTGATTG TTGGATGAAT
AAACTGGGTG AGGTTAACTT TGAAAACAGG CCATCCAGAT TCCCTGCATT CGTTGAGGAT
GTAAGGAAGA TTTGCGGCAC AAAGGGAATT GTATTCGGAC TTGAAGATAC CAGAGGCTTT
GGCAGAAACC TTGCTGCCTA TCTGGTCGGC AGGAAGTTTG AAGTCAAGCA CGTTAACCCT
GCCTATACAA GCGCTGTAAG GCTTGCAAAC CCCATTATTT ACAAGGATGA CTCCTATGAT
GCCTATTGTG TGGCAAGGGT GCTCAGGGAT ATGGTGGACA CTTTGCAGGA TGCCAAGCAT
GAGGATATAT TCTGGACAAT ACGGCAAATG GTGAAAAGAC GGGATTTGAT TGTAAAGAGC
AATGTGATGA ACAAGAACCA GCTCCACAGC CAGCTTGCTT ATAGCTACCC ATCCTACAGG
AAATTCTTTG GCATGATTGA TTCCAAGAGT GCCTTATGCT TCTGGGAGAA CTACCCGTCA
CCGGAGTATA TATGGAAAAC AACACCGGAA GAAATATATC AGACGATAAA GCCTGTGCAT
CAGGCGCTTA AAATACAGCG CATCCATGAG ATTATATCCA TGATTGAAAG GGATGGAGAC
ACAAGAAAGG ACTATCAGCC CGAAAGGGAT TTTATTGTCA GAAACATTGT AAAGGATATC
AGGCACAACA AGGAGTTGAT TGCCGAAATT GACGATGAAC TAAGAAAGCT GATACCTTTG
ACAGGCTATA AGCTACATAC AATGCCGGGA ATCGACCTTG TTACAGAAGC ACAGATAATA
TCTGAAATCG GAGATATTAA CCGCTTCCCA GACTCAGACA AGCTGGCTCG GTTTATGGGC
TTGGCACCGG TGCAATTCAG CTCTGCCGGA AAGGGTAAAG ACCAAAGATG CAGGAATGGC
AACAGGGCAC TAAATGCGAT ATTTCACTTT CTCGCAATCC AGATGGTAGC AGTATCGGCC
TCAGGAAAGC CAAGACACCC GGTATTCAGG GAGTATTTTG AGCAGAAGGT TAAAGAGGGC
AAGAACAAGC CACAGGCGCT TGTGTGCGTG GCAAGGCGGC TTGTGAGGAT TATTTACGGC
ATGATGAAAA CCAGGACAGA ATACAGGCCA TTTGAGAAGG CTGACGACAA GAACTGA
 
Protein sequence
MVETLEKERI DIYHKRNTCF VGIDMHKDAH CAVVIDCWMN KLGEVNFENR PSRFPAFVED 
VRKICGTKGI VFGLEDTRGF GRNLAAYLVG RKFEVKHVNP AYTSAVRLAN PIIYKDDSYD
AYCVARVLRD MVDTLQDAKH EDIFWTIRQM VKRRDLIVKS NVMNKNQLHS QLAYSYPSYR
KFFGMIDSKS ALCFWENYPS PEYIWKTTPE EIYQTIKPVH QALKIQRIHE IISMIERDGD
TRKDYQPERD FIVRNIVKDI RHNKELIAEI DDELRKLIPL TGYKLHTMPG IDLVTEAQII
SEIGDINRFP DSDKLARFMG LAPVQFSSAG KGKDQRCRNG NRALNAIFHF LAIQMVAVSA
SGKPRHPVFR EYFEQKVKEG KNKPQALVCV ARRLVRIIYG MMKTRTEYRP FEKADDKN
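
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch using translation table 11, as listed in the record. The gene begins with GTG, which table 11 accepts as an alternative start codon, and an initiating codon is read as methionine, which is why the protein starts with M rather than V. The `translate` function and the constant names are hypothetical helpers written for this example, not part of any database tool. Only the first 30 bases of the gene sequence are translated here.

```python
BASES = "TCAG"

# Amino-acid assignments shared by NCBI translation tables 1 and 11,
# ordered by first, second, then third base in T, C, A, G order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Codons that table 11 allows as translation starts.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        # An initiating codon is read as methionine regardless of its usual meaning.
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGGTGGAGA CATTGGAAAA GGAGAGGATA"))  # MVETLEKERI
```

The output matches the first ten residues of the protein sequence above. Passing the full gene sequence to the same function should reproduce the whole 418-residue protein, provided the sequence is pasted without errors.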