Gene Cthe_2850 details

Gene Information

Locus tag: Cthe_2850
Symbol:
ID: 4809130
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3367449
End bp: 3368705
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 44%
IMG OID: 640108270
Product: transposase IS116/IS110/IS902
Protein accession: YP_001039242
Protein GI: 125975332
COG category: [L] Replication, recombination and repair
COG ID: [COG3547] Transposase and inactivated derivatives
TIGRFAM ID:
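
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 3367449
end_bp = 3368705
gene_length_bp = 1257
protein_length_aa = 418

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which is not translated.
codons = gene_length_bp // 3
coding_codons = codons - 1

print(span == gene_length_bp)              # True
print(gene_length_bp % 3 == 0)             # True
print(coding_codons == protein_length_aa)  # True
```

Under these assumptions the three figures agree: 1257 bp is 419 codons, and dropping the stop codon leaves 418 residues.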


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.516093
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGTGGAGA CATTGGAAAA GGAAAGGATA GATATTTATC ATAAACGAAA CACCTGCTTT 
GTTGGAATTG ATATGCACAA GGACGCACAT TGTGCAGTTG TAATTGATTG TTGGATGAAT
AAACTGGGTG AGGTTAACTT TGAAAACAGG CCATCCAGAT TCCCTGCATT CGTTGAGGAT
GTAAGGAAGA TTTGCGGGAT AAAGGAAATT GTATTCGGAC TTGAAGATAC CAGAGGCTTT
GGCAGAAACC TTGCTGCCTA TCTGGTGGGC AGGAAGTTTG AAGTAAAGCA CGTAAACCCT
GCATATACAA GCGCTGTAAG GCTTGCAAAC CCCATTATTT ACAAGGATGA CTCCTATGAT
GCCTATTGTG TGGCAAGGGT GCTCAGGGAT ATGGTGGACA CTCTGCAGGA TGCCAAGCAT
GAGGATATAT TCTGGACAAT ACGGCAAATG GTGAAAAGAC GGGATTTGAT TGTAAAAAGT
AATGTGATGA ACAAGAACCA GCTCCACAGC CAGCTTGCTT ATAGCTACCC ATCCTACAGG
AAATTCTTTG CCATGATTGA TTCCAAGAGT GCCTTATGCT TCTGGGAGAA CTACCCGTCA
CCGGAGTATA TATGGAAAAC AACACCAGAA GAAATATATC AGACGATAAA GCCTGTGCAT
CAGGCGCTTA AAATACAGCG CATCCATGAG ATTATATCCA TGATTGAAAG GGATGGAGAC
ACAAGAAAGG ACTATCAGCC CGAAAGGGAT TTTATTGTGA GAAACATCGT GAAGGATATC
AGACACAACA AGGAGTTGAT TGCCGAAATT GACGATGAAC TAAGAAAGCT GATACCTTTG
ACAGGCTATA AGCTACATAC AATGCCGGGA ATCGACCTTG TTACAGAAGC ACAGATAATA
TCTGAAATCG GAGATATTAA CCGTTTCCCT GACTCAGACA AGCTGGCTCG GTTTATGGGC
TTGGCACCGG TGCAATTCAG CTCTGCCGGA AAGGGTAAAG ACCAAAGATG CAGGAATGGC
AACAGGGCAC TAAATGCGAT ATTTCACTTT CTTGCAATCC AGATGGTAGC AGTATCGGCC
TCAGGAAAGC CAAGACACCC GGTATTCAGG GAGTATTTTG AGCAGAAGGT CAAAGAGGGC
AAGAACAAGC CACAGGCGCT TGTATGCGTG GCAAGGCGGC TTGTGAGGAT AATCTACGGT
ATGATGAAAA CCAAGACTGA ATACAGGCCA TATGAGAAGA CTGACGACAA GAACTGA
 
Protein sequence
MVETLEKERI DIYHKRNTCF VGIDMHKDAH CAVVIDCWMN KLGEVNFENR PSRFPAFVED 
VRKICGIKEI VFGLEDTRGF GRNLAAYLVG RKFEVKHVNP AYTSAVRLAN PIIYKDDSYD
AYCVARVLRD MVDTLQDAKH EDIFWTIRQM VKRRDLIVKS NVMNKNQLHS QLAYSYPSYR
KFFAMIDSKS ALCFWENYPS PEYIWKTTPE EIYQTIKPVH QALKIQRIHE IISMIERDGD
TRKDYQPERD FIVRNIVKDI RHNKELIAEI DDELRKLIPL TGYKLHTMPG IDLVTEAQII
SEIGDINRFP DSDKLARFMG LAPVQFSSAG KGKDQRCRNG NRALNAIFHF LAIQMVAVSA
SGKPRHPVFR EYFEQKVKEG KNKPQALVCV ARRLVRIIYG MMKTKTEYRP YEKTDDKN
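
The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11, which uses the standard codon assignments but accepts alternative initiation codons such as GTG. The following is a minimal sketch of that translation, not the tool that produced this record: `translate` and `START_CODONS` are hypothetical names, and `START_CODONS` lists only a subset of the table 11 start codons.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); table 11 shares them.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only a subset of the table 11 initiation codons is listed here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna):
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiation codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bp, 20 codons).
first_line = "GTGGTGGAGA CATTGGAAAA GGAAAGGATA GATATTTATC ATAAACGAAA CACCTGCTTT"
print(translate(first_line))  # MVETLEKERIDIYHKRNTCF
```

The output matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to the same function would allow the whole record to be checked the same way.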