Gene Cthe_2412 details

Gene Information

Locus tag: Cthe_2412
Symbol:
ID: 4808127
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2879144
End bp: 2881795
Gene Length: 2652 bp
Protein Length: 883 aa
Translation table: 11
GC content: 41%
IMG OID: 640107825
Product: SMC protein-like protein
Protein accession: YP_001038807
Protein GI: 125974897
COG category: [S] Function unknown
COG ID: [COG4717] Uncharacterized conserved protein
TIGRFAM ID:
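The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(2879144, 2881795)
print(length)                     # 2652
print(protein_length_aa(length))  # 883
```

Both results match the Gene Length and Protein Length fields of this record.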


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00784167
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAATTG ACAAGCTTGA TATAAGGGGA TTTGGAAAGA TTCACAATTT AATAATTGAA 
TTTTCAAAAG GATTTAATTT GGTTTACGGT GAAAACGAGG CCGGGAAAAC AACGGTTCAA
TGGTTTATCC GCGGCATGCT GTATTCTCTG AAAGGAGGAA AAAACACAAA AGGCGGCGCG
ATACCTCCTT TGAAAAAATA CAGTCCATGG AAAGGAGATT TCTACGGAGG AAGCATTGTT
TATACTCTTG ACAGCGGAGC TTCTTTTACC GTGGAAAGAG ATTTTAACAA TAATACGGTA
AAAATCTTTG ATTCTTTTTT CAATGACATA AGTGACAGCT TCAAAAAAAG CAAGGAAAAA
GGGCCTTTGT TTGCCGTGGA GCATCTTGGT ATAAATGAGG CTTGCTTTGA AAGAACCGTG
TTTATTGGCC AGATGGATAC GAAAGTTGAC GCCTCGGGAG GCAGGGAGCT TGTGGACAAG
CTTGCCAATA TCAGAGAGAC AGGCTCGGAG GAAGTTTCCC TTAAAAAGGC AAGGGAGGCT
CTTAAAGATT CTCTTATAAA CTATGTCGGA ACCGACAGAA GCACGACACG GCCCCTGGAT
ATGGTGAACT TAAAGCTGGC CGAACTTGAG GGAAAGAAAA AGGAACTTTT TAAGGAAAAG
GAAAAAATAT TTGAGGCGGA AGAAAAGCTA AGAAAGCTTT CCGAACAGAA GAATCGTTAT
GAGAAAAAAA GAGAAGTTTT TAACCTTGCA AGAAAGGTCA TTGAACTTAG GAAAAACGTT
GAAGAAGTAA AGAAGAAGAA AAGGGAACTT GTCTTGATTA TAAAAGAGGC GGAAAAATAT
GAGCAGGAAA GAGAAACTTT AAGTCAGCAG ACGGAGCTTT GCAACGGAGT TAAAAAACAG
TATGAAGCCT ATTCGAAATA CGCAAAAGAC GACCCGGGAC TTATCAATAT TTTATACAAT
AAGCTGGAAG ATGCATTAAA AGAAAAGGAG CGGCTTTGTA AAAAGCAAGA GACATTAATG
CAGGAGATTG GGGAAATTGA AAGGTCGCTC GAAGAATATA AAGCTTTTCG CAGCTTTGAA
GAAGACGTGG ACGGCAGGGT GCAAAATCTT TCCGGGAGTA TAAAAGAACT TGAACAGAAA
AAAAGGGATG TAAACGTTAC AGCATTGGAC GAAAGTGTGA AAGCTGCTTC TTACAAATTG
GGTTTCATTA AAGTTGGCAT TGGTATCTTG GCAATTTTAA CTCTTTTGTC CGGAATCTGT
ACATCGTTTT TCCGGCAGAA AGCTGTTTTT GCGGTGCTGA CTTTTGTCTT TGCACTGTTG
ACACTGGTAT TTGTTTATAT GGGAAAAGCC GTAAGGGACA ATCTTCAAAG ACTTGATCAC
AATAGGAATA TTCTTCTTTC AGAAATGCGG GACATAGACA GGGAGCTTTC CGCAAAACAA
GAGGAAATCC GGCGGATTTT TTCCGTGGCG GGTGTGGAAA ATGAGGGTGA GTTTATTAAG
AAGAAAACCC TGTACGAAAA CAAGGTTCTC CGCCTTGCCG AATTAAACGG CAGCATGGAT
GAGCTTGAAA GGGAAATGGA TGAGAACCGG ATATATATTG AAAAAATTAA GACTTTAATG
CTTGACAGAC TTGGAACGTG CGGTATAATT GCTTTGGAAG AAAATGAAAT AAAATCAGAG
CATGTAAAAA CTTTTAGAGA AGGCCTTGCA AAATACCTGG AAGCAATTGA AAACTTAAAA
AGGCTCAATG AAAAGCGGGA GGATGCTGCC AAATACCTGC AATCTCTTTA TGACAGGGCG
TCTTCATTGT TTGGTGAGAG CTTTGCCAAA AAAGAAGATT TGTTGAGAAG CCTTGACGGG
ATGGATCTAA AAATAAATGA GCTCTACGAA AAAATTGAGA AATATTCGAT GGAAATTCAG
AATTCTTACG GCTTTACGGA TAATTCTCCG GAATATCATG AGCTGATGGA GAAAATTTAT
GACGCTGAAT TTCAAAGTGC TGAAAGTTAT ATAGAAAATT TACTTTCAGA GCTAAATGGC
AGGATTGACG AGATTGTGCT TGAGATGAGC AGGGACTGGG CTTTGGTGGA AAGAGGCTGT
TTAATTGAAA ATGAAATTCA GGAACTTGAA GTAAAGACGG CAGAACTTGA AAGGGAGAAA
GAACGTCTTC TGGATATTGG CAAAAGTTTA AAGACTGCGC TGGATGTCCT TGAGGAGGCG
GCGCTTGAGA TAAAAAGGGA ATTTGCACCT TTGCTGAATC AAAAGCTTGG CAGCATAGCA
GGTTTTATAA CGCAAGGCAA ATACAGTGAG GTAAGAGCCG ATGACAGTTT CATGATAAGA
GCGTTGGAGC CGGGTACCCG GCGCATTGTG GAGCTGCCTT TTTTAAGCGG CGGCACCGTT
GAACAGCTGT ACCTTGCATT AAGAATTGCC CTTGCAGAGA CTGTTGAAGA CGGCGGCGAA
GTTTTACCCC TCATTATGGA CGAGGTGTTT GCGCATTATG ATGACACAAG GGTGTTTAGT
ACTTTGAAGA TGCTTTTTGA GCTGTCGAAA GAGCGCCAGA TTATATTTTT TACATGCAAG
GACAGAGAGA TGGAGGCAGC CACAGAGGTT TTCGGCAAAG ATTTAAATGT TATAAAACTG
GGCACTTGTT GA
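The gene sequence is printed in space-separated groups of 10 bases, so it has to be joined before any length or composition check. The sketch below is one possible way to do that and to compute a GC percentage; the helper names are hypothetical, and how the page itself rounds its GC content figure is not stated here.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence block, as printed on the page.
first_line = "ATGAGAATTG ACAAGCTTGA TATAAGGGGA TTTGGAAAGA TTCACAATTT AATAATTGAA"
bases = clean_sequence(first_line)
print(len(bases))  # 60
```

Passing the whole block through `clean_sequence` and then `gc_percent` gives a value that can be compared with the GC content field, and `len()` of the cleaned block can be compared with the Gene Length field.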
 
Protein sequence
MRIDKLDIRG FGKIHNLIIE FSKGFNLVYG ENEAGKTTVQ WFIRGMLYSL KGGKNTKGGA 
IPPLKKYSPW KGDFYGGSIV YTLDSGASFT VERDFNNNTV KIFDSFFNDI SDSFKKSKEK
GPLFAVEHLG INEACFERTV FIGQMDTKVD ASGGRELVDK LANIRETGSE EVSLKKAREA
LKDSLINYVG TDRSTTRPLD MVNLKLAELE GKKKELFKEK EKIFEAEEKL RKLSEQKNRY
EKKREVFNLA RKVIELRKNV EEVKKKKREL VLIIKEAEKY EQERETLSQQ TELCNGVKKQ
YEAYSKYAKD DPGLINILYN KLEDALKEKE RLCKKQETLM QEIGEIERSL EEYKAFRSFE
EDVDGRVQNL SGSIKELEQK KRDVNVTALD ESVKAASYKL GFIKVGIGIL AILTLLSGIC
TSFFRQKAVF AVLTFVFALL TLVFVYMGKA VRDNLQRLDH NRNILLSEMR DIDRELSAKQ
EEIRRIFSVA GVENEGEFIK KKTLYENKVL RLAELNGSMD ELEREMDENR IYIEKIKTLM
LDRLGTCGII ALEENEIKSE HVKTFREGLA KYLEAIENLK RLNEKREDAA KYLQSLYDRA
SSLFGESFAK KEDLLRSLDG MDLKINELYE KIEKYSMEIQ NSYGFTDNSP EYHELMEKIY
DAEFQSAESY IENLLSELNG RIDEIVLEMS RDWALVERGC LIENEIQELE VKTAELEREK
ERLLDIGKSL KTALDVLEEA ALEIKREFAP LLNQKLGSIA GFITQGKYSE VRADDSFMIR
ALEPGTRRIV ELPFLSGGTV EQLYLALRIA LAETVEDGGE VLPLIMDEVF AHYDDTRVFS
TLKMLFELSK ERQIIFFTCK DREMEAATEV FGKDLNVIKL GTC
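The protein sequence can be spot-checked against the gene sequence by translating a few codons. The sketch below builds the codon table from the compact amino-acid string used for the standard genetic code; translation table 11 assigns the same amino acids to internal codons and differs mainly in which codons are accepted as starts, which does not matter here because the gene begins with ATG. The `translate` helper is hypothetical and makes no attempt to handle alternative start codons or ambiguous bases.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # tolerate the 10-base grouping
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGAGAATTG ACAAGCTTGA TATAAGGGGA"))  # MRIDKLDIRG
print(translate("GGCACTTGTT GA"))                      # GTC
```

The first call uses the first 30 bases of the gene and reproduces the first 10 residues of the protein; the second uses the last 12 bases, where TGA is the stop codon, and reproduces the final three residues.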