Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2412 |
Symbol | |
ID | 4808127 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2879144 |
End bp | 2881795 |
Gene Length | 2652 bp |
Protein Length | 883 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640107825 |
Product | SMC protein-like protein |
Protein accession | YP_001038807 |
Protein GI | 125974897 |
COG category | [S] Function unknown |
COG ID | [COG4717] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00784167 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAATTG ACAAGCTTGA TATAAGGGGA TTTGGAAAGA TTCACAATTT AATAATTGAA TTTTCAAAAG GATTTAATTT GGTTTACGGT GAAAACGAGG CCGGGAAAAC AACGGTTCAA TGGTTTATCC GCGGCATGCT GTATTCTCTG AAAGGAGGAA AAAACACAAA AGGCGGCGCG ATACCTCCTT TGAAAAAATA CAGTCCATGG AAAGGAGATT TCTACGGAGG AAGCATTGTT TATACTCTTG ACAGCGGAGC TTCTTTTACC GTGGAAAGAG ATTTTAACAA TAATACGGTA AAAATCTTTG ATTCTTTTTT CAATGACATA AGTGACAGCT TCAAAAAAAG CAAGGAAAAA GGGCCTTTGT TTGCCGTGGA GCATCTTGGT ATAAATGAGG CTTGCTTTGA AAGAACCGTG TTTATTGGCC AGATGGATAC GAAAGTTGAC GCCTCGGGAG GCAGGGAGCT TGTGGACAAG CTTGCCAATA TCAGAGAGAC AGGCTCGGAG GAAGTTTCCC TTAAAAAGGC AAGGGAGGCT CTTAAAGATT CTCTTATAAA CTATGTCGGA ACCGACAGAA GCACGACACG GCCCCTGGAT ATGGTGAACT TAAAGCTGGC CGAACTTGAG GGAAAGAAAA AGGAACTTTT TAAGGAAAAG GAAAAAATAT TTGAGGCGGA AGAAAAGCTA AGAAAGCTTT CCGAACAGAA GAATCGTTAT GAGAAAAAAA GAGAAGTTTT TAACCTTGCA AGAAAGGTCA TTGAACTTAG GAAAAACGTT GAAGAAGTAA AGAAGAAGAA AAGGGAACTT GTCTTGATTA TAAAAGAGGC GGAAAAATAT GAGCAGGAAA GAGAAACTTT AAGTCAGCAG ACGGAGCTTT GCAACGGAGT TAAAAAACAG TATGAAGCCT ATTCGAAATA CGCAAAAGAC GACCCGGGAC TTATCAATAT TTTATACAAT AAGCTGGAAG ATGCATTAAA AGAAAAGGAG CGGCTTTGTA AAAAGCAAGA GACATTAATG CAGGAGATTG GGGAAATTGA AAGGTCGCTC GAAGAATATA AAGCTTTTCG CAGCTTTGAA GAAGACGTGG ACGGCAGGGT GCAAAATCTT TCCGGGAGTA TAAAAGAACT TGAACAGAAA AAAAGGGATG TAAACGTTAC AGCATTGGAC GAAAGTGTGA AAGCTGCTTC TTACAAATTG GGTTTCATTA AAGTTGGCAT TGGTATCTTG GCAATTTTAA CTCTTTTGTC CGGAATCTGT ACATCGTTTT TCCGGCAGAA AGCTGTTTTT GCGGTGCTGA CTTTTGTCTT TGCACTGTTG ACACTGGTAT TTGTTTATAT GGGAAAAGCC GTAAGGGACA ATCTTCAAAG ACTTGATCAC AATAGGAATA TTCTTCTTTC AGAAATGCGG GACATAGACA GGGAGCTTTC CGCAAAACAA GAGGAAATCC GGCGGATTTT TTCCGTGGCG GGTGTGGAAA ATGAGGGTGA GTTTATTAAG AAGAAAACCC TGTACGAAAA CAAGGTTCTC CGCCTTGCCG AATTAAACGG CAGCATGGAT GAGCTTGAAA GGGAAATGGA TGAGAACCGG ATATATATTG AAAAAATTAA GACTTTAATG CTTGACAGAC TTGGAACGTG CGGTATAATT GCTTTGGAAG AAAATGAAAT AAAATCAGAG CATGTAAAAA CTTTTAGAGA AGGCCTTGCA AAATACCTGG AAGCAATTGA AAACTTAAAA AGGCTCAATG AAAAGCGGGA GGATGCTGCC AAATACCTGC AATCTCTTTA TGACAGGGCG TCTTCATTGT TTGGTGAGAG CTTTGCCAAA AAAGAAGATT TGTTGAGAAG CCTTGACGGG ATGGATCTAA AAATAAATGA GCTCTACGAA AAAATTGAGA AATATTCGAT GGAAATTCAG AATTCTTACG GCTTTACGGA TAATTCTCCG GAATATCATG AGCTGATGGA GAAAATTTAT GACGCTGAAT TTCAAAGTGC TGAAAGTTAT ATAGAAAATT TACTTTCAGA GCTAAATGGC AGGATTGACG AGATTGTGCT TGAGATGAGC AGGGACTGGG CTTTGGTGGA AAGAGGCTGT TTAATTGAAA ATGAAATTCA GGAACTTGAA GTAAAGACGG CAGAACTTGA AAGGGAGAAA GAACGTCTTC TGGATATTGG CAAAAGTTTA AAGACTGCGC TGGATGTCCT TGAGGAGGCG GCGCTTGAGA TAAAAAGGGA ATTTGCACCT TTGCTGAATC AAAAGCTTGG CAGCATAGCA GGTTTTATAA CGCAAGGCAA ATACAGTGAG GTAAGAGCCG ATGACAGTTT CATGATAAGA GCGTTGGAGC CGGGTACCCG GCGCATTGTG GAGCTGCCTT TTTTAAGCGG CGGCACCGTT GAACAGCTGT ACCTTGCATT AAGAATTGCC CTTGCAGAGA CTGTTGAAGA CGGCGGCGAA GTTTTACCCC TCATTATGGA CGAGGTGTTT GCGCATTATG ATGACACAAG GGTGTTTAGT ACTTTGAAGA TGCTTTTTGA GCTGTCGAAA GAGCGCCAGA TTATATTTTT TACATGCAAG GACAGAGAGA TGGAGGCAGC CACAGAGGTT TTCGGCAAAG ATTTAAATGT TATAAAACTG GGCACTTGTT GA
|
Protein sequence | MRIDKLDIRG FGKIHNLIIE FSKGFNLVYG ENEAGKTTVQ WFIRGMLYSL KGGKNTKGGA IPPLKKYSPW KGDFYGGSIV YTLDSGASFT VERDFNNNTV KIFDSFFNDI SDSFKKSKEK GPLFAVEHLG INEACFERTV FIGQMDTKVD ASGGRELVDK LANIRETGSE EVSLKKAREA LKDSLINYVG TDRSTTRPLD MVNLKLAELE GKKKELFKEK EKIFEAEEKL RKLSEQKNRY EKKREVFNLA RKVIELRKNV EEVKKKKREL VLIIKEAEKY EQERETLSQQ TELCNGVKKQ YEAYSKYAKD DPGLINILYN KLEDALKEKE RLCKKQETLM QEIGEIERSL EEYKAFRSFE EDVDGRVQNL SGSIKELEQK KRDVNVTALD ESVKAASYKL GFIKVGIGIL AILTLLSGIC TSFFRQKAVF AVLTFVFALL TLVFVYMGKA VRDNLQRLDH NRNILLSEMR DIDRELSAKQ EEIRRIFSVA GVENEGEFIK KKTLYENKVL RLAELNGSMD ELEREMDENR IYIEKIKTLM LDRLGTCGII ALEENEIKSE HVKTFREGLA KYLEAIENLK RLNEKREDAA KYLQSLYDRA SSLFGESFAK KEDLLRSLDG MDLKINELYE KIEKYSMEIQ NSYGFTDNSP EYHELMEKIY DAEFQSAESY IENLLSELNG RIDEIVLEMS RDWALVERGC LIENEIQELE VKTAELEREK ERLLDIGKSL KTALDVLEEA ALEIKREFAP LLNQKLGSIA GFITQGKYSE VRADDSFMIR ALEPGTRRIV ELPFLSGGTV EQLYLALRIA LAETVEDGGE VLPLIMDEVF AHYDDTRVFS TLKMLFELSK ERQIIFFTCK DREMEAATEV FGKDLNVIKL GTC
|
| |