Gene Cthe_0051 details


Gene Information

Locus tag: Cthe_0051
Symbol:
ID: 4808746
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 66312
End bp: 67571
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 39%
IMG OID: 640105460
Product: hypothetical protein
Protein accession: YP_001036485
Protein GI: 125972575
COG category: [S] Function unknown
COG ID: [COG1306] Uncharacterized conserved protein
TIGRFAM ID:
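
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS length includes one terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def feature_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming it ends with a single stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


length_bp = feature_length_bp(66312, 67571)
print(length_bp)                     # 1260
print(protein_length_aa(length_bp))  # 419
```

Both values agree with the Gene Length and Protein Length fields of this record.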


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTAAAGA ACAAACAGTT GTTAATGGCT TTAAGTGTTT TTTTGATTGT TTTTACAACG 
ACTTATGCAT TTGTATATTA CAACGATGGT TTAAATAAAA ATAATCGGAA AATTAATGAC
AATCAAAATA TTCAGATAAA GCACTTTTTA CCAAATGAGA ACGGTAATAA TGAAAATGGC
AGCGAAAATA ATAATGAAGA AGAAAGTCTA TCCAGCCCCC GGCCTGAAAG AAAACCGGTA
AAGGTAAAAG GCCTGTATAT CACCGGGACT TCCGCCGGAA ACAAAAAGTT TATGGAAAGG
CTTGTAAATC TTATCAATAC AACGGAACTG AATACAGTGG TCCTGGATGT AAAAGAGGAC
GGAAAAGTCA ATTATGCTTC CGAGGTGGAG AGTGTAAAAA AGATTGGTGC ATACCACGAG
TTGTATAATG TGGATGAAGT GATAAAGCTT TTGCATGACA ACAATATATA TGTAATTGGA
AGAATTGTTT GCTTCAGGGA TAACTATCTG GCAGGAAAGA GAGTGGACCT TGCCATAAAA
CGCAAGGACG GATCGATATG GAGGGAAAAC GGAAGTATAG CGTGGACAAA CCCATATAAC
AAAGAGGTCT GGAGATACAA TATTGACATA GCGAAAGAAG CGGTAAAGAA AGGTTTTGAC
GAGATACAGT TTGATTATGT AAGATTTCCC GCAGCAGGAA AAAATGAAGT TGATTACGGG
GAAAATCCTA TCCCTAAGGC TGATGCAATA TCGGGCTTTC TTAAAGAAGC GGCAAGTGAA
ATAAATAAAA TGGGTGTGCC GGTTTCGGCA GATATATTTG CCATTGTTTG TGAAACTCCG
GGTGACACCG AAGGCATAGG ACAGGTATTG GAGAGGATTG GAATGGATAT AGATTATATA
TCTCCGATGA TATATCCTTC CCATTTTGCC AATGCATCCC GTGGGATGAT GGGAAACGGA
AAAGGTCAGT CTATTAACGG TATACTTTTT ACGGCACCGG ATTTAAAGCC GTATGAAGTT
GTATATAATG TTCTTTTGAA AACAAAAGAC AGAATATCAA AAGTGGAGGG ATATAGAGCA
AAGGTAAGAC CGTATCTTCA AGGTTTTACG GCTTCTTATC TTCCGAAGGG TTATTATCAG
CATTATGGGC CGGAGCAAAT AAGGCAGCAG ATAAAGGCGG TCTATGACGC CGGGTATGAA
GAGTGGATAT TCTGGAATGC GGCAAACACT TACACGGAGT CAGCATTTGC CAGAGAATAA
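
The sequence is displayed in blocks of 10 bases, so the spacing has to be removed before any computation. As a minimal sketch, the hypothetical helpers below strip the display whitespace and compute a GC fraction. Only the first displayed line is used as input to keep the example short; running the same functions on the full gene sequence gives a value that can be compared with the GC content field.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence only, not the whole gene.
first_line = "ATGCTAAAGA ACAAACAGTT GTTAATGGCT TTAAGTGTTT TTTTGATTGT TTTTACAACG"
seq = clean_sequence(first_line)
print(len(seq))
print(f"{gc_fraction(seq):.1%}")
```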
 
Protein sequence
MLKNKQLLMA LSVFLIVFTT TYAFVYYNDG LNKNNRKIND NQNIQIKHFL PNENGNNENG 
SENNNEEESL SSPRPERKPV KVKGLYITGT SAGNKKFMER LVNLINTTEL NTVVLDVKED
GKVNYASEVE SVKKIGAYHE LYNVDEVIKL LHDNNIYVIG RIVCFRDNYL AGKRVDLAIK
RKDGSIWREN GSIAWTNPYN KEVWRYNIDI AKEAVKKGFD EIQFDYVRFP AAGKNEVDYG
ENPIPKADAI SGFLKEAASE INKMGVPVSA DIFAIVCETP GDTEGIGQVL ERIGMDIDYI
SPMIYPSHFA NASRGMMGNG KGQSINGILF TAPDLKPYEV VYNVLLKTKD RISKVEGYRA
KVRPYLQGFT ASYLPKGYYQ HYGPEQIRQQ IKAVYDAGYE EWIFWNAANT YTESAFARE
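
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is an illustration under stated assumptions: it builds the codon table from the standard NCBI amino acid string, which translation table 11 is understood to share with table 1 (the two differ in permitted start codons, which does not matter here because this gene starts with ATG). The `translate` function is a hypothetical helper, and only the first displayed line of the gene is translated.

```python
from itertools import product

# Codons in NCBI order (first base varies slowest, bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop display spacing
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First displayed line of the gene sequence (60 bases, 20 codons).
gene_start = "ATGCTAAAGA ACAAACAGTT GTTAATGGCT TTAAGTGTTT TTTTGATTGT TTTTACAACG"
print(translate(gene_start))  # MLKNKQLLMALSVFLIVFTT
```

The result matches the first 20 residues of the protein sequence listed above.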