Gene Cthe_0054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0054 
Symbol 
ID4808749 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp71948 
End bp74137 
Gene Length2190 bp 
Protein Length729 aa 
Translation table11 
GC content44% 
IMG OID640105463 
Producthypothetical protein 
Protein accessionYP_001036488 
Protein GI125972578 
COG category[S] Function unknown 
COG ID[COG1649] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000080624 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGAGAC GTTGCAAAAG ACTTGTATGT ATTTTGTTGT CATTTCTTGT GATTGCCGGT 
GTTTTTACGT TCAGTGGTGC AAAAACCGAG CCCTGGAGTG CCTATCAAAA GTTTATACCC
AATGAGACAC CGGTTGTAAA AAGGCATCTT AGGGGAGTGT GGATTAGTAC TGTTGCAAAC
CTTGACTGGC CGTCCGTAGA GACACGAAAA ATAGAAAATC CTTCGGAACG GATAAGAAAA
ACTAAAGAAG AGCTTGTGGA GATTTTCGAC AAGGCTGTGG AGATGAATTT AAATGCCGTT
TTCCTCCAGG TCAGTCCGGA GGGGGATGCA TTCTACAAAT CAGATATAGT GCCCTGGTCA
CGTTATCTTA CAGGAACCTT TGGAGAGGAC CCCGGCTTTG ATCCCTTGGA GTTTGCAATT
GAGGAAGCTC ACAAGCGAAA TCTGGAGCTC CATGCATGGT TTAATCCTTA CAGGGTGTCG
ACAAACACGT CGGCTGCAAC CATTTCATCT TTAAAAGTCG AAAAAAGTGT GTACAAGGAA
CATCCTGACT GGATTAGGAC AGCCATGAAC AGGTTTGTTG TCGATCCTGG AATACCTGAA
GCGAGGCAAT GGGTGATTGA CCGTGTTATG GAGGTGGTGA AAAAATATGA TGTGGACGGA
GTGCATTTTG ACGACTATTT TTATTATGAG CAATATGTGG GAGAGCTGAA GGATCAGGAT
ACTTACAACA AGTACAATAA GGGACAGTTT TCCAATATAG GCGATTTCAG AAGAAACAAC
ACGTATTTGC TGGTAAAGGA GCTTTCGCAG AAGATAAGGG CAACCAAGCC CTGGGTTAAA
TTTGGCATTA GTCCTTCCGG CGTATGGGGG AACAAAAGCG ACGGCCACAG CTACGGTTCC
AATACGAGTG CAAGTCTTAC AAATTACGAT AAAAGTTTTG CGGATACAAA GAAATGGGTT
CAGGAGGAGC TTATCGATTA CATTGCTCCC CAGGTTTATT TCACTTTTGC AAATTCCAGA
GCACCTTACG GTGAGATTGC TTTGTGGTGG TCGGATGTTT GCAGGGGGAA AAATGTGCAT
CTTTATATAG GTCAGGCGTT TTATAAGATA AATGATGACA GCGATCAATA TTTTAAAGGT
GAGAATGCTG TGCCGGAGCT GACAAGGCAA TTGAAATTCA ATGCGGTAAA ACCTGAGATA
ATGGGAACTG TTTTGTTCCG TTTTGCAAAT TTTAAAGATT CCGGTAAACA GCAGGCGGTA
AATGCCGTAA AGAATGACTT GTGGTCACAA AAAGCCTTGA TTCCACCAAT GCCGTGGAAG
GGCGGCAATG CTCCTGATGC GCCTATACTG GGAAGATTGG AATCCCTGCC CGACGGAGTG
GAAATATCGT GGATGGATAA TGACCCGGAC ACCGCGTATT TTGCAATTTA CCGCTTTAAT
GCCGGAGAAA AAATGGACAT TACCTCTGAC AGCAGTGCAT ACAAACTTAT TGCCACTGTC
AGAAAAAACA GTAACGGTGT GCAGAAATTT GTGGATTATG GGGTTTTGGA TGCTGACAGC
GTATATTATG TTGTTACTGC TTTGGACCGG CTGCACAATG AAAGTGAAGG ACTTGCAATA
AGCACCAATC AGTCCGAATA TTTTCCGGAT GTCGGGATGA AATATTCCTG GGCCGTTGAT
GCAATTGACA TGCTTTATGA AAAAGGAGTT GTCAAGGGTG ATGAAAGCGG GATGTTCAAC
CCGGGGGTGA ACACGAAAAG AGCTGATTTT ACTATTATGA TTGTAAAGGC GCTGGCCCTG
AAAGCTGATT TTGAAGACAA TTTTGCCGAT GTCAGGAAAG ATGCATATTA CTATGAGGCG
GTAGGCGTTG CCAGGGCTCT GGGAATTGTA AAAGGAGACG GAAAGAATTT TAATCCCGAT
GCCAATATAA CCCGGGAGGA TATGATGGTT ATCGTGGTCA ATGCCCTTAA AGCGGCCGGG
GCAAAGATTG ACGAAGCCGA TGAGCAATTC CTTGAAAATT ACGGTGATGC GAACAGTATA
AGCGGCTATG CAAGGAAATC GGTGGCTGTT CTTACGAAAG CGGGAGTTGT AAACGGCTAC
GACGGGAAAA TACATCCTAA AAGTCTGGCC ACAAGGGCTG AGATTGCAGT GGTAGTATCA
AAGCTGTTAA CCAATATTGA GTATTTATAA
 
Protein sequence
MERRCKRLVC ILLSFLVIAG VFTFSGAKTE PWSAYQKFIP NETPVVKRHL RGVWISTVAN 
LDWPSVETRK IENPSERIRK TKEELVEIFD KAVEMNLNAV FLQVSPEGDA FYKSDIVPWS
RYLTGTFGED PGFDPLEFAI EEAHKRNLEL HAWFNPYRVS TNTSAATISS LKVEKSVYKE
HPDWIRTAMN RFVVDPGIPE ARQWVIDRVM EVVKKYDVDG VHFDDYFYYE QYVGELKDQD
TYNKYNKGQF SNIGDFRRNN TYLLVKELSQ KIRATKPWVK FGISPSGVWG NKSDGHSYGS
NTSASLTNYD KSFADTKKWV QEELIDYIAP QVYFTFANSR APYGEIALWW SDVCRGKNVH
LYIGQAFYKI NDDSDQYFKG ENAVPELTRQ LKFNAVKPEI MGTVLFRFAN FKDSGKQQAV
NAVKNDLWSQ KALIPPMPWK GGNAPDAPIL GRLESLPDGV EISWMDNDPD TAYFAIYRFN
AGEKMDITSD SSAYKLIATV RKNSNGVQKF VDYGVLDADS VYYVVTALDR LHNESEGLAI
STNQSEYFPD VGMKYSWAVD AIDMLYEKGV VKGDESGMFN PGVNTKRADF TIMIVKALAL
KADFEDNFAD VRKDAYYYEA VGVARALGIV KGDGKNFNPD ANITREDMMV IVVNALKAAG
AKIDEADEQF LENYGDANSI SGYARKSVAV LTKAGVVNGY DGKIHPKSLA TRAEIAVVVS
KLLTNIEYL