Gene Cthe_0914 details

Gene Information

Locus tag: Cthe_0914
Symbol:
ID: 4810535
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1094627
End bp: 1097179
Gene Length: 2553 bp
Protein Length: 850 aa
Translation table: 11
GC content: 38%
IMG OID: 640106333
Product: hypothetical protein
Protein accession: YP_001037341
Protein GI: 125973431
COG category: [S] Function unknown
COG ID: [COG5373] Predicted membrane protein
TIGRFAM ID:
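
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is only illustrative: it assumes the start and end positions are 1-based and inclusive (the record does not state the convention), and the function names are hypothetical.

```python
def gene_length(start_bp, end_bp):
    # Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
    return end_bp - start_bp + 1

def protein_length(gene_len_bp):
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1

length_bp = gene_length(1094627, 1097179)
print(length_bp)                  # 2553
print(protein_length(length_bp))  # 850
```

Under that assumption, both values agree with the Gene Length and Protein Length fields of the record.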


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAAACA ATTTAAAAAT TATTCTAGCA AAGCAGAAAG AAATCATAAC AAGTCTTGAA 
AATGAAATTG CCGGCATCGA AGCCAACGAC CTTGTAAAAG AAAATCAAAA ATTGAAAGAA
GAGCTGGCAA AACTTAAATC AAGTTTGGAA AAGGAAAAAA TTGAGAACGA AAAACTCTCA
AAAGAAAACA AAAATTTGAG AAACATTTTG TATGAACAGT TCTACAACGA AAAAATCAGC
CTTTTGAACT CAGCGGAAAA AAGAATGGAC GTTTATTACA GGGCTCATGT GGAAGGAGAA
ATAAACAGGC TGACAAGATT TGAATTGTCG GTAAAAAGAC GTATTGATGA AATGACAAAA
GTACTTCGGG CCAACAGAGT AAGCCTTCAG GATGAAATTT ACATAAAGCT TGAGGAGCTT
AGAAATCTTC TTAACGTTAA AATAACCAAA GCCCGTGAAG AAATCATGCA ACAGACCGGA
GCATTTATTC AAAACAGAAA CGAGGGCCTT GCAAGGCTAA GACAGGAACA GGTTACGGAG
CAGGAAATAA AAGCCCGGGC AAAGCAAAAC AACATTGAAT CCATTATAGG ACTTAACATA
ATCAACAAAG TGGGTATATT TTTACTCATT GTAGGAGTAA TAACCGCGGC GCAATTTACA
TATTTCAGGC TGCCCGATAC CTTAAAAAGC GTTTTTACCT TTGCAGTGGG CGTTGTTCTG
CTCATAGCCG GTGAAATATT AAACCGCAAA AAGCCCAACG TTTTCTCACT GGGCATAACC
AGCGGAGGTA TTGCAATACT GTATGTGGCT CTTTGCCTCA GTTATTTTCA ATTTAAACTG
CTTGAAACAT ATCCGGCTTT AGGATTGTGT ATTCTCATAA CGGCAGGAAC TTTTGTCCTT
TCGCAGAGGT ACAATTCCCA GACCATATCG GCTTTTGCAC TGATCGGGGG ATATCTTCCC
ATCTTCTCGA TAACCGGTAT TAGTGCAATA GCATACGGTG CCATGGTTTA CTTTGTAATA
CTGAATCTTT TGGCTCTTAT TATTGCTGTC AACAAAAAAT GGGCTGTCAC TGCATACATA
GGATTTGTGC TGAACGTCAT AGGTTCCGTA TATATAGCAT CAATAATGTT TGGAGGACTT
TTCGCACCAT CGGAATTTTC ATTCGATTCC GTCATTACCA TAATTTATAT TTTGTTCGCT
TTTGTAATTT ATACTCTGAT ACCAATAGCG GGAACCTTCA GGCAAAAATT GAGTTTTAAA
ACTTCTGACA TTGTACTGCT CGCATTAAAC ACATTTATCA GCAGTGTACT TCTTTACTGG
TCATTTTATG CATCCGACCT CGAAGACTTT ACGGGACTTC TTGCCCTCAC CTTTGCAATT
ATTTATCTTG CCCTTGGAAG GTTTATTGAA AAAAACATGC CAAAGGAAAA GCAGGTTACC
ACTTTGTTTT ATCTTACAGG ACTTACCTTT GTCGTACTTA TAATACCTTT CCAGTTCGGC
AAAGTGTGGC TGTCCCTGGG TTGGCTGATT GAGGGTGTTG CCCTGCTTTC CTACGGTATA
TGCAAAGAAA TAAAAAGATT TAAAAAAGCC GGAATTGCAA TTTCATTTTT GTGCCTGTTA
ACTTTCCTTT CCTTTGACGT GTCACTGATT CAAGATTCTC TTTTTACCTT TAAATATTTT
GCAATAACTT TGGGCAGCAT CATCATTTTA GGAGCCCTCA TATACAAAAA GAATCTGGCA
AGCAAGGAGT CAAAACTGTT CAAGTATGCC ACATCCATAA ATCTGCTGAT CTTTTTGCTA
TACATAATCA GCAACGAACT AAAACCATTT TTATCCCAAT ATGTTCAGGA CACAAAATAT
GTTCAGGACA CAAAATTTGA CCTGGATTAC TTGATCTGCT CGGCAATGAT TTTAACAAGC
TTTCTTGTGG CATACACCCT GCCAAGGATA AAAGTTTTGT GCGACAATGT TATAAAAGGC
ATTTCAATGT CTATTTATGC TTTGGCACTG CTAACCCTCT TTTCCCTGAA TTTCACCTCC
CCCGTGAAAG GATATTTGGT CGAAATGCCT TTGGCTATAA GTATTGTCGG AACCCTGGAA
CTTGCTTTAA TAGCTTTCCT GTCCATCCTT GCCGTAAGGG ATTTGGTATT GTATTTTGTA
ATTGACCACA AACTGGGTAT TGAATGGTAC CCACTTGCAA TATCTCTTTA CTTTGTAATA
ATACTGACTC AAAATTTAAT TACCCAGTAC AGGCTTGAAT TCAGCAACGC GGCAATAAGC
ATTATTTACC TTGCCGCAGC TCTCACATGG ATAATATTCG GTTTTGCCAA AAGGTATGTA
TTCATCCGCC GTTTCGGACT TGCCATGTCC ATGCTTTCGG TGGCAAAGCT GTTCATTATC
GACCTTGCTT TCCTGACCCA AGGTTACAGA ATAGTATCCT ATTTTGTGTT CGGCATAATA
CTCCTTGCAA TATCTTTTGT ATATCAGTAT TTCAACAAAA AATTAGAAAA TATATGCGAG
GTAATGCCTG ATGATAAAAA GAATAGTAAT TAG
 
Protein sequence
MTNNLKIILA KQKEIITSLE NEIAGIEAND LVKENQKLKE ELAKLKSSLE KEKIENEKLS 
KENKNLRNIL YEQFYNEKIS LLNSAEKRMD VYYRAHVEGE INRLTRFELS VKRRIDEMTK
VLRANRVSLQ DEIYIKLEEL RNLLNVKITK AREEIMQQTG AFIQNRNEGL ARLRQEQVTE
QEIKARAKQN NIESIIGLNI INKVGIFLLI VGVITAAQFT YFRLPDTLKS VFTFAVGVVL
LIAGEILNRK KPNVFSLGIT SGGIAILYVA LCLSYFQFKL LETYPALGLC ILITAGTFVL
SQRYNSQTIS AFALIGGYLP IFSITGISAI AYGAMVYFVI LNLLALIIAV NKKWAVTAYI
GFVLNVIGSV YIASIMFGGL FAPSEFSFDS VITIIYILFA FVIYTLIPIA GTFRQKLSFK
TSDIVLLALN TFISSVLLYW SFYASDLEDF TGLLALTFAI IYLALGRFIE KNMPKEKQVT
TLFYLTGLTF VVLIIPFQFG KVWLSLGWLI EGVALLSYGI CKEIKRFKKA GIAISFLCLL
TFLSFDVSLI QDSLFTFKYF AITLGSIIIL GALIYKKNLA SKESKLFKYA TSINLLIFLL
YIISNELKPF LSQYVQDTKY VQDTKFDLDY LICSAMILTS FLVAYTLPRI KVLCDNVIKG
ISMSIYALAL LTLFSLNFTS PVKGYLVEMP LAISIVGTLE LALIAFLSIL AVRDLVLYFV
IDHKLGIEWY PLAISLYFVI ILTQNLITQY RLEFSNAAIS IIYLAAALTW IIFGFAKRYV
FIRRFGLAMS MLSVAKLFII DLAFLTQGYR IVSYFVFGII LLAISFVYQY FNKKLENICE
VMPDDKKNSN
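
The gene and protein sequences above are printed in blocks of ten characters, so they need whitespace removed before use. The following is a minimal sketch, not the database's own tooling, of how the gene sequence could be cleaned, translated, and checked for GC content. The helper names are hypothetical. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons, which this sketch ignores), and it assumes the input contains only A, C, G, and T.

```python
# Codon assignments in NCBI order (bases T, C, A, G). Translation table 11
# shares these amino-acid assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def clean(seq_text):
    # Remove the spaces and line breaks used in the block layout.
    return "".join(seq_text.split()).upper()

def translate(dna):
    # Translate codon by codon from position 0, stopping at the first stop codon.
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

def gc_fraction(dna):
    # Fraction of G and C bases in the sequence.
    return (dna.count("G") + dna.count("C")) / len(dna)

# First line of the gene sequence from the record.
first_line = "ATGACAAACA ATTTAAAAAT TATTCTAGCA AAGCAGAAAG AAATCATAAC AAGTCTTGAA"
print(translate(clean(first_line)))  # MTNNLKIILAKQKEIITSLE
```

The translated fragment matches the first twenty residues of the protein sequence. Passing the full gene sequence through `clean` and `gc_fraction` would give a value to compare against the GC content field.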