Gene Cthe_0017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0017 
Symbol 
ID4808782 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp24056 
End bp25918 
Gene Length1863 bp 
Protein Length620 aa 
Translation table11 
GC content41% 
IMG OID640105427 
Producthypothetical protein 
Protein accessionYP_001036452 
Protein GI125972542 
COG category 
COG ID 
TIGRFAM ID[TIGR01131] ATP synthase subunit 6 (eukaryotes),also subunit A (prokaryotes) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000261883 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAAT TAAATTTGAA AAGTAAACTC GCCATTTTTG CCACAACGTT AAAAGAAGTT 
TTTATATCTT CGCTGCCTCT TGCGGCAATT ATGATTATTG TGTGCGGTTT TATCGCACCT
TTGGACAGTG GGGCGGAGTA TGTCAAATTA TTTGTCGGCT ATGCCAGTGT TGTTTTTGGC
CAGGCATTGT TTTTGGACGG TTTAAATATT AGTATTCTTC CCATAGGAAA ATTGGTTGGG
GGTTCGCTAA TAAAGCTTAA AAAATCAATC TTTGTTATTT TCTTCGGACT TCTTTTTGGC
GTGCTTGCCA CTGTCGCGGA ACCTGCACTG ACCGTTTTGG CCAAGCAGAC CAACATGATT
ATGCCGATTA TCAACGAAAC CGTGTTTATC TGGATTATGG GTTTTGGAAT CGGCGTGATG
CTTGCATTCT CCCTCTTTCG AATTATGAAG GACTTAAATA TTAAAGTGGT TTTTGCCATA
TTGTATGTCA TTACTTTTCT GTTGATTATA TTTGTTCCTG ATGAATTTGT TGCTTTGGCT
TTTGACGGAA GCGGTGCAAC TACGGGGGAC ATTTCGGTTC CGTTTATTTT GGCTTTGGGT
ATGGGTGTTT CCACTACCAT GTCCAGGCAC AAAAGCAATG ACGACAGCTT TGGGATTATT
GGCCTTGCTC CGGTGGGTCC GATTATTTCA TTGGCCATCT ACGGTATAGT ACTTAAGTTT
CTTTACAACG GTGTATTTCC TCCTGAACAG GTATACTCTC CTGAGACGGT GGGAACCGTT
GGTGAGATAA TTGTAAATAA TTTGTGGGGA GTTACATTGG CACTTTTGCC GGTTATCATA
GTGTTTTTAC CGTTTCAGTT TTTGTTAATC AAACAGCCGA AGAAAGAATT TGTGAAAATT
TTGTTAGGTA CTGTTGTAGT TTTTATAGGC TTGCTGATTT TTCTGGCAGG CATAGATTAT
GGATTTGCAT TTGCAGGCAA ATACATCGGG GAGGTTTTCC TGGATCCTTC ACGTCCCGAG
TGGTTTAAGT GGTTGCTTTT GATTGTTGCG TTTATTTTAG GTGCCGCCAT TACCTTGTCG
GAGCCTGCTG TTACGGTGCT GGGAGAACAG TTGGAAGAGA TGACCAACGG ACATATTGCA
AAGATGACAA TTCGCATGAC TCTTGCCATA GGTATTGGCT TTGCTGCTTT GCTTGGAATG
TTGAAAATAT TGACGGAGAT TAACATATTG TGGTTCTTAA TACCCCTGTA CGCCGTCGCT
CTTATCATGA TGATATTTGC GCCAAAGCTG TTTGTCGGTC TTGCTTTTGA CTCGGGAGGA
GTGGCCGGCG GAGCTTTAAC TTCTGCATTT TTGACGCCGC TTACCCTTGG GGTGGCACAG
GCTGTGGCTG CGACATCACC TTCCGGCGGA CAGCCGATTT TGGTCAACGG TTTTGGAATT
ATTGCATTTA TTTCAGTTAC CCCATTGATT GCTGTACAAT TTTTAGGTAT AGTGTATAAT
ATAAATATTA AGAAAGCGGA AAAAGCTCTG AAAGATGCTG AAATGAATGA TATAAAAGAG
TTGGCGTCCC TTGCGGGTAT CGTTGAAGAA GTCGCAGCTG AAAAAAGTGC GACTCAAGAA
AGTATAGCTG AGAAAGCTGC AGTTCAGGAA AGTATAGTTG AAAAAGGTGG GATTGAAAAA
AGTGTAGTGG ATGAAGAAAG TATAACCGAA AAAAGCATAG TTAAGGAAAA TATAGATGAA
GACGATATAG CGGCTAAAAT GGATGAACAG CGTCAAAATG AGCAGCATCA TGAAGATAAT
ATTGTGAAGG ACATGAAAGA CGGACACGAG GAACTTAAGG CAGGTAAAAA CAGTGCGGAG
TAG
 
Protein sequence
MKKLNLKSKL AIFATTLKEV FISSLPLAAI MIIVCGFIAP LDSGAEYVKL FVGYASVVFG 
QALFLDGLNI SILPIGKLVG GSLIKLKKSI FVIFFGLLFG VLATVAEPAL TVLAKQTNMI
MPIINETVFI WIMGFGIGVM LAFSLFRIMK DLNIKVVFAI LYVITFLLII FVPDEFVALA
FDGSGATTGD ISVPFILALG MGVSTTMSRH KSNDDSFGII GLAPVGPIIS LAIYGIVLKF
LYNGVFPPEQ VYSPETVGTV GEIIVNNLWG VTLALLPVII VFLPFQFLLI KQPKKEFVKI
LLGTVVVFIG LLIFLAGIDY GFAFAGKYIG EVFLDPSRPE WFKWLLLIVA FILGAAITLS
EPAVTVLGEQ LEEMTNGHIA KMTIRMTLAI GIGFAALLGM LKILTEINIL WFLIPLYAVA
LIMMIFAPKL FVGLAFDSGG VAGGALTSAF LTPLTLGVAQ AVAATSPSGG QPILVNGFGI
IAFISVTPLI AVQFLGIVYN INIKKAEKAL KDAEMNDIKE LASLAGIVEE VAAEKSATQE
SIAEKAAVQE SIVEKGGIEK SVVDEESITE KSIVKENIDE DDIAAKMDEQ RQNEQHHEDN
IVKDMKDGHE ELKAGKNSAE