Gene Cthe_2606 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2606 
Symbol 
ID4809028 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3075458 
End bp3076978 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content47% 
IMG OID640108020 
ProductF0F1 ATP synthase subunit alpha 
Protein accessionYP_001038999 
Protein GI125975089 
COG category[C] Energy production and conversion 
COG ID[COG0056] F0F1-type ATP synthase, alpha subunit 
TIGRFAM ID[TIGR00962] proton translocating ATP synthase, F1 alpha subunit 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATTTAA GACCTGAAGA GGTAAGTGCG ATAATAAAGC AGCAAATAGA TAAATATGGA 
AACAAATCCC ATGTAGATGA TGTGGGCTAT GTATTACAGG CAGGGGACGG TATTGCGAGG
GTATACGGGC TCAACAACTG CATGTCCGGA GAGCTTCTGG AGTTTGAGAA CGGTGTATTC
GGAATGGCGA TGAACCTGGA AGAGGATAAC ATAGGCTGTG TGCTGTTTCG CGGCGAAAGA
GATGTAAAAG AGGGAACATT GGTGAAGAGA ACGGGAAAGA CTGTTGAGGT TCCTGTGGGA
AAGGCTCTAA TCGGAAGGGT GATTGATCCT CTTGGAAACC CCTTGGACGG CAAAGGAGAA
ATTGAGACGG AAAAGTTCAG GCCTATAGAA TATCCGGCAC CGTCAGTAAT GGACAGAAAA
CCGGTGAACA GACCTCTTCA AACAGGAATA ATGGCAATCG ACGCAATGGT TCCCATAGGA
AGAGGGCAAA GGGAGCTTAT AATCGGAGAC AGACAGACCG GTAAAACCGC TATAGCCGTG
GATACAATAC TGAATCAAAA AGGCAAGGAT GTAATATGCA TTTATGTTGC CATAGGTCAA
AAAGCTTCTT CGGTTGCCGA GGTGATAAAC ACCCTCGAGG AAGGAGGGGC CATGGAATAT
ACGGTGGTGG TGTCTTCCAC GGCCAGTGAG ATGCCAACAC TGCAGTATAT AGCACCTTAT
GCCGGCTGTT CCATTGCGGA AGAGTTTATG TACAATGATC ACAAAGATGT ATTGATAGTG
TATGACGACC TGTCCAAACA TGCCGTTGCA TACAGGGCAA TGTCGCTGCT TTTAAGAAGG
CCGCCGGGAA GAGAGGCTTA TCCCGGTGAC GTGTTTTATC TTCATTCGAG ACTGCTGGAG
AGGGCGGCCC AGCTTAGTGA CGAACTGGGA GGCGGTTCGA TAACCGCATT GCCCATAATT
GAGACCCAGG TGGGTGATGT TTCTGCATAC ATTCCGACCA ACGTAATATC CATAACCGAC
GGACAGATAT ATCTTGAAAC TGAGTTATTT TATTCCGGAC AGAGACCTGC CGTAAATGTC
GGGCTTTCGG TTTCAAGAGT GGGTGGTGCG GCGCAGATTA AGGCCATGAA AAAGGTTGCG
GGAGCACTCA GAATAAATCT TGCCCAGTAC AGGGAGCTGG CAGTGTTTGC ACAGTTTGGA
TCCGACCTTG ACAAAGTGAC CAAAGACAAA CTTATACAGG GGGAAAGACT GGTAGAGAGC
CTGAAACAGT CAAGGCGTGC AACCATGCCG GTGGAAGACC AGGTAATAGT GCTTTATATG
GCTACCAACA AGTATCTTAT GGATTTACCG GTAAAAGAAG TCAGGAGTTT CAACAAGGAG
TTTGTGAAGT TCGTAAACAG CAATTATCCG GAGATTCCCA ATGAAATAAG AGCTACGGGG
GATCTCAGCA GTGAAACCGA GAATATGCTG AAAAAAGCTG CAGAGGAATT CAAAGACCAA
TATCTCAGAA CAAAGAGATA A
 
Protein sequence
MDLRPEEVSA IIKQQIDKYG NKSHVDDVGY VLQAGDGIAR VYGLNNCMSG ELLEFENGVF 
GMAMNLEEDN IGCVLFRGER DVKEGTLVKR TGKTVEVPVG KALIGRVIDP LGNPLDGKGE
IETEKFRPIE YPAPSVMDRK PVNRPLQTGI MAIDAMVPIG RGQRELIIGD RQTGKTAIAV
DTILNQKGKD VICIYVAIGQ KASSVAEVIN TLEEGGAMEY TVVVSSTASE MPTLQYIAPY
AGCSIAEEFM YNDHKDVLIV YDDLSKHAVA YRAMSLLLRR PPGREAYPGD VFYLHSRLLE
RAAQLSDELG GGSITALPII ETQVGDVSAY IPTNVISITD GQIYLETELF YSGQRPAVNV
GLSVSRVGGA AQIKAMKKVA GALRINLAQY RELAVFAQFG SDLDKVTKDK LIQGERLVES
LKQSRRATMP VEDQVIVLYM ATNKYLMDLP VKEVRSFNKE FVKFVNSNYP EIPNEIRATG
DLSSETENML KKAAEEFKDQ YLRTKR