Gene Cthe_2268 details

Gene Information

Locus tag: Cthe_2268
Symbol:
ID: 4809857
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2696804
End bp: 2698186
Gene Length: 1383 bp
Protein Length: 460 aa
Translation table: 11
GC content: 48%
IMG OID: 640107674
Product: V-type ATP synthase subunit B
Protein accession: YP_001038663
Protein GI: 125974753
COG category: [C] Energy production and conversion
COG ID: [COG1156] Archaeal/vacuolar-type H+-ATPase subunit B
TIGRFAM ID: [TIGR01041] ATP synthase archaeal, B subunit
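
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source database.

```python
# Values copied from the gene record above.
START_BP = 2696804
END_BP = 2698186
GENE_LENGTH_BP = 1383
PROTEIN_LENGTH_AA = 460


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1383, matches Gene Length
print(coding_codons(GENE_LENGTH_BP))   # 460, matches Protein Length
```

Under these assumptions both printed values agree with the Gene Length and Protein Length fields of the record.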


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGAAGG AATACAGAAC AATAACCGAG GTTGCCGGTC CTCTCATGCT GGTACAGAAA 
GTTGAAGGTG TAAAATACGG CGAACTCGGT GAAATAGAGC TGGCAAACGG TGAAATAAGA
AGATGCAAGG TATTGGAAGT TGACGGGCAA AACGCATTGG TTCAGCTTTT TGAAAGTTCT
ACAGGTATAA ACGTTGCAAC CAGCAAAGTA AGGTTTTTGG GAAGAAGTAT AGAGCTTCCC
GTATCAATGG ATATGCTCGG AAGAGTATTC AGCGGTATGG GAAAGCCCCT GGACGGCGGT
CCGAATATTA TTCCCGACAA AAGGCTTGAC ATAAACGGTC TTCCTATGAA CCCAGCGGCA
AGAAACTACC CTTCGGAGTT CATACAGACG GGTATTTCGG CCATTGACGG ACTGAACACC
CTGGTTCGCG GCCAGAAGCT CCCCATATTC TCCGGTTCCG GTCTTCCCCA TGCCCAGCTT
GCGGCACAAA TTGCAAGGCA GGCAAAGGTT TTGGGTACGG ACAGCAAATT TGCCGTTGTA
TTTGCGGCTG TAGGTATTAC CTTTGAGGAA GCTGACTACT TTATCAGTGA CTTTAAGAGA
ACCGGAGCCA TAGACCGTAC CGTACTGTTT ATAAATCTGG CAAACGACCC TGCCATCGAG
CGTATTTCAA CTCCACGTAT GGCGCTTACG GCAGCCGAAT ACCTTGCTTT TGACAAAGGC
ATGCACGTGC TCGTTATAAT CACCGACATA ACCAACTACG CCGAAGCGCT CCGTGAAGTA
TCCGCCGCAA GAAAAGAAGT TCCCGGAAGA AGAGGTTACC CGGGTTACCT TTATACCGAC
CTTGCGACAA TATATGAAAG AGCCGGAAGA AGAATTGACA GCGAGGGAAG TATCACTTTG
ATTCCAATAC TGACAATGCC CGAAGATGAC AAGACCCATC CTATCCCCGA CCTTACCGGA
TACATAACCG AGGGTCAGAT CATCCTAAGC AGAGAGCTTC ACCGCAAGGG AGTAACGCCA
CCGATAGACG TTCTTCCGTC CCTCTCCCGT CTTAAGGACA AGGGAATCGG AAAAGGCAAA
ACCCGTGAAG ACCATGCGGA TACAATGAAC CAGCTCTTTG CCGCTTACGC AAGGGGTAAG
GAAGCCAAGG AACTTGCCGT AATCCTCGGA GATGCGGCTC TTTCCGACAC GGATAAGCTG
TACGCCAAAT TTGCGGATGC TTTTGAAAAG GAATATGTAT CCCAAGGTTT TAATGAAGAC
AGATCAATTG AAAAAACCCT TGAAATCGGC TGGAAGCTGC TTTCAATACT TCCAAGATCG
GAGCTTAAGC GTATTCGTGA CGAATACCTT GACAAATATT TGCCCAAAGC GGCAGAAAAT
TAA
 
Protein sequence
MLKEYRTITE VAGPLMLVQK VEGVKYGELG EIELANGEIR RCKVLEVDGQ NALVQLFESS 
TGINVATSKV RFLGRSIELP VSMDMLGRVF SGMGKPLDGG PNIIPDKRLD INGLPMNPAA
RNYPSEFIQT GISAIDGLNT LVRGQKLPIF SGSGLPHAQL AAQIARQAKV LGTDSKFAVV
FAAVGITFEE ADYFISDFKR TGAIDRTVLF INLANDPAIE RISTPRMALT AAEYLAFDKG
MHVLVIITDI TNYAEALREV SAARKEVPGR RGYPGYLYTD LATIYERAGR RIDSEGSITL
IPILTMPEDD KTHPIPDLTG YITEGQIILS RELHRKGVTP PIDVLPSLSR LKDKGIGKGK
TREDHADTMN QLFAAYARGK EAKELAVILG DAALSDTDKL YAKFADAFEK EYVSQGFNED
RSIEKTLEIG WKLLSILPRS ELKRIRDEYL DKYLPKAAEN
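
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in the permitted start codons), so the sketch builds the standard codon table. The helper name `translate` is hypothetical, and only the first and last few codons of the gene are used here rather than the full sequence.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, in the conventional TCAG order
# (first base varies slowest, third base fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace is ignored, so the space-separated blocks of 10 bases
    used in the record can be pasted in directly. Any character other
    than A, C, G or T raises KeyError.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCTGAAGG AATACAGAAC AATAACCGAG"))  # MLKEYRTITE

# Last four sense codons plus the TAA stop codon.
print(translate("GCGGCAGAAAAT TAA"))  # AAEN
```

Both outputs match the corresponding start and end of the protein sequence listed above.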