Gene Cthe_2262 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2262 
Symbol 
ID4810000 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2690497 
End bp2692452 
Gene Length1956 bp 
Protein Length651 aa 
Translation table11 
GC content40% 
IMG OID640107668 
ProductV-type ATPase, 116 kDa subunit 
Protein accessionYP_001038657 
Protein GI125974747 
COG category[C] Energy production and conversion 
COG ID[COG1269] Archaeal/vacuolar-type H+-ATPase subunit I 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.181857 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAATAG TTAAAATGAA TAAAATCTCA CTTATAGGGC TTGAATCTGA AAAGGAACGC 
ATCCTCGAAA ATCTCATGAA ATTAGGTGTC GTTGAGATTA CGGATGCAAA AGAAAAAATA
TCTTCCGATG AATGGAAAGA GCTGGTAAAT ATTGACGGAG ACAGTGAAAG CGTCTCAAGG
CTTGATGCTC AGATTGACAG AGTATCGGCC GTTATTGACT ATCTGGACAA ATTTGACAAG
CGCAAAAAAC CTCTCTTTTC AGCGAGACGG GATATCAGCA CAAGCGAGTT GAACATGGTT
TTGCAAAACC AGGATAAGCT TTGGTCCGTT ATTGATGAAG TCAACCGGTA TGATGAAATG
CTCGCAAACT TAAAGGCGGA AAAAAACAGG AACTCAAATA TGATTTTGAG TCTCAAACCG
TGGGAAGCTT TGGACATTCC CTTAGAACTT ACGGCAACTG CATCTTCCAC GGTTTTGATC
GGGGTTGTGC CGGAAATGGC AAATACCGAT AAAATAAAAC AGGATCTCGA TGAAAAAGTC
CCGGAAAGCC ATTTTGAAGT TCTTAGCAGA GACAAGGAGC AAAGTTATCT TCTTATCATA
TATTTAACTT CCAAAGAAGA AGACGTTATG AATGTATTAA AGCAGTACGG CTTTTCCAAG
GTAACTTTCA AAGAGCTTTC AGGAACTGTA AAACACAATA TCGATCAAGC TTTGGAAAAC
ATAGAAAGAA TAGAAGAGGA AATCGAGTAT ATCGAAGAAA ATATGACATC TTATGTGAAA
TACAAGGACG ATTTGGAAGT TCTTCACGAT TACCTGTCCA TTGAAAGAGA GCGAAAAATC
GTACTCTCAA ATCTGCTCAA AACCAACAAG GTGTTCATGC TGGAAGGATG GCTCCCCGAA
AACTCGGCGG AAGAAGTTAA AACCTTCCTT GAAAAAAGCA GTGATTGCTA CATAGAAATC
GTCAAGCCGA AAGAAGACGA GGAATTCCCT GTGCTGCTTG CCAACAGGGC CATCCCAAGT
ACTGTGGAAT CAATAACCAA CATGTACAGT GTGCCAAACT GTAAAGAAAT TGACCCGAAC
GCAATAATGG CTCCATTTTT TATATTGTTT TTCGGACTTA TGCTAAGCGA CGGTGGTTAT
GGTGCCATAA TGACCATACT GGCAACTATA ATCCTTAAGG TGTTCAAGCT TGAAGAAAGT
ACGAAAAAGT TTATGAAACT CATGGTTTAC TGTGGTATTT CCACAATGTT CTGGGGCTTG
CTTTTCGGAG GCTGGTTCGG TATTCCGAAC ATACCGGCAG TGTGGTTCAA TCCTACAGAA
GATCCGGAAC TGTTGCTTAG TTTCTCGTTG CTCTTTGGGG CCATCCACAT ATATGTCGGT
CTTGGAGTTC GGGCTGCAAA CCTTATCAAG GATAAAAAAT ACCTTGATGC GGTTTTTGAT
TCGCTGTTCT GGTATATATT GTTTACCGGA TTCATACTGT TTGTACTTCC CTATATCCCA
AAGATTGACG CCGAAAGTGT AACCGGTCTG GTAAACTTAG GCAAGTATCT TATGATTATC
GGTGCAGTTC TTTTGATTCT TACCCAGGGC AGAGGAAACA AAAATATTAT CGCAAAGCTT
TTTGGCGGTG TTGCAAGCCT TTATGACCTT ATAAGCTTCA TGAGTGACGT TTTGTCCTAC
TCAAGACTTC TTGCACTTGG TCTTGCAACT TCGGTTATTG CGTCCATTAT TAACCAGATG
GCAACAATGT TTGGTTTCAA CAACATATTA AAAATAATTG CCGTAGTCGC CATTCTGGCT
TTTGGACACC TGTTCAATTT TGCAATCAAT GCGCTGGGAG CATATGTTCA CTCTTGCAGG
CTGCAGTACA TTGAGTTTTT CGGAAAGTTT TACAAGGGCG GAGGCACAGC CTTTGAACCC
TTTAAAGCAA AAACAAAATA TATAAATCTA AAATAA
 
Protein sequence
MAIVKMNKIS LIGLESEKER ILENLMKLGV VEITDAKEKI SSDEWKELVN IDGDSESVSR 
LDAQIDRVSA VIDYLDKFDK RKKPLFSARR DISTSELNMV LQNQDKLWSV IDEVNRYDEM
LANLKAEKNR NSNMILSLKP WEALDIPLEL TATASSTVLI GVVPEMANTD KIKQDLDEKV
PESHFEVLSR DKEQSYLLII YLTSKEEDVM NVLKQYGFSK VTFKELSGTV KHNIDQALEN
IERIEEEIEY IEENMTSYVK YKDDLEVLHD YLSIERERKI VLSNLLKTNK VFMLEGWLPE
NSAEEVKTFL EKSSDCYIEI VKPKEDEEFP VLLANRAIPS TVESITNMYS VPNCKEIDPN
AIMAPFFILF FGLMLSDGGY GAIMTILATI ILKVFKLEES TKKFMKLMVY CGISTMFWGL
LFGGWFGIPN IPAVWFNPTE DPELLLSFSL LFGAIHIYVG LGVRAANLIK DKKYLDAVFD
SLFWYILFTG FILFVLPYIP KIDAESVTGL VNLGKYLMII GAVLLILTQG RGNKNIIAKL
FGGVASLYDL ISFMSDVLSY SRLLALGLAT SVIASIINQM ATMFGFNNIL KIIAVVAILA
FGHLFNFAIN ALGAYVHSCR LQYIEFFGKF YKGGGTAFEP FKAKTKYINL K