Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2262 |
Symbol | |
ID | 4810000 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2690497 |
End bp | 2692452 |
Gene Length | 1956 bp |
Protein Length | 651 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640107668 |
Product | V-type ATPase, 116 kDa subunit |
Protein accession | YP_001038657 |
Protein GI | 125974747 |
COG category | [C] Energy production and conversion |
COG ID | [COG1269] Archaeal/vacuolar-type H+-ATPase subunit I |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.181857 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAATAG TTAAAATGAA TAAAATCTCA CTTATAGGGC TTGAATCTGA AAAGGAACGC ATCCTCGAAA ATCTCATGAA ATTAGGTGTC GTTGAGATTA CGGATGCAAA AGAAAAAATA TCTTCCGATG AATGGAAAGA GCTGGTAAAT ATTGACGGAG ACAGTGAAAG CGTCTCAAGG CTTGATGCTC AGATTGACAG AGTATCGGCC GTTATTGACT ATCTGGACAA ATTTGACAAG CGCAAAAAAC CTCTCTTTTC AGCGAGACGG GATATCAGCA CAAGCGAGTT GAACATGGTT TTGCAAAACC AGGATAAGCT TTGGTCCGTT ATTGATGAAG TCAACCGGTA TGATGAAATG CTCGCAAACT TAAAGGCGGA AAAAAACAGG AACTCAAATA TGATTTTGAG TCTCAAACCG TGGGAAGCTT TGGACATTCC CTTAGAACTT ACGGCAACTG CATCTTCCAC GGTTTTGATC GGGGTTGTGC CGGAAATGGC AAATACCGAT AAAATAAAAC AGGATCTCGA TGAAAAAGTC CCGGAAAGCC ATTTTGAAGT TCTTAGCAGA GACAAGGAGC AAAGTTATCT TCTTATCATA TATTTAACTT CCAAAGAAGA AGACGTTATG AATGTATTAA AGCAGTACGG CTTTTCCAAG GTAACTTTCA AAGAGCTTTC AGGAACTGTA AAACACAATA TCGATCAAGC TTTGGAAAAC ATAGAAAGAA TAGAAGAGGA AATCGAGTAT ATCGAAGAAA ATATGACATC TTATGTGAAA TACAAGGACG ATTTGGAAGT TCTTCACGAT TACCTGTCCA TTGAAAGAGA GCGAAAAATC GTACTCTCAA ATCTGCTCAA AACCAACAAG GTGTTCATGC TGGAAGGATG GCTCCCCGAA AACTCGGCGG AAGAAGTTAA AACCTTCCTT GAAAAAAGCA GTGATTGCTA CATAGAAATC GTCAAGCCGA AAGAAGACGA GGAATTCCCT GTGCTGCTTG CCAACAGGGC CATCCCAAGT ACTGTGGAAT CAATAACCAA CATGTACAGT GTGCCAAACT GTAAAGAAAT TGACCCGAAC GCAATAATGG CTCCATTTTT TATATTGTTT TTCGGACTTA TGCTAAGCGA CGGTGGTTAT GGTGCCATAA TGACCATACT GGCAACTATA ATCCTTAAGG TGTTCAAGCT TGAAGAAAGT ACGAAAAAGT TTATGAAACT CATGGTTTAC TGTGGTATTT CCACAATGTT CTGGGGCTTG CTTTTCGGAG GCTGGTTCGG TATTCCGAAC ATACCGGCAG TGTGGTTCAA TCCTACAGAA GATCCGGAAC TGTTGCTTAG TTTCTCGTTG CTCTTTGGGG CCATCCACAT ATATGTCGGT CTTGGAGTTC GGGCTGCAAA CCTTATCAAG GATAAAAAAT ACCTTGATGC GGTTTTTGAT TCGCTGTTCT GGTATATATT GTTTACCGGA TTCATACTGT TTGTACTTCC CTATATCCCA AAGATTGACG CCGAAAGTGT AACCGGTCTG GTAAACTTAG GCAAGTATCT TATGATTATC GGTGCAGTTC TTTTGATTCT TACCCAGGGC AGAGGAAACA AAAATATTAT CGCAAAGCTT TTTGGCGGTG TTGCAAGCCT TTATGACCTT ATAAGCTTCA TGAGTGACGT TTTGTCCTAC TCAAGACTTC TTGCACTTGG TCTTGCAACT TCGGTTATTG CGTCCATTAT TAACCAGATG GCAACAATGT TTGGTTTCAA CAACATATTA AAAATAATTG CCGTAGTCGC CATTCTGGCT TTTGGACACC TGTTCAATTT TGCAATCAAT GCGCTGGGAG CATATGTTCA CTCTTGCAGG CTGCAGTACA TTGAGTTTTT CGGAAAGTTT TACAAGGGCG GAGGCACAGC CTTTGAACCC TTTAAAGCAA AAACAAAATA TATAAATCTA AAATAA
|
Protein sequence | MAIVKMNKIS LIGLESEKER ILENLMKLGV VEITDAKEKI SSDEWKELVN IDGDSESVSR LDAQIDRVSA VIDYLDKFDK RKKPLFSARR DISTSELNMV LQNQDKLWSV IDEVNRYDEM LANLKAEKNR NSNMILSLKP WEALDIPLEL TATASSTVLI GVVPEMANTD KIKQDLDEKV PESHFEVLSR DKEQSYLLII YLTSKEEDVM NVLKQYGFSK VTFKELSGTV KHNIDQALEN IERIEEEIEY IEENMTSYVK YKDDLEVLHD YLSIERERKI VLSNLLKTNK VFMLEGWLPE NSAEEVKTFL EKSSDCYIEI VKPKEDEEFP VLLANRAIPS TVESITNMYS VPNCKEIDPN AIMAPFFILF FGLMLSDGGY GAIMTILATI ILKVFKLEES TKKFMKLMVY CGISTMFWGL LFGGWFGIPN IPAVWFNPTE DPELLLSFSL LFGAIHIYVG LGVRAANLIK DKKYLDAVFD SLFWYILFTG FILFVLPYIP KIDAESVTGL VNLGKYLMII GAVLLILTQG RGNKNIIAKL FGGVASLYDL ISFMSDVLSY SRLLALGLAT SVIASIINQM ATMFGFNNIL KIIAVVAILA FGHLFNFAIN ALGAYVHSCR LQYIEFFGKF YKGGGTAFEP FKAKTKYINL K
|
| |