Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2267 |
Symbol | |
ID | 4810005 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2695033 |
End bp | 2696802 |
Gene Length | 1770 bp |
Protein Length | 589 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640107673 |
Product | V-type ATP synthase subunit A |
Protein accession | YP_001038662 |
Protein GI | 125974752 |
COG category | [C] Energy production and conversion |
COG ID | [COG1155] Archaeal/vacuolar-type H+-ATPase subunit A |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCCAGG GAACAATAGT TAAAGTCTCC GGGCCTTTGG TAATTGCCGA AGGCATGAGA GATGCAAACA TGTTTGACGT TGTACGTGTA AGTGAACATC GTTTAATTGG CGAAATAATC GAAATGCATG GAGATCGAGC CTCCATCCAG GTATACGAAG AAACCGCAGG CTTGGGCCCC GGCGAACCGG TGGTTTCCAC CGGAGCGCCT CTAAGTGTTG AGCTGGGACC TGGGCTTATT GAAAATATTT TTGACGGTAT TCAAAGACCT CTTGTAAAAA TGAGAGAAAT GGTTGGCAGC AACATAACAA GAGGTATTGA CGTTACTGCC CTTGACAGAA GCAAAAAGTG GGATTTTCAA CCTACCGTAA AAAAAGGTGA CAAAGTAACC GCCGGCGATG TAATAGGAAA AGTCCAGGAA ACTTCCATTG TGGAGCACAG AATAATGGTG CCCTATGGAG TACAGGGAAC AATTGAGGAG ATAAAGAGCG GAAGCTTTAC TGTGGAGGAA ACCGTCGCAA AGGTTCGGAC AGAAAACAAC GAACTGGTTG ATATCTGCAT GATGCAGAAA TGGCCGGTAC GTATCGGCCG TCCATATAGA GAAAAGCTCC CCCCCAACGC TCCACTTGTT ACAGGTCAAA GGGTTATAGA CACTCTATTC CCTTTGGCCA AAGGTGGAGT TGCGGCCGTA CCCGGACCTT TCGGAAGCGG TAAAACCGTG GTTCAGCACC AGCTTGCAAA ATGGGCCGAC GCTGATATAG TTGTCTATAT AGGCTGCGGA GAGCGCGGCA ACGAAATGAC CGACGTTTTA AAAGAATTCC CGGAGCTTAA AGACCCAAAA ACCGGCGAAT CTCTTATGAA GAGAACCGTT CTTATAGCAA ATACGTCAGA CATGCCTGTT GCGGCCAGAG AGGCATCCAT TTATACAGGC ATGACTATTG CGGAATATTT CAGGGATATG GGCTATAGTG TGGCGTTAAT GGCAGACTCC ACTTCCCGCT GGGCGGAAGC ATTAAGAGAA ATGTCCGGAC GTCTCGAAGA AATGCCCGGT GAAGAAGGTT ATCCGGCATA TCTTGGCTCA AGGCTTGCCC AGTTCTATGA AAGAGCGGGA AGAGTTGTAT GCCTTGGTTC CGACGGAAGA GAAGGTGCCC TTACCGCCAT CGGTGCCGTG TCACCTCCGG GCGGTGACCT TTCCGAACCT GTTACACAGG CAACACTGAG AATTATCAAA GTGTTCTGGG GGCTTGACTC AAGTCTTGCC TACAGACGAC ATTTCCCTGC AATCAACTGG CTGCAGAGCT ATTCGCTGTA CCTTGACATA ATAGGAAAAT GGATTAGTGA AAACATTTCA AGGGATTGGG AGACATTAAG ATCCGACACT ATGCGCATTC TGCAGGAGGA AGCGGAACTT GAGGAAATTG TGCGCTTAGT CGGTGTTGAC GCCCTGTCGC CGTCCGACAG GCTTACTCTG GAAGCCGCCA AGTCGATACG CGAAGACTAT CTCCACCAGA ATGCCTTCCA TGAAGTGGAT ACCTATACTT CATTAAACAA GCAGTACAGA ATGTTAAAAC TCATACTGGG ATTCTATTAC AGCGGCAAAA AAGCCCTGGA AGCAGGAGTA AGCATCAAAG AGCTGTTTGA ACTTCCTGTC AGGGAAAAAA TCGGAAGAGC GAAATATACG CCCGAGGATC AGGTAAACAG CCACTTCAAC GAAATTGAAA AAGAACTTAA TGAGCAAATA GAAGCCCTCA TCGCAAAGGA GGTGCAATAA
|
Protein sequence | MSQGTIVKVS GPLVIAEGMR DANMFDVVRV SEHRLIGEII EMHGDRASIQ VYEETAGLGP GEPVVSTGAP LSVELGPGLI ENIFDGIQRP LVKMREMVGS NITRGIDVTA LDRSKKWDFQ PTVKKGDKVT AGDVIGKVQE TSIVEHRIMV PYGVQGTIEE IKSGSFTVEE TVAKVRTENN ELVDICMMQK WPVRIGRPYR EKLPPNAPLV TGQRVIDTLF PLAKGGVAAV PGPFGSGKTV VQHQLAKWAD ADIVVYIGCG ERGNEMTDVL KEFPELKDPK TGESLMKRTV LIANTSDMPV AAREASIYTG MTIAEYFRDM GYSVALMADS TSRWAEALRE MSGRLEEMPG EEGYPAYLGS RLAQFYERAG RVVCLGSDGR EGALTAIGAV SPPGGDLSEP VTQATLRIIK VFWGLDSSLA YRRHFPAINW LQSYSLYLDI IGKWISENIS RDWETLRSDT MRILQEEAEL EEIVRLVGVD ALSPSDRLTL EAAKSIREDY LHQNAFHEVD TYTSLNKQYR MLKLILGFYY SGKKALEAGV SIKELFELPV REKIGRAKYT PEDQVNSHFN EIEKELNEQI EALIAKEVQ
|
| |