Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2276 |
Symbol | |
ID | 4809865 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2706755 |
End bp | 2707933 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640107682 |
Product | AAA ATPase, central region |
Protein accession | YP_001038671 |
Protein GI | 125974761 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1222] ATP-dependent 26S proteasome regulatory subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00996257 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAACA AAAGAAACGT ATTGATTATT TCCATCGCTA TAATTGCAGT CATCGTCGGC ATAACTCTTT ATTTTGAAAG AGACAACTCT CAATATATGC CTTACCCTGA TTTTTACAAT TACGTGAAGT CAGGCAAGGT GGTTTCCGCA AAAATAGGAA ATGATGAAGT AAGGTTTTAT TTAAACGGTG ACCAGACAGA ATACCGCACC GACAATCCGG AACTGGATTC CTTCAAAGAA TTTTTACTCC TTAACGGAGT TAAAGTTACT TCGGAAAAAA GTGCCGATGA ACTTTTCGTC ACGGTTACTG ATGCAATTTT CAGCATTATT TTTTTCGGGA TTATAATATT CGGCTTGTAT AAATTTATTG ATTTCAGAAG AAGCACATTT AAAGTTATTC GCAACAACGA CACAAAATTT TCAGATGTTG CCGGCATGGA AGATTTAAAA AGAGAAATGC TTCAGGCCGT GGACATCTTA AAACATCCCA AAGAGTATGC GGCAAAAGGA ATACGCCCGA TAAACGGTAT TCTGCTTGAA GGCAATCCCG GAAACGGGAA AACATTGTTT GCAAGGGCAT TGGCAGGAGA AGCCAAAGTC AATTTCATTG CCACCAAGGC CACGGATTTT CAAAGTGCAA TCATGTCCAT AGGACCTGCC AAAATTAAGG CCCTGTTTAG AAAAGCCCGT GCCAACAAGC CCTGCATCAT ATTCATAGAC GAGTTTGATG GAATAGGTGA AAAACGCAAT TATGCCGGAA CCGGAATTGA CAAAGAAAAC AACAGAATTA TTGCCGCAAT GCTAAATGAA ATGGATGGTT TTACCCGTGA AGGCGGTGTT ATGGTTATTG CCGCAACCAA CAATTACAAG GCCCTTGATG AAGCATTGGT GCGTCCGGGA AGGTTTGACA AAAAATACAC CGTTCCTAAC CCGGATTATA AAACCAGAAT TGAATTGATA AAAATATATA CAAAAAACAA AAAATTGTCC GAAAGCATAT CAATTGAACA ACTGGCCGCA AAATTTGAAG GCATGACCTG TTCCCAGATA GAAACAATAC TTAATGAAGC CGCAGTAATT GCCACCGGCG AAGGTCACAG TGATATAACG GAAAGCGACC TTATCGAGGC AGTAAAAAAG ATATGTCATC TTACAACAGT AGGAGTAAAA AGAAAATAA
|
Protein sequence | MKNKRNVLII SIAIIAVIVG ITLYFERDNS QYMPYPDFYN YVKSGKVVSA KIGNDEVRFY LNGDQTEYRT DNPELDSFKE FLLLNGVKVT SEKSADELFV TVTDAIFSII FFGIIIFGLY KFIDFRRSTF KVIRNNDTKF SDVAGMEDLK REMLQAVDIL KHPKEYAAKG IRPINGILLE GNPGNGKTLF ARALAGEAKV NFIATKATDF QSAIMSIGPA KIKALFRKAR ANKPCIIFID EFDGIGEKRN YAGTGIDKEN NRIIAAMLNE MDGFTREGGV MVIAATNNYK ALDEALVRPG RFDKKYTVPN PDYKTRIELI KIYTKNKKLS ESISIEQLAA KFEGMTCSQI ETILNEAAVI ATGEGHSDIT ESDLIEAVKK ICHLTTVGVK RK
|
| |