Gene Moth_0309 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0309 
Symbol 
ID3831776 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp313280 
End bp314527 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content61% 
IMG OID637828244 
ProductAAA ATPase 
Protein accessionYP_429186 
Protein GI83589177 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1222] ATP-dependent 26S proteasome regulatory subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.902003 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGTTGAGC TTAAGGGTAT TATCCTCGAT TTCCGCCAGG CGGTGGCGGC CCAGAACTTA 
AGCAAGGTCG CCTTTGACGC CGCCCTGACG GTGGGTGTCC TCGTCCTTCT GGCCGAGGTG
GCGGCCAATT ACAAAAAGCG CCAGTACGTA AAAAACGCCC TGGTCGGCCT CATCGCCTTC
ATCATTACCT ACGCCTTTAT CCGTTACTTC CCGGCCAGGG GCTCCGAATT CGAGCTGGTG
ACCATGCTCC TGGGGCTCTG GTACGCGGCC CAGGTTATTT ACAATCTCGC CGGCCGCACC
TTGAAGACAT ATAGCGAAGC CTTTACCGGC GGCTGGCACG CCTTCAGGCA GGCGATAAAT
GGCAAGAAAG CCTACCAGCA TAGTTACCGG CAGACTTACC AGGCTTACCA GGAAGCCAGC
CAGGGCAACA ACCAGGAAAA AGAAGCTTCC ACCGCAGGAC CACCGCCTGG AGTGGAATAC
CTGCCGCCCC GGCGCCCGGA CCCCAGGGCC TTCGACGGCC TTATCGGCCT GGACAAAGCG
ATTGACGCCA TCAAAACGGC CCTGGAACTG CCATTGAAGC AGCCGGAAAA GATCCGGGAG
TACAACCTGG AATTGCCGCG GGGGATCCTG CTCTACGGGC CTCCCGGCAC CGGCAAGACG
AGTTTTGCCC GGGCGGCGGC CCGGTACTTC GGCTGCTCCT TCTATGCCGT TAACGCCTCC
TCCCTTATAG GCCGTTATGT AGGCACCAGC GAGGCCAATT TGCGTAACCT CTTCGCCCAC
GCCCGCCGTC ACCGGCCGGC GGTGATCTTT TTCGACGAGA TCGACGCCAT CGGCCGCCGC
CGCGACGGCA GCGACATGAA CCGCGCCTCG GACATCCTAC TGCAGCTACT CCTGGGCGAG
CTGGACGGCT TCGCCAGCCG GGAAGGGATC TTTATTATTG CCGCCACTAA CCGCGCCGAT
GTGCTGGATG AGGCCCTGGT GCGGCCGGGC CGCCTGGACC AGAAGATCGA ACTGCCTCTA
CCGGGCGCCC GCGCCCGGAG GCAGCTCTTT GAGGTCTACC TCCGGAACAG GCCCACCGAA
TTAAACGAAA CCGACTACCA GACCCTGGTG GCCAGGACGA CAGGGGCTTC CGCCGCTGAC
ATCAAAGCTG TTTGCGACCG GGCCGCCCTG GCGGCCTCCC GCGTCCGGGC CAGGATAGAT
TGCGCTTACC TGATTGAAGC CATTAACGAA TTAAGGGGGA GAGGATGA
 
Protein sequence
MVELKGIILD FRQAVAAQNL SKVAFDAALT VGVLVLLAEV AANYKKRQYV KNALVGLIAF 
IITYAFIRYF PARGSEFELV TMLLGLWYAA QVIYNLAGRT LKTYSEAFTG GWHAFRQAIN
GKKAYQHSYR QTYQAYQEAS QGNNQEKEAS TAGPPPGVEY LPPRRPDPRA FDGLIGLDKA
IDAIKTALEL PLKQPEKIRE YNLELPRGIL LYGPPGTGKT SFARAAARYF GCSFYAVNAS
SLIGRYVGTS EANLRNLFAH ARRHRPAVIF FDEIDAIGRR RDGSDMNRAS DILLQLLLGE
LDGFASREGI FIIAATNRAD VLDEALVRPG RLDQKIELPL PGARARRQLF EVYLRNRPTE
LNETDYQTLV ARTTGASAAD IKAVCDRAAL AASRVRARID CAYLIEAINE LRGRG