Gene Moth_0530 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0530 
Symbol 
ID3830915 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp549550 
End bp551268 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content62% 
IMG OID637828471 
ProductAAA ATPase 
Protein accessionYP_429403 
Protein GI83589394 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1067] Predicted ATP-dependent protease 
TIGRFAM ID[TIGR00764] lon-related putative ATP-dependent protease
[TIGR02902] ATP-dependent protease LonB 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0252109 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAAT GGGGCCAGGG CCTGGGCGGA GTCCTGGGCT TTGTCCAGAT ATTCTTTGCC 
GTGGTAATTG GCCTTTACTT CTGGAACCTC CTGCGCAACC AGCAGGGCAG CCGGGTGGCT
GTGGAAAGGG AATCTCGCAA AGAACTGGAG AAACTCCAGC GGATGCGGGA GATCTCCCTC
ACTGAACCCC TGGCGGAAAA GACCCGGCCC AGCACCTTTG CCGAGATTAT CGGCCAGGAA
GAGGGCTTGA AGGCCCTCCG GGCCGCCCTC TGCGGTCCCA ACCCCCAGCA TGTTATTATC
TATGGCCCGC CGGGGGTGGG TAAAACAGCC GCCGCCCGGC TGGTCCTGGA GGAAGCCAAG
GCCAACCCCC TTTCACCCTT TAAAGAAAAC GCTAAATTCG TCGAAGTCGA CGGTGCGACC
TCCCGGTTTG ACGAGCGGGG CATTGCCGAC CCCCTGATTG GTTCGGTCCA TGACCCCATC
TACCAGGGAG CGGGCCCCAT GGGCATGGCC GGTATTCCCC AGCCCAAACC GGGGGCCGTG
ACCCGCGCCC ATGGTGGTAT CCTGTTTATC GATGAAATTG GCGAACTGCA CCCCATCCAG
ATCAACAAGC TCCTCAAGGT CCTGGAAGAT CGCAAGGTTA TCCTGGAAAG CGCCTATTAC
AGCAGTGAAG ATACTAATAT TCCCAGCCAC ATCCATGATA TTTTTCGCAA CGGTTTGCCG
GCCGATTTTC GCCTGGTAGG GGCGACCACC CGCCTACCCC AGGAGATCCC GGCGGCCATT
CGTTCCCGCT GCCAGGAGAT CTATTTTCGG CCCCTGCTGC CCCATGAAAT CGGGTTAATT
GTCCAGAACG CCGCCCGGAA AATAAACTTA CACATTGCCC CGGAGGCCGT GGAGGTAATC
AAACAGTATG CCACCAACGG CCGGGAAGCA GTGAACATGG TCCAGCTGGC CGCCGGCCTG
GCCCTGACGG ACAGGCGCCA GGAGATTACC CGGGCCGATA TCGAGTGGGT GGTGACCAGC
GGCCAGTATA CCCCGCGAAT GGATAGCAAA GTACCTCCTG AACCCCAGGT GGGGGTGGTC
AACGCCCTGG CAGTCTATGG CCCCAATATG GGGCTGGTGG TGGAGCTGGA GGCCAGCGTG
AACTTCGTCG GCCACCGGCG CGGGCAGGTT ATCGTCACCG GCGTGGTGGA AGAGGAAGAA
ACCGGGAGCG GCGACGGCCG CCGCGTCAGG CGCAGGAGCA TGGCCCGTAA CGCCGTGGAC
AATGTCCAGA CGGTTCTCAG GCGCCTGCTG CAGGTGGATC TACGCGACTA CGACATCCAC
TTGAACTTTC CCGGCGGCGT ACCGGTGGAC GGTCCCTCGG CCGGGGTCAG CATGCTGACA
GCCGTTTATT CCGCCCTCAC CGAGACACCG GTCAACAACC TGGTAGCCAT GACGGGAGAA
GTAGCCATTC GCGGTGGTGT GCGACCGGTG GGAGGCGTAG TGGCCAAGGT GGAGGCCGCC
CGCCTGGCCG GGGCGAAGAA AGTGCTCATC CCCAGGGACA ATTGGCAGGA GATCTTCCGT
AGCCTGCCGG GTATCCAAGT GATCCCGGTC CAGAACGTCA ACGAAGTCCT GGCAGAGGCC
CTGTTGCCCC CGGCCCAGGT GGCCAGCCCG GAAAGGTTCA AGGCCCGTCC CCAGATTTTA
AGTGCGGCAC CAAACATTGG CCGGCCGGTG GGCTGCTAG
 
Protein sequence
MTEWGQGLGG VLGFVQIFFA VVIGLYFWNL LRNQQGSRVA VERESRKELE KLQRMREISL 
TEPLAEKTRP STFAEIIGQE EGLKALRAAL CGPNPQHVII YGPPGVGKTA AARLVLEEAK
ANPLSPFKEN AKFVEVDGAT SRFDERGIAD PLIGSVHDPI YQGAGPMGMA GIPQPKPGAV
TRAHGGILFI DEIGELHPIQ INKLLKVLED RKVILESAYY SSEDTNIPSH IHDIFRNGLP
ADFRLVGATT RLPQEIPAAI RSRCQEIYFR PLLPHEIGLI VQNAARKINL HIAPEAVEVI
KQYATNGREA VNMVQLAAGL ALTDRRQEIT RADIEWVVTS GQYTPRMDSK VPPEPQVGVV
NALAVYGPNM GLVVELEASV NFVGHRRGQV IVTGVVEEEE TGSGDGRRVR RRSMARNAVD
NVQTVLRRLL QVDLRDYDIH LNFPGGVPVD GPSAGVSMLT AVYSALTETP VNNLVAMTGE
VAIRGGVRPV GGVVAKVEAA RLAGAKKVLI PRDNWQEIFR SLPGIQVIPV QNVNEVLAEA
LLPPAQVASP ERFKARPQIL SAAPNIGRPV GC