Gene Moth_0531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0531 
Symbol 
ID3830916 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp551366 
End bp553672 
Gene Length2307 bp 
Protein Length768 aa 
Translation table11 
GC content56% 
IMG OID637828472 
ProductLon-A peptidase 
Protein accessionYP_429404 
Protein GI83589395 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0466] ATP-dependent Lon protease, bacterial type 
TIGRFAM ID[TIGR00763] ATP-dependent protease La 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000877407 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.534279 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCGTCGAG CCCTGCCCCT GTTGCCCCTG CGCGGGGTTA TTGTTTTTCC CTACACGGTT 
ATTCACCTGG ATATAGGGCG GGAACGCTCG GTAAGTGCCA TTGAAGCTGC CATGCTTGGT
GATCGGGTCA TTTTCCTGGC CATGCAAAAA GAGGCTCAGG ACGACGACCC CGGCGAAGAT
GATATCTATA CTACCGGAAC CATAGCTGAG ATTAAACAAT TATTGAAATT ACCTGGAGGC
ACCATCAGGA TCCTGGTCGA AGGTATCCGC AGGGGGGAAA TTAAGGAGTA TATCAGCCAT
GATCCCTTCC TCAAGGTAGA AGTCGAAGAG GCTCCGGAGC CGGCAGAGAC CTCCCCGGAG
ATCGAGGCCC TGATGCGTTG CCTGATTGAC GAGTTTGAAA CCTATGTCAA GATGGCCAAA
AAGATACCTC CGGAAACGGT GGTAGCCGTC GTCAGCCTGG AGGAACCGGG GCGTCTGGCC
GATGTGGTGG CCTCCCACCT CAACCTCAAA TTGACGGATA AACAGGCCGT CCTGGAGGCT
GTGGATATCA AAACGCGGCT GAATATCCTC TGCGATATCC TGGCCAAGGA AAAAGAAATC
CTGGAACTGG AGCGGAAGAT CAGCCTGCGG GTACGCAAGC AGATGGAAAA AGCCCAAAAG
GAGTACTACC TGCGGGAGCA GATCAAGGCT ATCCAGAAGG AACTTGGCGA GAAAGACGAC
CGTGTGGCCG AGGCCGAGGA ACTGCGGGAG AGGATAGCTA AAGCCAGGCT GCCTAAAGAG
ATCCGGGAAC GCGCCCTGAA AGAGGTTGAA AGGCTGGAGA AAATGCCACC CATGGTGGCG
GAGGTAACCG TCGTCCGCAA CTACCTGGAC TGGATCCTGG CCCTGCCCTG GCACAAGCAG
ACCAGGGACC GCCTGGATAT CAAGGTAGCC GAGGAGATCC TGGACGAAGA TCACTACGGT
TTAAAGGAAG TTAAGGAGCG TATCCTGGAA TACCTGGCCA TTCGCCAGCT GGCCAAAAAG
ATGCGGGGCC CCATCCTCTG TTTTGTGGGA CCGCCTGGGG TGGGTAAGAC CTCCTTGGCC
AAATCCATCG CCCGGGCCCT GCAGCGCAAG TTTGTCCGTA TCTCCCTTGG AGGTACCAGG
GACGAAGCCG AGATTCGCGG CCACCGGCGG ACCTATGTGG GCGCCCTGCC CGGACGGATT
ATCCAGGGCA TGAAACAGGC GGGGACGAAG AATCCGGTCT TTTTATTGGA TGAGATTGAT
AAGTTGAGCA GCGATTTCCG GGGCGATCCC GCCTCGGCGC TGCTGGAAGT CCTGGACCCG
GAACAAAACT ATATGTTTAG CGATCATTAT ATTGAAGCTC CCTTCGACCT CTCCAAGGTA
ATGTTCATTA CCACGGCCAA TGTCGAATAC TCGATCCCCC GTCCCCTCCT GGATAGGATG
GAAGTTATCC GTATTCCCGG CTACACCGAG GAAGAAAAGG TCAAGATTGC TGAACTGCAC
CTGCTGCCCA AGCAGCTTGA GGAACACGGC CTTAAGAAGC AGCAACTGGA AGTATCGGAA
AACGCCTTGC GGCGGATTGT CCGGGAGTAT ACCCGGGAGG CCGGCGTCCG AAACCTGGAA
CGGGAAATCG CCACCATCTG CCGTAAGACC GCCCGGGACA TCGTCAGCGG TAAAACCAAA
GCCGTTAAAG TAACGGCCAA CAATGTGGAG CAATACCTTG GTATTCCTCG TTTTCATCAT
ACGCAAGCCA TCCGGAATGA GATGGTGGGT GTGGTCAACG GCCTGGCCTG GACGGAGGTT
GGCGGCGAGG TCCTGAATGT CGAGGTGTCT ATCCTGAAGG GGAAAGGCAA CCTGACCCTG
ACGGGAAAAC TGGGCGACGT CATGAAGGAA TCCGCCTACG CCGGTTTCAG CTACCTCCGC
TCCCGGGCCG CCGAACTGGG CCTGGAGGAA GACTTCCACG AGAAGTTTGA CCTGCATATC
CACGTTCCCG AGGGTGCCAT CCCCAAGGAC GGGCCTTCGG CGGGCATCAC CATGGCTACG
GCCATGGCCT CAGCCCTAAA GGGCGTACCG GTGCGGAGCG ACCTGGCCAT GACCGGGGAA
ATCACCCTGC GCGGCCGGGT ACTGCCGGTG GGAGGTATTA AAGAAAAGAT TTTGGCCGCC
CACCGGGAAG GGATTAAAAA CATCATCTTG CCCCGGGAGA ACGAGAAAAA CCTGGAAGAC
ATCCCGGCCA ACATCAAGCG CAAGATGAAC TTTATCCTGG TCGAGCACAT GGACGAAGTT
CTGAAAGAAG CCCTGGGTAA TAACTAG
 
Protein sequence
MRRALPLLPL RGVIVFPYTV IHLDIGRERS VSAIEAAMLG DRVIFLAMQK EAQDDDPGED 
DIYTTGTIAE IKQLLKLPGG TIRILVEGIR RGEIKEYISH DPFLKVEVEE APEPAETSPE
IEALMRCLID EFETYVKMAK KIPPETVVAV VSLEEPGRLA DVVASHLNLK LTDKQAVLEA
VDIKTRLNIL CDILAKEKEI LELERKISLR VRKQMEKAQK EYYLREQIKA IQKELGEKDD
RVAEAEELRE RIAKARLPKE IRERALKEVE RLEKMPPMVA EVTVVRNYLD WILALPWHKQ
TRDRLDIKVA EEILDEDHYG LKEVKERILE YLAIRQLAKK MRGPILCFVG PPGVGKTSLA
KSIARALQRK FVRISLGGTR DEAEIRGHRR TYVGALPGRI IQGMKQAGTK NPVFLLDEID
KLSSDFRGDP ASALLEVLDP EQNYMFSDHY IEAPFDLSKV MFITTANVEY SIPRPLLDRM
EVIRIPGYTE EEKVKIAELH LLPKQLEEHG LKKQQLEVSE NALRRIVREY TREAGVRNLE
REIATICRKT ARDIVSGKTK AVKVTANNVE QYLGIPRFHH TQAIRNEMVG VVNGLAWTEV
GGEVLNVEVS ILKGKGNLTL TGKLGDVMKE SAYAGFSYLR SRAAELGLEE DFHEKFDLHI
HVPEGAIPKD GPSAGITMAT AMASALKGVP VRSDLAMTGE ITLRGRVLPV GGIKEKILAA
HREGIKNIIL PRENEKNLED IPANIKRKMN FILVEHMDEV LKEALGNN