Gene Moth_0133 details

Gene Information

Locus tag: Moth_0133
Symbol:
ID: 3830790
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 126833
End bp: 128800
Gene Length: 1968 bp
Protein Length: 655 aa
Translation table: 11
GC content: 60%
IMG OID: 637828067
Product: endopeptidase La
Protein accession: YP_429015
Protein GI: 83589006
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1067] Predicted ATP-dependent protease
TIGRFAM ID: [TIGR00764] lon-related putative ATP-dependent protease
            [TIGR02903] ATP-dependent protease, Lon family
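
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions are consistent with the listed values but are not stated by the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information fields.
START_BP = 126833
END_BP = 128800
GENE_LENGTH_BP = 1968
PROTEIN_LENGTH_AA = 655


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid count, assuming the last codon is an uncounted stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))   # compare with Gene Length
    print(coding_codons(GENE_LENGTH_BP))   # compare with Protein Length
```

Under these assumptions, 128800 - 126833 + 1 gives the 1968 bp gene length, and 1968 / 3 - 1 gives the 655 aa protein length.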


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.126751
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.379491
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAAAA ATGCCGGAGC AGCAATCAAC ATTAATGACG TGGATTTGCT TCCCATTCTG 
GCCAATGATT TCAATTTAAT GAGTCGCCAG GTGGGAGCCC TTTTTAACGT TCTCTGCGAT
ATCTATGGGA CCGACAAGGT GGTTTTAAAG GCCAGCAAGC TGGAGGCCCT GGACCTGATG
CGCTCGGAAG CTCTGCCAGA GCGAGTCCTT GCCCTGCAGA AGCTGGTCTA CGAGGACCCA
ACCATCACCG ATGTCCCGTC CGAAAGCGAT ATCCCGGCCA TCCTGGTGGC CATCCAGGAG
GAAATAGCCG AGTTTATTGC CCGGCGGACG GTAGAGGATA AACTGGAACA GCGCATCGCG
GAAAAAATGC AGGAGCGCCA CGAGGAGTAT GTCCAGGAGA TCCGGCGCCA GGTTTTAAAG
GAAGGTAGCG GTCCGGAAAA CGCCCAGACC CTTAAAAAAC TGGCTATTCT GGAAAAGCTG
TCCCATACCA GCCTCTCGCG GACAATCATG GAGGCCCTGC GCCCCCGGCG GCTGGAGGAA
ATTGTCGGCC AGGAGCAGGC CGTCCAGTCC ATCCTGGCCA AGCTGGCCTC CCCCTACCCC
CAGCATATGA TTATCTACGG TCCCCCCGGG GTGGGGAAAA CCACAGCCGC CCGACTGGCC
CTGGAAGAGG CCAGGAAAAT CAGCAGCTCA CCCTTTAAAG CCAGCGCACC CTTCGTTGAG
GTTGATGGGA CCACCCTGCG CTGGGATCCC CGGGAGGTAA CCAATCCCCT CCTGGGTTCG
GTTCACGATC CCATCTACCA GGGGGCGAGA CGGGATCTGG CGGAGAACGG CGTTCCCGAA
CCCAAGCTTG GCCTGGTAAC TGAAGCCCAT GGCGGGGTAT TATTTATCGA TGAGATCGGC
GAGATGGACC CCCTTCTTCT CAACAAGCTC CTCAAGGTGT TGGAGGATAA GCGGGTAGAG
TTTGATTCCT CCTATTACGA CCCCAACGAC GAGAGCGTAC CCCAGTATAT CAAGAAGCTC
TTTACCGAGG GAGCACCGGC GGACTTTATC CTCATCGGTG CCACCACCCG GGAACCGGAG
GAGATAAACC CGGCCTTGCG TTCCCGTTGC GCCGAGGTAT TCTTTGAGCC CCTGACCCCG
GCCGACGTAG AGACCATTGT CCGGGAAGGG GCGGGACGGC TGGGGGTAAA ACTGGAACCG
GCCGTACCGG GTCTAATCGC CGAGTACACC ATTGAAGGCC GCAAGGCCAT TAACATCCTG
GCCGAAGCCT ACGGCTTGAG CCTTTACCAG CAACAGCGGA AAAAGGGTCG CCGCCGGCGG
CTGATCACCG TGGCCAACGT CATGCAGGTC ATCCAGAATG CCCGCCTGAC CCCTTACGTC
ACCGTGCGGG CCAAGGACAC GCCGGAGGTG GGCCGGGTGC TGGGCTTGGC CGTAGCCGGC
TTTGTCGGCT CGGTGCTGGA GGTCGAGGCC ATGGCCTTTC CGGCCCGGGA AGCGGGCAAG
GGCAGTATCC GTTTCAACGA GACCGCGGGC AGCATGGCCC GGGATTCCGT TTTTAATGCC
GCTGTCGTCT ATCGCCTCCT GACTGGGGAC GACCTGGCCA ATTACGATGT CCACGTTAAT
GTCGTCGGCG GGGGCAATAT TGACGGCCCC TCGGCCGGCC TGGCCATTAT CGCCGCCATC
ATCAGCGCCA TCCAGGAGCG GCCCGTACGC CAGGATGTAG CCGTTACCGG CGAGATATCT
ATCAGGGGCA AGGTTAAGCC GGTGGGGGGC GTCATGGAGA AGATATATGG CGCCAAACAG
GCGGGAATGA AACTCGTTAT ACTCCCGGCG GAAAATGCGG CGGAGGTACC GGACAGCCTG
CAGGGGATCG CCATCCAGCC GGTAGCCACC GTGGAGGAGG CCCTGGATTA TCTGCTAATG
GGTCCTGGTG AGGGCCGGGG CCGCCGGCGT AACCGGACGG GAGCCTAA
 
Protein sequence
MNKNAGAAIN INDVDLLPIL ANDFNLMSRQ VGALFNVLCD IYGTDKVVLK ASKLEALDLM 
RSEALPERVL ALQKLVYEDP TITDVPSESD IPAILVAIQE EIAEFIARRT VEDKLEQRIA
EKMQERHEEY VQEIRRQVLK EGSGPENAQT LKKLAILEKL SHTSLSRTIM EALRPRRLEE
IVGQEQAVQS ILAKLASPYP QHMIIYGPPG VGKTTAARLA LEEARKISSS PFKASAPFVE
VDGTTLRWDP REVTNPLLGS VHDPIYQGAR RDLAENGVPE PKLGLVTEAH GGVLFIDEIG
EMDPLLLNKL LKVLEDKRVE FDSSYYDPND ESVPQYIKKL FTEGAPADFI LIGATTREPE
EINPALRSRC AEVFFEPLTP ADVETIVREG AGRLGVKLEP AVPGLIAEYT IEGRKAINIL
AEAYGLSLYQ QQRKKGRRRR LITVANVMQV IQNARLTPYV TVRAKDTPEV GRVLGLAVAG
FVGSVLEVEA MAFPAREAGK GSIRFNETAG SMARDSVFNA AVVYRLLTGD DLANYDVHVN
VVGGGNIDGP SAGLAIIAAI ISAIQERPVR QDVAVTGEIS IRGKVKPVGG VMEKIYGAKQ
AGMKLVILPA ENAAEVPDSL QGIAIQPVAT VEEALDYLLM GPGEGRGRRR NRTGA
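
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It is not the tool used to produce this record. It strips the 10-base grouping, translates codon by codon, and stops at the first stop codon. The amino-acid assignments used are those of the standard genetic code, which translation table 11 shares; table 11 differs in the start codons it permits, and this sketch does not handle alternative start codons. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; same as table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# First line of the gene sequence as printed in this record.
FIRST_60_BASES = "ATGAATAAAA ATGCCGGAGC AGCAATCAAC ATTAATGACG TGGATTTGCT TCCCATTCTG"


def clean(seq: str) -> str:
    """Remove whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    print(translate(FIRST_60_BASES))  # compare with the start of the protein sequence
```

Applying `translate` to the full gene sequence and comparing the result with the protein sequence above, or applying `gc_percent` and comparing with the GC content field, is one way to check that the record was copied without damage.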