Gene Moth_2236 details


Gene Information

Locus tag: Moth_2236
Symbol:
ID: 3831282
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2331989
End bp: 2334037
Gene Length: 2049 bp
Protein Length: 682 aa
Translation table: 11
GC content: 53%
IMG OID: 637830156
Product: putative ATP-dependent Lon protease
Protein accession: YP_431066
Protein GI: 83591057
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4930] Predicted ATP-dependent Lon-type protease
TIGRFAM ID: [TIGR02653] conserved hypothetical protein
            [TIGR02688] conserved hypothetical protein TIGR02688
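
The start, end, gene length, and protein length values in this record are consistent with one another. The sketch below shows the arithmetic. It assumes that the coordinates are 1-based and inclusive and that the CDS ends with a single untranslated stop codon; the record itself does not state either convention. The function names are illustrative only.

```python
# Coordinates copied from the record above.
START_BP = 2331989
END_BP = 2334037


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming inclusive start and end coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming one trailing stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2049 682
```

Both results match the Gene Length and Protein Length fields listed above.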


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000058517
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGAGCTTG ATGCCCTCGA TAAATTGGCA GCTTCTGTAT TTGACGGGTA TATAGTTCGG 
AAGGACCTGG TGCGCAAATA CAGCCGGCAG TATCCTGTAC CCACCTACGT GGTAGAATTC
CTGCTCGGCC GGTATTGTGC AACTGTCGAT GAAAAGGAGA TCGAGGAAGG CCTGGAGATT
GTTGAGAGGC AATTGCGGGA TCGCACTGTC AGGACCGGTG AAGAAGAACT TTTTAAGGCC
CGGGCAAGGG AGATCGGGTC GATCAAAATA ATCGATCTCA TCAAGGCTCG ACTTGACACC
AAAAACGATT GTTTTGTTGC CCAACTGCCG AGCCTGGGGC TGAAAGACGT CCGGATTGAC
GACGCCCTGG TGCATGAAAA TGAGCGCATG CTTACTGATG GTTTTTATGC CGAAGTCACC
CTCGTCTATG ATGCCACCAT TGCCCAGGAG AAGAACGGTC GTCCTTTCGC CATTGAAAAC
TTGCGGGCCA TTCAACTTTC TAAAGTGGAT GCCCTGGCAG CGCTGCAGCG AGGGAGGAGC
CAGTTTACCA CTGACGAATG GAAACGCCTC CTGATACGCT CTGTAGGTTT GGAACCGGAT
ACCCTTTCAG AACGGGCTCA GGATATCGCC CTGCTGCGGA TGGTGCCTTT TGTAGAGCGA
AACTACAACC TGGTGGAAAT AGGCCCCCGG GGCACGGGCA AAAGCCATTT GTTTCAACAA
ATCTCCCCAT ACTCCCATCT GATTTCAGGC GGTAAAGCCA CAGTGGCCAA AATGTTTGTG
AATAATGCCA CCGGACAGCG GGGGCTTGTC TGCCACTATG ATGTCGTGTG CTTTGATGAA
GTGTCCGGCA TATCCTTCGA TCAGAAGGAC GGCGTCAACA TTATGAAGGG GTACATGGCA
TCGGGCGAAT TCTCCCGCGG CAAGGAGAGT ATCCGTGCTT CCGGCGGCAT TGTAATGCTC
GGGAACTTTG ATGTGGATGT CCAGCAACAG CAACGCATCG GCCACTTATT CAGTCCACTC
CCGCCGGAGA TGCGGGATGA TACGGCTTTC ATGGACCGCA TTCACGCCTA TGTTCCAGGG
TGGGAGTTTC CCAAACTCAA CCCCAATATC CATCTTACGG ATCATTTTGG CTTGGTCAGC
GACTTTCTCT CCGAATGCTG GCATAGACTG CGTGATGGTA GCCGGGTTTC CGTGCTCCAG
GGCCGGGTTA ACTGGGGTGG AGCCCTCAGC GGTCGCGATA TTGAAGCCGT TCATAAAACC
GTTAGCGGCC TGATCAAGCT ACTTTTCCCC GATCCGGAGA TGCCGATACC TGATGAAGAG
CTAGAAAAGA TCGTCCGTTT GGCCCTGGAA TCGCGTCGAA GGGTAAAGGA ACAGCAAAAG
CGCTGCCTTA AGACGGAATT TCGCAATACT CACTTCAGCT TCTCTATGGG CGTGGAGGGG
GTGGAACAGT TTGTTGCCAC GCCGGAACTC CACAGTGATG AAACCATCGA CAGCGATCCC
CTGCCTCCCG GGCAGGTGTG GGCCATCAGT CCCGGCGGCC AGGACGCTTC ACCAGCTCTC
TATCGAATAG AGGTTGCGGC GGGTCCTGGC AGTGGAGTAA AAATTCTTAA CGCACCTGTC
CCGCCGGCAT TTCGGGAAAG CGTTCGTTAC GGAGAACAAA ACCTGTACGT CAGGGCTAAA
GAGCTGGTTG GTGACCGCGA TCCGCGCGCC CGTGAGTTTT CAATCCAATT ACGCGCTATG
GACGTGGAAC GTTCAGGCCA GGGACTTGGT TTACCAGTGC TGATTGCACT TTGCGGCGCT
CTAATTGAGC GAAGCGTTAA GGGTGGATTG ATCATAGTCG GAGCCTTAAA CCTTGGTGGC
TCAATTGAGA TGATACCAAA TCCGGTTGCT GTGGCCGAAC TGGCCCTCGA GAAAGGAGCA
ACGACGCTGT TAATGCCTAT ATCTTCTCGA AGGCAATTGT TTGATCTTCC TGACGAAATG
GCTACGAAGA TCAACATCGA ATTTTATGCT GATGCAACGG ATGCTTTTGT TAAGGCGATT
GTTGACTAA
 
Protein sequence
MELDALDKLA ASVFDGYIVR KDLVRKYSRQ YPVPTYVVEF LLGRYCATVD EKEIEEGLEI 
VERQLRDRTV RTGEEELFKA RAREIGSIKI IDLIKARLDT KNDCFVAQLP SLGLKDVRID
DALVHENERM LTDGFYAEVT LVYDATIAQE KNGRPFAIEN LRAIQLSKVD ALAALQRGRS
QFTTDEWKRL LIRSVGLEPD TLSERAQDIA LLRMVPFVER NYNLVEIGPR GTGKSHLFQQ
ISPYSHLISG GKATVAKMFV NNATGQRGLV CHYDVVCFDE VSGISFDQKD GVNIMKGYMA
SGEFSRGKES IRASGGIVML GNFDVDVQQQ QRIGHLFSPL PPEMRDDTAF MDRIHAYVPG
WEFPKLNPNI HLTDHFGLVS DFLSECWHRL RDGSRVSVLQ GRVNWGGALS GRDIEAVHKT
VSGLIKLLFP DPEMPIPDEE LEKIVRLALE SRRRVKEQQK RCLKTEFRNT HFSFSMGVEG
VEQFVATPEL HSDETIDSDP LPPGQVWAIS PGGQDASPAL YRIEVAAGPG SGVKILNAPV
PPAFRESVRY GEQNLYVRAK ELVGDRDPRA REFSIQLRAM DVERSGQGLG LPVLIALCGA
LIERSVKGGL IIVGALNLGG SIEMIPNPVA VAELALEKGA TTLLMPISSR RQLFDLPDEM
ATKINIEFYA DATDAFVKAI VD
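
The protein sequence corresponds to the gene sequence read codon by codon. A minimal sketch of that mapping follows, using only the first three and the last block of the gene sequence shown above. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, which does not matter here because this gene starts with ATG). The helper names are hypothetical.

```python
from itertools import product

# Standard codon table in the conventional T, C, A, G ordering
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out 10-base blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks and the final block of the gene sequence above.
print(translate("ATGGAGCTTG ATGCCCTCGA TAAATTGGCA"))  # MELDALDKLA
print(translate("GTTGACTAA"))  # VD
```

The two outputs match the first ten and the last two residues of the protein sequence above.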