Gene Moth_1187 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1187 
Symbol 
ID3832990 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1218622 
End bp1220232 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content57% 
IMG OID637829120 
ProductRNA-metabolising metallo-beta-lactamase 
Protein accessionYP_430044 
Protein GI83590035 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1236] Predicted exonuclease of the beta-lactamase fold involved in RNA processing 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.177034 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTAAAT TACAGTTTTG CGGCGCTGCT GGAACGGTTA CCGGTTCCTG TTACCTCCTT 
GATACCGGCC GCTACCGCTT ACTAATTGAC TGTGGCCTGT TTCAAGGCTC CAAAGCCATC
AAGGAGCGTA ACTACGGTCC CTTTCCCTTC AATCCCGGCG CGATCGACGC CCTGATCCTG
ACCCATGCCC ATATCGACCA CTGTGGCCTC ATACCCAAGC TCTACCTCCA GGGGTTCAAG
GGGCCTATCT TCACAACTCC GGTTACCGCC GAGCTGGCCC GGGTCCTTCT ACCAGATTGC
GGGCATATCC AGGAGATGGA GGTCGAACGC AAAAACCGCA AGAACAAACG CGCCGGCCTG
CCTATATTAA CACCTATTTA TACTGCGGCC CAGGCGGCTG CCTGCCTGGA TTTCTTCCGG
ACCCTGGACT ATCGGGAAGA GCAGGAGATC CTGCCCGGTG TCCGCCTGCG TCTCCAGGAC
GCCGGCCATA TCCTGGGCTC GGCTATAGTC GAGCTCTGGG TCAAGGACAT GGCTGGTGAA
ATTAAGATTA CCTTTTCCGG TGACCTGGGT AATCCTGGCC AGCCCATCGT CAATGACCCA
ACTCCTATCG CAAGCACCGA TTACCTGGTG ATAGAATCCA CCTACGGTAA TCGCCGTCAT
AATATCCAGG GAGATAAAAT CGAGCTCCTC AAAGAGGTTA TCCTGGCGAC CATGAAAAAG
GGAGGTAACC TGATTATCCC GGCCTTTGCC GTGGAACGTA CCCAGGACCT CCTGTACGCC
CTGAATGTCA TCCTGCAACA GGGTGCCGTC AAGGTAGACA AAATCTACCT GGATAGTCCC
CTGGCGGTTG CCGCTACGGA GATCTTTTGC CGCCACCAGG ATTACTTTGA TGCTGAAACC
AAAGATCTAA GCCATAACGG CAGTACTTGC CCCTTTTATT TACCTGGTAT GCACCTCAGC
CGGACGGCGG AAGAATCCAT GGCTATTAAT AAAATCCACG GCGGGGCCAT TATAATCTCT
GCCAGCGGCA TGGCCGACGC CGGCCGGATT AAACACCACC TGAAGCACAA CCTCTGGCGG
CCGGAGGCTA CCGTTCTCCT GGTTGGTTAC CAGGCGGCAG GTACCCTGGG ACGGCGCCTG
CTGGAAGGGG AGAAACGGGT CCGTATCCAC GGCGAGGAAA TCGCTGTCCG GGCGGATATT
GTCAGTATCG ACGGCTTCTC GGCCCACGCC GACCAGGCCG GCCTCCTCAA CTGGGTAAAA
TCCTTCCGCC AGCCGCCGCG AAAAGTATTT GTGACCCATG GTGAAAAAGA AGCAGCTGAG
GATTTTGCCC GCCTGCTGAC TACCGAACTG GGCCTGGCCA CGGAAGTGCC GGGATGGCTG
GATACCGTCC AGCTACTCCC AATAGCGGCA GAGCTTGCCC CGGCACCGGC AGCTGCCCTC
AGAGATACCA GTACAGCTGC AGAGGCAGAG GCTGCCTACC AGCGACTCCT GGTCCAACTC
AAGGCGCTGG TGGAGGCCGG TTTTGCCCGC CAGGACTATG CCGGTACCAA ACGCAGGCTG
GAACAAATAG CGGCCCTGGT AAACCAGGGC CTGGACGAAA AAGCCGGCTA G
 
Protein sequence
MIKLQFCGAA GTVTGSCYLL DTGRYRLLID CGLFQGSKAI KERNYGPFPF NPGAIDALIL 
THAHIDHCGL IPKLYLQGFK GPIFTTPVTA ELARVLLPDC GHIQEMEVER KNRKNKRAGL
PILTPIYTAA QAAACLDFFR TLDYREEQEI LPGVRLRLQD AGHILGSAIV ELWVKDMAGE
IKITFSGDLG NPGQPIVNDP TPIASTDYLV IESTYGNRRH NIQGDKIELL KEVILATMKK
GGNLIIPAFA VERTQDLLYA LNVILQQGAV KVDKIYLDSP LAVAATEIFC RHQDYFDAET
KDLSHNGSTC PFYLPGMHLS RTAEESMAIN KIHGGAIIIS ASGMADAGRI KHHLKHNLWR
PEATVLLVGY QAAGTLGRRL LEGEKRVRIH GEEIAVRADI VSIDGFSAHA DQAGLLNWVK
SFRQPPRKVF VTHGEKEAAE DFARLLTTEL GLATEVPGWL DTVQLLPIAA ELAPAPAAAL
RDTSTAAEAE AAYQRLLVQL KALVEAGFAR QDYAGTKRRL EQIAALVNQG LDEKAG