Gene Moth_1541 details


Gene Information

Locus tag: Moth_1541
Symbol:
ID: 3831927
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1584022
End bp: 1585374
Gene Length: 1353 bp
Protein Length: 450 aa
Translation table: 11
GC content: 62%
IMG OID: 637829473
Product: peptidase U62, modulator of DNA gyrase
Protein accession: YP_430393
Protein GI: 83590384
COG category: [R] General function prediction only
COG ID: [COG0312] Predicted Zn-dependent proteases and their inactivated homologs
TIGRFAM ID:
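
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. The function names are hypothetical and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Moth_1541 record.
length = gene_length(1584022, 1585374)
print(length)                  # 1353
print(protein_length(length))  # 450
```

Both results agree with the Gene Length and Protein Length fields of the record.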


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0734056
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATTACC AGGAACTAGA GAAAAAATAC CTGGACCTGG CCGGTCAGGT GGTCGAGAAA 
GCGGCCAAAC GAGGAGTCCT GGCAGAAGCC TACCTTACTG CCGGGGAAGA ACTGAGCATT
GAGGTCCGGG ACCAGCAGGT CGAGGCCCTG ACCACAGCCC GGGATCAGGG CCTGGGCTTA
AGGGTTATCC GGGACCACCG GGTAGGTTTC GCCTTTACCA CCGACTTCAG CCCGGCGGCC
CTCGACGCCT GCATCGAACA GGCCCTGGCC AACGCCCGGA TGGCCACTCC TGATGAGCAC
AACTGCCTGC CGGCCCGCTA TCCCGGTTAT CCGGCCCTGG ACCTCTGGGA TCCCGAGATT
ACCGCTACGC CCCTGGAGAA AAAGATTGAG TTAGCCAAAG AGATTGAGCG CCAGGCCAGG
GCCTATGACC CGCGGGTCAA GATAACGGAA AGTTGTTCCT ATAATGACTC CCGCTACCTG
GTGGCCCTGG CGAACTCCCA GGGAATAACG GCAGCCTATC ACGCTGCCAA CTGTGGCGCC
AGCACCTTTG TGGTGGCAGT AGAAAATGGA GAAAGCCAGA CCGGCTTCGG CCTGGCCTAC
GGGTTGAAGT TCAAAAACAT CGACCCTGCC AAGGTGGGCC GGGAGGGGGC CAGCAAGGCC
GTACGCATGC TGGGGGCCAA AAGGGTCAAT ACCCAGCGGG CAGCGGTTGT CTTTGACCCT
TACGTGGCCA CCAACTTCCT GGGCGTCATC GCCCCGGCCC TGGCCGCCGA TGCCGTCCAG
AAGGGCAAAT CCCTCTTCCG CGGCCGGGTC GGCCAGCAGG TAGCCGCGCC GGTGATCAAC
CTCATCGACG ACGGTTGCCG GCCGGACGGC ATTGCCTCCA GCCCCTTTGA CGGGGAAGGG
GTGCCCACGG AACATACCGT CCTGATTGAA AAGGGCGTTT TGCGGTGCTT CCTCCATAAT
ACCTACACCG CCGCCCGGGA CGGGGTGAGA TCCACCGGTA ACGGTGCCCG GGGTTCCTTC
AAGACCACGC CTGAAGTCGG CACCACCAAT TTCTATATCG AGGCCGGATC GCGCTCGCCG
GAAGAAATCA TCAAGGAGAT TCCAAAGGGT CTCTACGTCA CTGAGGTCAT GGGCATGCAC
ACGGCCAACC CCATTTCCGG GGATTTCTCC GTCGGCGCCA CCGGTATCTG GATCGAAAAG
GGCGAGTTGA CCACGCCGGT GCGGGGGGTG GCCATTGCCG GAAACATCAT TGGCCTCCTG
GAGGCCATTG ACGCCGTGGC CAACGACCTG ACCTTTTTCG GTGCCACCGG CGCCCCCACC
ATCCGGATCG CCAGCATGAC CATCAGCGGC TAA
 
Protein sequence
MDYQELEKKY LDLAGQVVEK AAKRGVLAEA YLTAGEELSI EVRDQQVEAL TTARDQGLGL 
RVIRDHRVGF AFTTDFSPAA LDACIEQALA NARMATPDEH NCLPARYPGY PALDLWDPEI
TATPLEKKIE LAKEIERQAR AYDPRVKITE SCSYNDSRYL VALANSQGIT AAYHAANCGA
STFVVAVENG ESQTGFGLAY GLKFKNIDPA KVGREGASKA VRMLGAKRVN TQRAAVVFDP
YVATNFLGVI APALAADAVQ KGKSLFRGRV GQQVAAPVIN LIDDGCRPDG IASSPFDGEG
VPTEHTVLIE KGVLRCFLHN TYTAARDGVR STGNGARGSF KTTPEVGTTN FYIEAGSRSP
EEIIKEIPKG LYVTEVMGMH TANPISGDFS VGATGIWIEK GELTTPVRGV AIAGNIIGLL
EAIDAVANDL TFFGATGAPT IRIASMTISG
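
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first line of the gene sequence. This is a minimal sketch, not the pipeline used to produce the record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as starts). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here for illustration.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: first base varies slowest, bases ordered T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used in the record) is ignored.
    Codons containing unexpected characters are translated as 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record (60 bases, 20 codons).
first_line = "ATGGATTACC AGGAACTAGA GAAAAAATAC CTGGACCTGG CCGGTCAGGT GGTCGAGAAA"
print(translate(first_line))  # MDYQELEKKYLDLAGQVVEK
```

The output matches the first 20 residues of the protein sequence. The final codon of the gene sequence, TAA, is a stop codon, which is why the 1353 bp gene yields 450 residues rather than 451.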