Gene Moth_0394 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0394 
Symbol 
ID3832333 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp399290 
End bp400921 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content49% 
IMG OID637828331 
Producthypothetical protein 
Protein accessionYP_429271 
Protein GI83589262 
COG category[L] Replication, recombination and repair 
COG ID[COG1315] Predicted polymerase, most proteins contain PALM domain, HD hydrolase domain and Zn-ribbon domain 
TIGRFAM ID[TIGR01222] septum site-determining protein MinC 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000249945 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.153411 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGTCA CCTCAAAAGA TAAAATAGAA ATACTTCCAG TTGACGAAAA GCAGGAAGGA 
TACTGGCGCA TAAAGATTAG CGGGGATAAG CTGGCAGCCT TACTGCAAAT GCGACCAGGG
ATAATATTGC ACCGGTGCCT AAAAGATTTG CCGCCAGCCA AGGTACTGCA ACTTGAAGTC
CTGGAGCAAG AGAGCCATTA TCCTCCCTTT ACTCTAGAAG ATCTAGTCCG GGAGCTTAAA
AGTCAAGGTA TCCAGTACGG TATCGATTGG CAGACGTGTG CCCGTCTGGT CGAAGAGCCG
GCAGAGGGGA CATGGACCAT TGCCCGTGGC CAACCGGCCT TCCCAGGTAA GGATGCTACC
GTTCAATTAT TATTCACAAC CCAGGACAGG GTGCCTGTAA CCATCAAAGA AGAAGATCAG
CAGGTTAACT TTCGCGAGCG CTTTCAATTC ACTTCTGTGG AACCCGGCAC GGTCCTGGCC
AGGAAAAAAC CTGTTACCGG TGGTCGGCCA GGGAGGGCCG TGACGGGAGA AATAATTCTG
CCGCCTGAAC CCCGTGAAAT TGAGCTCGTT GCCGGGCAGG GTGCAATTTT AAGTGATAAC
GGTTTAGAAG TGGTTGCCAC CCGGTCTGGC CGGCCTATGG CCAGGAAAAC AAAAGACCGG
GTAACCATTG AAGTAGTCCC TGCTCTGATC CATGACGGCA ATGTTAATCT TTCCTCGGGC
AACATTAACT TCAGTGGCGA TGTAATTGTA ACGGGTCAGG TTGAAGAAGG GATGGCTATC
GAGGCGGGAG GGAATGTGTA TGTAGGGGAT ACCGTATCCC GGGGTATAAT CCGGGCAGGG
GGGTCCATTG AGGTGGCCGG CAATATCTTC GTTTCGGTAC TGGCTGCCGG AGGTATTACC
GCCTTTCAGC AGAGGCTGAG CCCGCTCCTG GCGAGAATTG CTGAAGAGCT AGAACAATTG
ATAACCGCCA TTAAGCAGTT ATTAAGGCAC CCTTCCTTTA AAAAAGATGA CCTTAAGGGC
GGGATCGGCC CCCTGGTACT CCTGCTTTTG GAAAAGAAAT TTCAAGGTCT TTCTCCTGCT
ATAGAGCTAT TACAGAAAGA AGTGCAGAAT TTACAAACAA TATCTTTAGG TGAACCGGAA
GGACTTATAA ATGATTTGGA ACGTCTTACT CGTTCGCCTC TCGCGGTTAA AAATTTAGAC
TACCTGGAAA CGATGTTGCA AAAGGTTGCA GGCTGGCAGG AAGGGATTAG TGCCCCTCTC
CAGGGGAGAG CTGATGTTAC CGTTAATTAT GTTGTTAATT CTACCATCAT GGCTTCGGGT
AATATTAGAG TACTGGGGGA TGGCTGTTAC CACTCCCGGT TACAGGCCGG GAAATCAGTA
ACCATAAACG GTGTTTTCCG TGGTGGTGAG ATTCAGGCCC AGGGTGATGT TTATATAAGA
GAACTTGGTT CGCGTAGCGG TATAGAAACC AGGGTAATCA CCAGGAGTGG AGCTAGAGTG
AAAGCAGGGC ATGTTTTTGA GAATACCTCC GTTCAAATAG GACCACGGGT GTATTCTTTT
GGCCGGGAGG AACAGGGGGT TACTTTATAT CTTGATCGGG AAGGAGAGCT AATCAGAAGT
TACATGAATT GA
 
Protein sequence
MPVTSKDKIE ILPVDEKQEG YWRIKISGDK LAALLQMRPG IILHRCLKDL PPAKVLQLEV 
LEQESHYPPF TLEDLVRELK SQGIQYGIDW QTCARLVEEP AEGTWTIARG QPAFPGKDAT
VQLLFTTQDR VPVTIKEEDQ QVNFRERFQF TSVEPGTVLA RKKPVTGGRP GRAVTGEIIL
PPEPREIELV AGQGAILSDN GLEVVATRSG RPMARKTKDR VTIEVVPALI HDGNVNLSSG
NINFSGDVIV TGQVEEGMAI EAGGNVYVGD TVSRGIIRAG GSIEVAGNIF VSVLAAGGIT
AFQQRLSPLL ARIAEELEQL ITAIKQLLRH PSFKKDDLKG GIGPLVLLLL EKKFQGLSPA
IELLQKEVQN LQTISLGEPE GLINDLERLT RSPLAVKNLD YLETMLQKVA GWQEGISAPL
QGRADVTVNY VVNSTIMASG NIRVLGDGCY HSRLQAGKSV TINGVFRGGE IQAQGDVYIR
ELGSRSGIET RVITRSGARV KAGHVFENTS VQIGPRVYSF GREEQGVTLY LDREGELIRS
YMN