Gene Moth_0171 details

Gene Information

Locus tag: Moth_0171
Symbol:
ID: 3831111
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 169776
End bp: 171551
Gene Length: 1776 bp
Protein Length: 591 aa
Translation table: 11
GC content: 37%
IMG OID: 637828108
Product: hypothetical protein
Protein accession: YP_429050
Protein GI: 83589041
COG category:
COG ID:
TIGRFAM ID:
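
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows the arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for a complete CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(169776, 171551)
print(length)                  # 1776
print(protein_length(length))  # 591
```

Both values agree with the Gene Length and Protein Length fields of this record.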


Plasmid Coverage information

Num covering plasmid clones: 43
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATGACA AGTTAAGGCG ACTTATAGAC CGCTTATTTA GAAAAATGAT TGTACCTTTC 
CTTGGTGCTG GCGTTAGTTA CAATGCCAAT CCAGATGGAC TAACTAGAAC GCCAGATATG
ATTAGACGTT TGGCACAAGA ATTAATATTA GAAGCTAAAA AATCTACCGA ACTAGCAAAG
TTTTTATTAT GCATCTGTGG CAAACAAGAT GGGACAGAAT CTGATCTTTG CATAGATAAG
TTATGTAATG CTGGTATGGC TGTTCTTGCA GAAGTTTATA TACATCTACA TAATGAAGTT
AAAGCCTGTA ATATTTTAAG GGTTGCCGAG TTTTGTGATT TAGAACCTAC CCCGACGCAT
TGCTATATTG CTTACATAGC ACGGGAAGGC TATATAAACG AGATTATTAC TACGAATTAC
GATACCTGCA TGGAAAAGGC TTATGAAAAA AGCTTTAATA GGAATTTTAC TAGTCGACAG
GTAAGAGTGG TAACGAATTT ATCTGAATAC CGTCATTTTA TCGATGATAG TAATCCCAAA
TACCCACTTC TACATATTTA TAAAATTAAT GGTTGTGCTA AAAAGTTTAA AGAAGATCCC
CAAAACGAAG CTGCTAATAT TGTTCTTACA GAACGCCAAT TGCAGAACTG GCGCGAAAAT
AAATGGGCCC AGGACATGTT CCGTGACCGA TGCCGTACGC GGTCGATATT ATTTTCAGGT
TTTGGAAGCG AGGAACCTCA GATAAGGCAT ACTGTTTTAC AGGTGATGGA TGAATTTATA
AACAACGGGC AGAATTCAGG TAATAATGTA GAGGTATGGG ATTATGAGAA CTCTCCGTTT
ATTGCTGAAT GGGGTGAAAT GACTTTTTAC CAGTTTCAAA TTTTGAGTTC TTACTTGAGA
GCCCACGGGT TAACTCAAAT TACTATTAAT GAAGTCCAAG AAGGAGCTTT TACTAAAAGT
GATAAGGAGC TTTTTGACCG TTACCTACCT TGGAATGGTA AAAACCTTGA TATGGAGCAG
TTTTGGGGCG TTGTTTATTT ACTATTTATA CAGCGATTAT TATCTGAAAA ATATTTTTCC
CCAGGGAGTA AATTTAGCAA TTATCTCCCT ATAAGCCTTT CAATTCAACA ACTTTTATTA
CAAGAATTTC GCCAGGAGAT CTTGGGCCAG GGAGGTAATG AGTTTAAATT ACAGGATCTT
TTTATTTGGC ATGAGAACGC CAGTTATCTC CAAGTCTCAG CTTTGCATCA TAGGATACTT
TTTCCGGGGG AAGAACTTGA GCCATATAAT TATTGCGCTT TTCGCGATGA ACCGGTGGCT
CTTTGCATGC TCTACTTCCT CTGGTGGCTT TGCTATCGTG CTTATAATTA TAAAGATAAA
AGCTGGTTGC TTCCTTGTGC AGAAGGAAAA GCATTAGTAC GAATATTTGT TGCTGAAGAT
AATAAGATCA AAATTCCGGT ATACGTTTGC GGTAGGGATG GCTATTATAA TTTAAATCAA
TTGGTTATGG GTTATGAGCC ACCTTATTCT TTGGGCGTAA TTTTTGTTCT AAATGGATAT
GGTCTTGAAC CTAAAGTATT TCCCATATCC AATATAATAG ATAGCCAGTT AGATATAATC
CAATTTTATA TAATCCCCGA TGTTTATTGT TTTCGTAATT CATGCCAGAG CCTGGAGGAT
ATATTAGAGG GTATAAGAAA TCTTATTACA GACCCTGCTA AAATAAAAAA AGAAGCAGTT
CATTGGAAGA AATGGGTGGA GGAAATCTAT CAATGA
 
Protein sequence
MHDKLRRLID RLFRKMIVPF LGAGVSYNAN PDGLTRTPDM IRRLAQELIL EAKKSTELAK 
FLLCICGKQD GTESDLCIDK LCNAGMAVLA EVYIHLHNEV KACNILRVAE FCDLEPTPTH
CYIAYIAREG YINEIITTNY DTCMEKAYEK SFNRNFTSRQ VRVVTNLSEY RHFIDDSNPK
YPLLHIYKIN GCAKKFKEDP QNEAANIVLT ERQLQNWREN KWAQDMFRDR CRTRSILFSG
FGSEEPQIRH TVLQVMDEFI NNGQNSGNNV EVWDYENSPF IAEWGEMTFY QFQILSSYLR
AHGLTQITIN EVQEGAFTKS DKELFDRYLP WNGKNLDMEQ FWGVVYLLFI QRLLSEKYFS
PGSKFSNYLP ISLSIQQLLL QEFRQEILGQ GGNEFKLQDL FIWHENASYL QVSALHHRIL
FPGEELEPYN YCAFRDEPVA LCMLYFLWWL CYRAYNYKDK SWLLPCAEGK ALVRIFVAED
NKIKIPVYVC GRDGYYNLNQ LVMGYEPPYS LGVIFVLNGY GLEPKVFPIS NIIDSQLDII
QFYIIPDVYC FRNSCQSLED ILEGIRNLIT DPAKIKKEAV HWKKWVEEIY Q
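
The gene and protein sequences above are printed in blocks of ten characters. As a minimal sketch of how the two relate, the code below strips the spacing and translates the DNA codon by codon. It assumes the standard amino acid assignments, which translation table 11 shares with the standard code as far as I know (the tables differ in which start codons are permitted). The names `clean` and `translate` are hypothetical helpers, not functions from any library.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(dna), 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence as listed (60 bases, 20 codons).
first_line = "ATGCATGACA AGTTAAGGCG ACTTATAGAC CGCTTATTTA GAAAAATGAT TGTACCTTTC"
print(translate(first_line))  # MHDKLRRLIDRLFRKMIVPF
```

The output matches the first twenty residues of the protein sequence. Passing the full gene sequence to `translate` in the same way gives a quick consistency check against the listed protein.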