Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0171 |
Symbol | |
ID | 3831111 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 169776 |
End bp | 171551 |
Gene Length | 1776 bp |
Protein Length | 591 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 637828108 |
Product | hypothetical protein |
Protein accession | YP_429050 |
Protein GI | 83589041 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATGACA AGTTAAGGCG ACTTATAGAC CGCTTATTTA GAAAAATGAT TGTACCTTTC CTTGGTGCTG GCGTTAGTTA CAATGCCAAT CCAGATGGAC TAACTAGAAC GCCAGATATG ATTAGACGTT TGGCACAAGA ATTAATATTA GAAGCTAAAA AATCTACCGA ACTAGCAAAG TTTTTATTAT GCATCTGTGG CAAACAAGAT GGGACAGAAT CTGATCTTTG CATAGATAAG TTATGTAATG CTGGTATGGC TGTTCTTGCA GAAGTTTATA TACATCTACA TAATGAAGTT AAAGCCTGTA ATATTTTAAG GGTTGCCGAG TTTTGTGATT TAGAACCTAC CCCGACGCAT TGCTATATTG CTTACATAGC ACGGGAAGGC TATATAAACG AGATTATTAC TACGAATTAC GATACCTGCA TGGAAAAGGC TTATGAAAAA AGCTTTAATA GGAATTTTAC TAGTCGACAG GTAAGAGTGG TAACGAATTT ATCTGAATAC CGTCATTTTA TCGATGATAG TAATCCCAAA TACCCACTTC TACATATTTA TAAAATTAAT GGTTGTGCTA AAAAGTTTAA AGAAGATCCC CAAAACGAAG CTGCTAATAT TGTTCTTACA GAACGCCAAT TGCAGAACTG GCGCGAAAAT AAATGGGCCC AGGACATGTT CCGTGACCGA TGCCGTACGC GGTCGATATT ATTTTCAGGT TTTGGAAGCG AGGAACCTCA GATAAGGCAT ACTGTTTTAC AGGTGATGGA TGAATTTATA AACAACGGGC AGAATTCAGG TAATAATGTA GAGGTATGGG ATTATGAGAA CTCTCCGTTT ATTGCTGAAT GGGGTGAAAT GACTTTTTAC CAGTTTCAAA TTTTGAGTTC TTACTTGAGA GCCCACGGGT TAACTCAAAT TACTATTAAT GAAGTCCAAG AAGGAGCTTT TACTAAAAGT GATAAGGAGC TTTTTGACCG TTACCTACCT TGGAATGGTA AAAACCTTGA TATGGAGCAG TTTTGGGGCG TTGTTTATTT ACTATTTATA CAGCGATTAT TATCTGAAAA ATATTTTTCC CCAGGGAGTA AATTTAGCAA TTATCTCCCT ATAAGCCTTT CAATTCAACA ACTTTTATTA CAAGAATTTC GCCAGGAGAT CTTGGGCCAG GGAGGTAATG AGTTTAAATT ACAGGATCTT TTTATTTGGC ATGAGAACGC CAGTTATCTC CAAGTCTCAG CTTTGCATCA TAGGATACTT TTTCCGGGGG AAGAACTTGA GCCATATAAT TATTGCGCTT TTCGCGATGA ACCGGTGGCT CTTTGCATGC TCTACTTCCT CTGGTGGCTT TGCTATCGTG CTTATAATTA TAAAGATAAA AGCTGGTTGC TTCCTTGTGC AGAAGGAAAA GCATTAGTAC GAATATTTGT TGCTGAAGAT AATAAGATCA AAATTCCGGT ATACGTTTGC GGTAGGGATG GCTATTATAA TTTAAATCAA TTGGTTATGG GTTATGAGCC ACCTTATTCT TTGGGCGTAA TTTTTGTTCT AAATGGATAT GGTCTTGAAC CTAAAGTATT TCCCATATCC AATATAATAG ATAGCCAGTT AGATATAATC CAATTTTATA TAATCCCCGA TGTTTATTGT TTTCGTAATT CATGCCAGAG CCTGGAGGAT ATATTAGAGG GTATAAGAAA TCTTATTACA GACCCTGCTA AAATAAAAAA AGAAGCAGTT CATTGGAAGA AATGGGTGGA GGAAATCTAT CAATGA
|
Protein sequence | MHDKLRRLID RLFRKMIVPF LGAGVSYNAN PDGLTRTPDM IRRLAQELIL EAKKSTELAK FLLCICGKQD GTESDLCIDK LCNAGMAVLA EVYIHLHNEV KACNILRVAE FCDLEPTPTH CYIAYIAREG YINEIITTNY DTCMEKAYEK SFNRNFTSRQ VRVVTNLSEY RHFIDDSNPK YPLLHIYKIN GCAKKFKEDP QNEAANIVLT ERQLQNWREN KWAQDMFRDR CRTRSILFSG FGSEEPQIRH TVLQVMDEFI NNGQNSGNNV EVWDYENSPF IAEWGEMTFY QFQILSSYLR AHGLTQITIN EVQEGAFTKS DKELFDRYLP WNGKNLDMEQ FWGVVYLLFI QRLLSEKYFS PGSKFSNYLP ISLSIQQLLL QEFRQEILGQ GGNEFKLQDL FIWHENASYL QVSALHHRIL FPGEELEPYN YCAFRDEPVA LCMLYFLWWL CYRAYNYKDK SWLLPCAEGK ALVRIFVAED NKIKIPVYVC GRDGYYNLNQ LVMGYEPPYS LGVIFVLNGY GLEPKVFPIS NIIDSQLDII QFYIIPDVYC FRNSCQSLED ILEGIRNLIT DPAKIKKEAV HWKKWVEEIY Q
|
| |