Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0189 |
Symbol | |
ID | 3832262 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 184654 |
End bp | 186300 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637828125 |
Product | hypothetical protein |
Protein accession | YP_429067 |
Protein GI | 83589058 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTTGCGGA GGGTTTCTAC GAGGGTTTTA ATCTGCTGGA CGTTGTTGTT CTTGATGCTG TTGTCCCTGG TGGGCGGGGC GGTCGCCCTG GCCGCGGGGC TGGACCTCCC CGGCCTTTAC GACCAGTTGA AGAATGATCC CACCTACCAG CCCTATCGCC AGGAACTGCT GAATGAATAC CTTAAGTTAA ATAGCAACGA GCAGGCCATG GATAACGACC TGCGGTCTTT CCTGGCCGAT GTGCAGGCCC GGCTTTCCCA GGCTGATACG AGCAGCCTCA AGGATGAAGC CGGCGTGAAC GCCCTGGTCA TGAAAACGGC GGCGGAAGTG CTGTTTACGG ATAAGAAATA CACTACCCTG GCAAATGCCT TGGCGGCAAC TTCGGACATT CAATCCATCC TGAAGGGCAA TTTCCCGCCG TCCCTGGAGC CTGTCCGGAA AGAGGTTGTG AACGCTTTGC TGGGGAGCGG CGGTCAGGCC GCGGCCGGCG GGGGCGGGAC CGCGGCGCCG TCGGCCGGTA CGGAAGAAGA GATCCAGCGC GACACTGCCG CCGGGGTTGT TACCTGGCGG GTGAACCCGG AGGCGGCCGC CGGGATAAAG GATAACAAGC TGGTCCTTTC CCTGGCCGGG GAGACCGCTC CCACCCGGAC CTTCTCCCTG CCGCCGGCTC TCCTCCAGGA CCTGGGGCAA AAAAAGGCGG ACCTGGAGCT GGACTACGGC CCGGTGGTCC TGACCCTGCC GGCAGCCTCC CTGGCGGATT TGAGCAGCTC CGGGGGAGCA GACCTGACGT TTAGGCAGGA GAGCCTGGAC CCGGCGCGGG TAAAAGACGT CAACGGGCCG GGCTACAATG GGGCCGGCAT GGTTTATACC CTCACCGCCG GGGATAAGGG AGGGCTCCCG GCCGGCGTCC CCGCTAGGAT AAAATATGAA GAGCGGCAGG GGTTGGACCG GGACCTCCTG GGCGTTTACC AGGTGAAAAG CGATGGTTCT CTGGTTTACC TCGGCGGTCA TGGGGTGGAC GGGAGTCCCT ACCTGGAATT TTCGTTACCG GGTGATGGCA GTTATGCCGT TTTGGAGTAC CGGGCTGATT TTGCCGACCT GGCGGGCCAC TGGGCGGCCA GGGATGTCCA GGTCATGGCG GCCAGGCATA TCGCCGCCGG CGTGGGCGAG GGGCGGTTTG AGCCCGACCG AGACATTACC CGGGCCGAAT TTACGAGCCT CCTGCAGCGG GTCCTGGGCC TGCCGGTAAA AGGAACGGTA ACCGGCTTTA GCGATGTGCC GGCCGATGCC TGGTACGCTC CCTCCGTGGC CGCCGCCGTC CGGGCGGGGC TGGTGCACGG GTATGAGGAT AGTACCTTTA AACCCGATAA TCCGGTGACC AGGGCCGAGA TGGCGGCCAT GCTGGGCAAC GCCCTGGCCT TGCAGGGCCT GGCCGTGAAG GTGGAGCCTG GTCAGGTAGA AGCCGTCCTG CAACCCTATC GGGACCAGGC GGCCGTACCT TCCTGGGCCC GGCCGGCCAT GGCCGCGGCT GTGACGGCCG GTATTGTCGG CGGCCGCGAA GGCGGCCTGG CGCCCCTTGA GCGCGCCACC AGGGCCGAGG CCATAGTGAT GCTTGAGCGG CTGATGGATA AAGCCGGCTG GAGATAG
|
Protein sequence | MLRRVSTRVL ICWTLLFLML LSLVGGAVAL AAGLDLPGLY DQLKNDPTYQ PYRQELLNEY LKLNSNEQAM DNDLRSFLAD VQARLSQADT SSLKDEAGVN ALVMKTAAEV LFTDKKYTTL ANALAATSDI QSILKGNFPP SLEPVRKEVV NALLGSGGQA AAGGGGTAAP SAGTEEEIQR DTAAGVVTWR VNPEAAAGIK DNKLVLSLAG ETAPTRTFSL PPALLQDLGQ KKADLELDYG PVVLTLPAAS LADLSSSGGA DLTFRQESLD PARVKDVNGP GYNGAGMVYT LTAGDKGGLP AGVPARIKYE ERQGLDRDLL GVYQVKSDGS LVYLGGHGVD GSPYLEFSLP GDGSYAVLEY RADFADLAGH WAARDVQVMA ARHIAAGVGE GRFEPDRDIT RAEFTSLLQR VLGLPVKGTV TGFSDVPADA WYAPSVAAAV RAGLVHGYED STFKPDNPVT RAEMAAMLGN ALALQGLAVK VEPGQVEAVL QPYRDQAAVP SWARPAMAAA VTAGIVGGRE GGLAPLERAT RAEAIVMLER LMDKAGWR
|
| |