Gene Information |
Locus tag | Moth_1414 |
Symbol | |
ID | 3832242 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1458303 |
End bp | 1459580 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637829350 |
Product | hypothetical protein |
Protein accession | YP_430270 |
Protein GI | 83590261 |
COG category | [S] Function unknown |
COG ID | [COG3681] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
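The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; both assumptions are consistent with the numbers in this record but are not stated by it. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information table in this record.
START_BP = 1458303
END_BP = 1459580


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1278 425
```

Both results match the Gene Length (1278 bp) and Protein Length (425 aa) fields listed above.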
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000000000941621 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACTTGC TTGATCAGCA AACTTTGATA AATTTATTAC ATCAAGAAGC CGATGTAGCA ATCGGGTGTA CGGAACCGGT AATGGTGGCC CTGGCCGCTG CTAAAACCAG GGATATGCTG GGTACTTTAC CGCGGCTGGT GGACATTTCC GTGAGTTCGG CGGTTTGGAA GAATGCTCGC CGGGTTGGTT TGCCGGGAAC CGGAGAGAAG GGCCTGGCAA TGGCGGCGGC CATGGGATTG CTGGCGCCGG TAGAGGCAGG CCAGCGCCTG CTGGCCGCTT TAACTCCCGT GCAGGTGGAA CAGGCAAAGA TATTGGTCCG GGAGGGAGTT GTCAAGGTCG GGGTTGTTGC CGCCAAGGAG GGTTTATATG CCCGGGCTGT GGCCCGGTCC AACCAGCATG AGGCCATAGT CGAGTTAAAC GGCAGCCATA AAAACTTCTC GGCCTTATGG TTGGATGGGA GGATGGCAGG AGGCGCAGGA GAGAATTTAA ATTTAAAACT GGAAGCGTTG TTGGCGCAGG ACTACCAGTC CCTGCTAAAA CAAGTTTTAT CCCTATCACC GGAGGAGCTA TATTTTTTAT ACCAAGGGGC TGAAGATATT CTAACCTTTG CCCGGGAAAT CCATCAAGGC GGTAGGAATC CCCTTTCCGC CATGGCTTCG TTTTTCAGGC GAACAGAAAG TGGAGGGGAA AGTTTAGAAG TACTTATCCG TAACCTCACA GGTATCGCGG TGGCAGAGCG GATGGCGGGA GCTACATACC CCGTCTTGAC CTGCGCCGGC AGTGGGAACC AGGGTATCTT GGCAGCAGTA TCGTTGCTAT TAGCAGGCCA GGAATTGCGA GCCGGTCCGG AGAGTGTGAC CCGGGCCCTG GCAATAGCTC ACTTTACCAA CATGTATCTG AAGGCCTATA CCGGGAAGCT ATCACCATTA TGCGGGGCAG TGACCGGGGG TGCCGGTGTG GCGGCAGCCA TCTGCTGGCT TTTGGAAGGT AGCTGCCAGC AAATCATTAA CGCTATGCAA ATCGTATTGG GTAATCTTTG CTGCGTTATA TGCGACGGAG CCAAGGAAAG CTGTGCTTTA AAAATAAGCA CTGCAGCCGT TGAAGCAGTC CGGGCAGGCT ACATGGCATG TCAGGGGATA AACCTGGAGG CCGGTACGGG TATTGTGGGC AAAAAGTTGG AGGATACCAT GGAGCTGGTT AGAAAGGTGT ACCAGGGAGG GCTGGGCGAA ATAGATTACT ACTTGGGCAA GGTCGATTAT CTTCTGTCAA CCAACTAA
|
Protein sequence | MNLLDQQTLI NLLHQEADVA IGCTEPVMVA LAAAKTRDML GTLPRLVDIS VSSAVWKNAR RVGLPGTGEK GLAMAAAMGL LAPVEAGQRL LAALTPVQVE QAKILVREGV VKVGVVAAKE GLYARAVARS NQHEAIVELN GSHKNFSALW LDGRMAGGAG ENLNLKLEAL LAQDYQSLLK QVLSLSPEEL YFLYQGAEDI LTFAREIHQG GRNPLSAMAS FFRRTESGGE SLEVLIRNLT GIAVAERMAG ATYPVLTCAG SGNQGILAAV SLLLAGQELR AGPESVTRAL AIAHFTNMYL KAYTGKLSPL CGAVTGGAGV AAAICWLLEG SCQQIINAMQ IVLGNLCCVI CDGAKESCAL KISTAAVEAV RAGYMACQGI NLEAGTGIVG KKLEDTMELV RKVYQGGLGE IDYYLGKVDY LLSTN
|
| |
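The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not part of the record itself, that strips the block spacing and computes a GC fraction; the helper names are hypothetical. Applied to the full gene sequence, the result can be compared with the GC content field in the Gene Information table. The demonstration uses only the first two blocks of the gene sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove the block spacing and any line breaks, and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence shown above.
first_blocks = "ATGAACTTGC TTGATCAGCA"
seq = clean_sequence(first_blocks)
print(len(seq), round(gc_fraction(seq), 2))  # 20 0.4
```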