Gene Information |
Locus tag | Moth_0345 |
Symbol | |
ID | 3832757 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 351196 |
End bp | 352242 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637828280 |
Product | hypothetical protein |
Protein accession | YP_429222 |
Protein GI | 83589213 |
COG category | |
COG ID | |
TIGRFAM ID | |
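
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that matches the listed gene length) and that the final codon of the CDS is a stop codon. The helper names are hypothetical and not part of the database.

```python
# Hypothetical helper names; the numeric values are copied from the record above.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(351196, 352242)
print(length, protein_length_aa(length))  # prints: 1047 348
```

Both values agree with the Gene Length (1047 bp) and Protein Length (348 aa) rows.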
| |
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.905049 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.621263 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTGCAA TTCAAACCAA CCCCCAACCG GCGGCCCTGG CCATCGACGT GGGCTTCGGT TACACCAAAG CGGTATCTTC AACCGGCGGT AAAGTTATTT TTCCTTCGGT TGTCGCCCCG GCAGGTTCTC CCGACGCCTT TGACCGTCTG GATAAAAGCG ATACCGGCTA CCGGGTGAGG ATTAAAAAGG GGATAGACGG TCTCTTAGAA GAATGGCTGG TAGGCGAACT GGCCCTGAAG GAAGGGCGGG AGGTGCAGTA TTTCCAGGAC TGGGAGAAAC ACAGCCACCC GGCCCATGAC GCCGTATTGC TGGCCGCTGC TGTTTTAACC TGGAACTGGC CCCGGGCTGG TAGCGGCATA ATGGGGATCA GCAACCCTGC CCTGGTGGTA GGGCTGCCGG TGGATGTCTG GCGCGACGAA CTGCAGCGGG AAGGCCTGAA GAAACACCTG GCAGGCCTGG CAGCCGAGGT GAGCGTCAAC GGGAACGACC CGGTGCGGGT GACCTTCAGC AGAGTTTATG TCTACCCCCA GGCCGCCGGC GCTTTCCTTA CCGTTCCTGA TCTTCCCGAC AGCGGCATCG TGGCCCTGGT AGACGTGGGA CAGAAAACCA CTGATAGCGC GGCGATAGAG ATTGTCAACG GCCGGCAGAG GCTGGTAAAA ACCATGTGCT TCAGCATCAA CAAAGGCATG GCGGCCCTGG TGGAAGCCGT GCGGGAGGAA TTCCGGCGGC AGACCGGGGC GCCCCTGCCG CCCCAGCAGG CCTGGGAGAC GGTAAAAAGC GGATCCCTCT GGTACCGGGG CAAGCAGATT GATATGGCCC CGGCGATTAA AAAGGCCCGG TCGGAGATAG CCCGGGCCAT AGCCGACCAG GTGCTGGCCG GGTGGGGCGA GAGGGCCGAT TTTGTCAGGA AAGTATACCT GGCCGGCGGC GGGATATTAG ACTTGCCGGA TTTAAAAAAT ATGTTCCCGG CGGCTGCGGT CCTGCCTGGT CCCCAGTGGG CCAATGCCCT GGGGTTCTTA AAAGTAGCGA GAGGACTGGC TGTTTAA
|
Protein sequence | MLAIQTNPQP AALAIDVGFG YTKAVSSTGG KVIFPSVVAP AGSPDAFDRL DKSDTGYRVR IKKGIDGLLE EWLVGELALK EGREVQYFQD WEKHSHPAHD AVLLAAAVLT WNWPRAGSGI MGISNPALVV GLPVDVWRDE LQREGLKKHL AGLAAEVSVN GNDPVRVTFS RVYVYPQAAG AFLTVPDLPD SGIVALVDVG QKTTDSAAIE IVNGRQRLVK TMCFSINKGM AALVEAVREE FRRQTGAPLP PQQAWETVKS GSLWYRGKQI DMAPAIKKAR SEIARAIADQ VLAGWGERAD FVRKVYLAGG GILDLPDLKN MFPAAAVLPG PQWANALGFL KVARGLAV
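
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that spacing, compute GC content, and translate a gene sequence. It is an illustration under stated assumptions, not the database's own pipeline: the function names are hypothetical, and the codon table is written out by hand using the codon assignments that translation table 11 shares with the standard code.

```python
from itertools import product

# Codon assignments for translation table 11; for internal codons they are
# the same as in the standard code. Order: TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Alternative start codons that table 11 allows (such as GTG or TTG)
    are not special-cased here; this gene starts with ATG.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases and last 9 bases of the gene sequence above.
print(translate("ATGCTTGCAA TTCAAACC"))  # prints: MLAIQT
print(translate("GCTGTTTAA"))            # prints: AV
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence gives a quick consistency check on the record.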
|
| |