Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0906 |
Symbol | |
ID | 3831294 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 941110 |
End bp | 942174 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637828837 |
Product | hypothetical protein |
Protein accession | YP_429766 |
Protein GI | 83589757 |
COG category | [R] General function prediction only |
COG ID | [COG0820] Predicted Fe-S-cluster redox enzyme |
TIGRFAM ID | [TIGR00048] radical SAM enzyme, Cfr family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000000883258 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTACCA GAATTGACTT GCGGGGGCTG TTGCCCCAAG AATTGGAGGA GCTGGCAGTT CGGCTGGGGG AGGCGCCCTA CCGTGGCCGG CAGATCTTTC GCTGGTTGCA CGCCCGTCGG GCGAAAGGAA TAGAGGTTAT GTCCGATTTG CCCCGGGCTT TCCGGGAGCG TCTGGCGTTA GTAGCCGAAC TACCTCCGGT AAGGGTTCTG AACCGCCTGG TGGCGGCTGA CGGCCTGACG CGCAAGTTGC TCCTGGGCCT GGGTGACGGT AATAGCATCG AATGTGTTCT CATGATTTAC AAAGACGGCC GCCGCAGGAA TACCGCCTGC CTGTCCAGCC AGGTGGGTTG CGCCATGGGA TGCAGTTTTT GCGCCACCGG TCAGGGCGGC CTCCAGCGTA ACCTGACCGC CAGTGAGATT ATCCTCCAGG CCCTGGCCCT GGGGGCGGAA CTGGCGGAGG GGGAAGGGGG GAACCGGATC AGCAATATCG TCTTTATGGG TATGGGGGAA CCACTCAATA ACTATGAGGC CGTCATGAAA GGGGTACGTA TTTTCGAAGA TCCTTCGGGA TGGGGCATCA GCCACAGGCG GATTACCCTG TCCACCTGCG GCATTGTTCC CGGCATCGAG CGACTGGCCA GGGAAAAACC GCCCCTGGAG CTGGCTGTTT CCCTGCATGC GGTCACTAAC GAACTGCGGG ATAAGCTGAT GCCCATCAAC AGGCGTTACC CCCTGGAAGA GCTTATCCCG GCCTGCCGCC GTTATGCTGA AATAACCGGG CGGCGGGTTA CCTTCGAGTA TGCCCTGATA GCCGGGGTCA ACGACCGTCG GGAGGATGCC CGGGGTTTAA GCAGGCTTCT CCGGGATATG CTGGCCTTCG TAAACATAAT CCCCCTGAAC CCGGTGGCCG GGAGCGGGTT CAAAGGGGTA CCCCCGGCGG CAGCCAGGGC TTTTGTTGCG CTGTTGCAGG AGGCGGGGCT GGAGGCAGCC ATCCGTGATA GCCGGGGACA GGATATCGCC GCTGCTTGTG GCCAGTTACG TTTCGCGTCC AGGGAGGTGT TATAA
|
Protein sequence | MTTRIDLRGL LPQELEELAV RLGEAPYRGR QIFRWLHARR AKGIEVMSDL PRAFRERLAL VAELPPVRVL NRLVAADGLT RKLLLGLGDG NSIECVLMIY KDGRRRNTAC LSSQVGCAMG CSFCATGQGG LQRNLTASEI ILQALALGAE LAEGEGGNRI SNIVFMGMGE PLNNYEAVMK GVRIFEDPSG WGISHRRITL STCGIVPGIE RLAREKPPLE LAVSLHAVTN ELRDKLMPIN RRYPLEELIP ACRRYAEITG RRVTFEYALI AGVNDRREDA RGLSRLLRDM LAFVNIIPLN PVAGSGFKGV PPAAARAFVA LLQEAGLEAA IRDSRGQDIA AACGQLRFAS REVL
|
| |