Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2224 |
Symbol | |
ID | 3830831 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2318855 |
End bp | 2319784 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637830144 |
Product | NLP/P60 |
Protein accession | YP_431054 |
Protein GI | 83591045 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0791] Cell wall-associated hydrolases (invasion-associated proteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.231098 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000179311 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTTAAGG GTGGGCGTTA TATAAGCATC TATTATTTTA TTATAGCTCT GGCCATGGTT ATAGCAGGGG GATTCCTGTT GAGCAGGCAG GCGAAAAGGT TGCCGCCACC GGTACAACTT CCCCCCGGGG CGACAACTCC CCGGGCGGAA AGCTGGTATG TGGGGGTAGC GGTGGCTGAC GTGCGGGCTA ACCCGGATCA GGGCGCCGAA CGGGTTACCC AGGCACTCCT GGGGGATGAG GTAAAACTCC TGCGGGACGA AGGCGAATGG CTCCAGGGTC AGGTGCCTGA TGGTTACATC GGCTGGTTGC AAAAGGGGAA CCTGGTCAGG GCGACGCCCC CGCTGGCCCG GGACCTGGTG GCCGTCAGGG TGCCCAGGGC CATATTATAC AAGGAGCCGG GGAGCGATGC TCAAGCCGGG GAGGCCTTGC TGGGCACTGA CCTGCCCCTG CTGGCGCAAA AGGAGGATTG GCTGGAGGTC TGGCTGCCGG GCCGCCCGCC AGCCTGGCTT AGCCGGCAGG AAGTCGACCT CTGGCCCGGG GGTCAATTAA CGGATAAGCG TTCCGGTTCG GACGTGATTA AGGTGGCCGA ACGGTTGGAG GGGGTGGCCT ACCTGTGGGG TGGGGTCAGC CTCTACGGTA TCGACTGCTC CGGCCTGACC TATATAGCCT ATTTCTTAAA CGGCGTTAAG CTGCCCCGGG ATGCCGACCT GCAGTTTAAA GTCGGCCGGC CTGTAGCCCG GAAAGACCTG CAGCCCGGCG ATCTGGTCTT TTTTAATACC AGTGGCGGAA CGCAGCCTAC CCACGTTGGT ATCTATACAG GCAACGGGCA GTTTTTAAAC TCACGTTCCC GCCAGGGGGT GGTTGTCAGC CGCCTGGATG AGCCCTCTTT TAGCGCCGGG TATCTGGGCG CCCGCCGGTA CCTGCCGTGA
|
Protein sequence | MFKGGRYISI YYFIIALAMV IAGGFLLSRQ AKRLPPPVQL PPGATTPRAE SWYVGVAVAD VRANPDQGAE RVTQALLGDE VKLLRDEGEW LQGQVPDGYI GWLQKGNLVR ATPPLARDLV AVRVPRAILY KEPGSDAQAG EALLGTDLPL LAQKEDWLEV WLPGRPPAWL SRQEVDLWPG GQLTDKRSGS DVIKVAERLE GVAYLWGGVS LYGIDCSGLT YIAYFLNGVK LPRDADLQFK VGRPVARKDL QPGDLVFFNT SGGTQPTHVG IYTGNGQFLN SRSRQGVVVS RLDEPSFSAG YLGARRYLP
|
| |