Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1696 |
Symbol | |
ID | 3833296 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1733727 |
End bp | 1735142 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637829621 |
Product | sporulation protein and related proteins-like |
Protein accession | YP_430541 |
Protein GI | 83590532 |
COG category | [D] Cell cycle control, cell division, chromosome partitioning |
COG ID | [COG2385] Sporulation protein and related proteins |
TIGRFAM ID | [TIGR02669] SpoIID/LytB domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.600188 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGGAAAA CTTACCGCCG GGGGATGGCT GCCCTGCTCC TGGCCTTAAT ACTAATCCCT GCTGCCTCCG GGGAGGCGGC TTCCCGGCCC ATCCGCGTCC TGCTGGACAG TAGCCCCGGC GAGGTGGAAT TCCAGGTGGA GCAGGGCGGT TACCAGCTGG TTGACGATCA CAGCGGCCAG GAGATCGCCA CGGCCACTTC CGGGGTTAAG TGGACGGTCA GGCAGGACGG CAGCACCCTG CAGCTTTTAA AAGATGGTGC CCCTGTAGGC AGCTTCAATG GCCCCATTCA GCTAAAACCT GCCAGGGCAG GTCTCAACCT CTTCAGTTAC CGGGGCAACC GCTACCGGGG GAGCCTGAGT ATCCTGCGGG GCGAGGGCGG TTTGCTGGTT ATCAACATCG TCGACCTGGA ACAATACCTT TACGGCGTCG TTGGTAAAGA AATGCCGGCC AGCGCGGCCC TGGAAGCCCT TAAGGCCCAG GCTGTAATCG CCCGTACCTA TGCTATCACC AGGATGCAAC CGTCCCAGCT CTACGACGTC ACCGACGATA CCTCGACCCA GGTATACGGC GGTTATGAGG CCGAGGTCAA TTACGGCGCC GCCAGGGATA AAGTTCTGCA GGCGGTAGAC AGCACCCGGG GAGAGGTGAT CTATTATGAC GGCAAGGTCA TCCAGGCCTA CTTCCACGCC AACGCCGGCG GCTACACCGA GGATAGTGAG AACGTCTGGA GCAATCCCCT GCCCTACCTG CGGGGCGTGC CCTCGCCCGA TGACGACTGG GCCGTCAAGT ATCCCTACCA GACTCCTGGC GGTTACCCGG CCAATACATA TAACTGGACG GTGACCCTGA CCAGGCAGCA GGTCCAGGAC CAGGTTAATA GCTGGCTTGC CGGTCAGGGT AAAGGCGCGG TCGGGGAGGT GGTCGACCTG GTCCTTTCGC GGCTGGGGCG TGACGGCCAA AAGGAGACGG TATCCGGCCG GGTAACCAGG ATGGATATCC GCACCACCAC CGGAACGGCC CAGGCTTTCC GGGACGGCAT TCGCGCCGTC TTTGGCCTGA AAAGTACCCT CTTCACGGTG CAGATGGACT CCACGGTGAA TGTCCTGGAC GGTTCCGGGC AGCAACGGGC GGTGAATTAC GGCGCCGAAC TGGTAGCCCT GGGAGCCGGC GGCGTCCTCA ACGCCCCTAA CGGTGCCGCC GGAGATTATA CGGTAGCCGG GCGCGACGGC ACACGCCAGG TACCCAAGCT CTTCACCCGG GTAATCTTCC AGGGGAAGGG ATACGGCCAC GGCCTGGGCC TCAGCCAGTG GGGGGCCATG GGCATGGCCG AAAAAGGGTA TACTTACCAG CAAATTATCG AACACTACTA CAACCAGGAT CATTATGACG GCCACCTGAA GATTGCGACC TATTGA
|
Protein sequence | MRKTYRRGMA ALLLALILIP AASGEAASRP IRVLLDSSPG EVEFQVEQGG YQLVDDHSGQ EIATATSGVK WTVRQDGSTL QLLKDGAPVG SFNGPIQLKP ARAGLNLFSY RGNRYRGSLS ILRGEGGLLV INIVDLEQYL YGVVGKEMPA SAALEALKAQ AVIARTYAIT RMQPSQLYDV TDDTSTQVYG GYEAEVNYGA ARDKVLQAVD STRGEVIYYD GKVIQAYFHA NAGGYTEDSE NVWSNPLPYL RGVPSPDDDW AVKYPYQTPG GYPANTYNWT VTLTRQQVQD QVNSWLAGQG KGAVGEVVDL VLSRLGRDGQ KETVSGRVTR MDIRTTTGTA QAFRDGIRAV FGLKSTLFTV QMDSTVNVLD GSGQQRAVNY GAELVALGAG GVLNAPNGAA GDYTVAGRDG TRQVPKLFTR VIFQGKGYGH GLGLSQWGAM GMAEKGYTYQ QIIEHYYNQD HYDGHLKIAT Y
|
| |