Gene Information |
Locus tag | Moth_2374 |
Symbol | |
ID | 3832013 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2499278 |
End bp | 2500258 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637830293 |
Product | sporulation protein and related proteins |
Protein accession | YP_431199 |
Protein GI | 83591190 |
COG category | [D] Cell cycle control, cell division, chromosome partitioning |
COG ID | [COG2385] Sporulation protein and related proteins |
TIGRFAM ID | [TIGR02669] SpoIID/LytB domain; [TIGR02870] stage II sporulation protein D |
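The coordinate and length fields in this record are consistent with one another, and a short sketch can check that. It assumes the Start bp and End bp values are 1-based inclusive coordinates and that the CDS ends with a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS that ends in one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


if __name__ == "__main__":
    length = gene_length_bp(2499278, 2500258)
    print(length)                     # 981
    print(protein_length_aa(length))  # 326
```

Both values match the Gene Length and Protein Length fields listed for Moth_2374.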
|
|
Plasmid Coverage information |
Num covering plasmid clones | 59 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.000760464 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGCAAGC TCATGGGGAT TTTCATTATC CTGGTATTTG CCGCGGTCAT AATTACGCCG GTTGTAATTA TCGAAGGCAT CCGCCTGTTT CAGCCGCCCG TCCAGGTCCA GACCGGCAAA CAACTGGTAA GGGTCTACTT TCACCAGGCA GGTATCACTA AAATCATGCC CCTGGAGGAA TATATAGCCG GGGTGGTCGC CGGGGAGATG CCGGCCAACT TCGAGCCTGA GGCCCTGAAG GCCCAGGCCA TTGCCGCCCG CACCTACACC TTGAAAAAAA TCGAAGAAGC AAAGATCAAG CCCGATGCCA GCCATCCCAA CGCCGACATC TGTACCGACC CGGCCCACTG CCAGGCCTGG GCCGGGGATG ATGTCCTGCG CCAGCGTTGG GGCCTGATAG GCTTCTGGCG TTACAAAAAC AAAATCCAGT CCGCAGTCCA GGCCACCAGC GGTATGGTCC TGACCTACCA GGGACAGCTC ATTGACCCCG TCTATCATGC CAACGGCGGT GGTCGGACCG AAAGCGCGGC TGCCGTCTGG GGCCGGGACG TACCCTACCT CCAGAGCGTG CCGTCACCCT GGGATAAAAC GTCACCCCGT TATAGCGACA GCCGGACCTT CAGCCTCCGG TATCTGGATA GCAAACTGGG CGTCAACCTG GAGGCCGTAC CGGCGGCAGC CCTGGCCGCG CCCGGGGGCA CAGCTATCAG GGTCCTGGAG AAAACCCCCA CCGGTCGAGT CAAAACCATC AAAATCGGCG GCAAAACCTT TGCCGCCACC GATTTACGAA AACTACTGGG ATTATCCTCG ACGGATTTCA CCTGGGAGGT CCAGGGGGAC CGGATAACCT TTCATACCAT CGGCTACGGC CACGGCGTCG GCATGAGCCA GTACGGAGCC AACGGTATGG CCCGGGAGGG CAAAAACTTC GCCGAGATTC TGGCTTACTA CTATCGCGGT ACGAAGATTG AGAACAGATA G
|
Protein sequence | MRKLMGIFII LVFAAVIITP VVIIEGIRLF QPPVQVQTGK QLVRVYFHQA GITKIMPLEE YIAGVVAGEM PANFEPEALK AQAIAARTYT LKKIEEAKIK PDASHPNADI CTDPAHCQAW AGDDVLRQRW GLIGFWRYKN KIQSAVQATS GMVLTYQGQL IDPVYHANGG GRTESAAAVW GRDVPYLQSV PSPWDKTSPR YSDSRTFSLR YLDSKLGVNL EAVPAAALAA PGGTAIRVLE KTPTGRVKTI KIGGKTFAAT DLRKLLGLSS TDFTWEVQGD RITFHTIGYG HGVGMSQYGA NGMAREGKNF AEILAYYYRG TKIENR
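The two sequence fields can be cross-checked against each other and against the GC content field with a small sketch such as the one below. It is a minimal illustration, not the database's own method: the helper names are hypothetical, the gene sequence is assumed to be given in coding orientation (it starts with ATG), and the codon-to-amino-acid mapping used is the standard one, which translation table 11 shares (table 11 differs only in its set of permitted start codons).

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing used for display (10-letter groups) and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence field, spacing kept as displayed.
    prefix = "ATGCGCAAGC TCATGGGGAT TTTCATTATC"
    print(translate(prefix))  # MRKLMGIFII
```

Passing the full Gene sequence field to `translate` and `gc_fraction` allows a comparison with the Protein sequence and GC content fields, after removing the display spacing from the protein string in the same way.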