Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0926 |
Symbol | |
ID | 3832927 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 960192 |
End bp | 961169 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637828857 |
Product | germination protease |
Protein accession | YP_429786 |
Protein GI | 83589777 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01441] GPR endopeptidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.00655753 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCAGCGTG CCGAATTCTA CCGGCTGGCC GGGGTAGTTC TGGACCTGGC AATAGAGGCC CATGACCTCT TACGTGGGGC TACCAGGCAG GAAGTCCCCG GGGTACGGGA AGACAAGGAA AAATTTCCCA ATGCCATTGT CACAACCATT ACCATTCTGG ATGAAGCCGG AGAACAGGCC ATGGGGAAAT CCCGCGGTAC TTACATAACC ATCGACGCCC CCGCCCTCCT GGCCGAAAAC CCGCCGGTAC ATGAGGAAAT AGCTGGATTA CTATCCCAGA AACTCAACCT CTTACTGCAA AACTTGCAGG TTGGCCCTAC CGACCCTGTC CTGGTGGTCG GCCTGGGCAA CTGGGAGGCA ACCCCGGATT CCCTGGGCCC CAAGGTTATC AACCAGTTTA CCGTCACCAG GCACCTGTTA AAATATGCTC CCCAGAGCAT GCCTCCGGGA ACCAGGCCGG TCAGCGCATT GGCCCCCGGC GTCCTGGGAA CCACGGGCAT TGAGACAGCT GAAATTATCC GGGGTGTAGT CGAGAAGACC GGACCACGGG TAATCATTGC CATCGATGCC CTGGCTGCCG GGGATTTGAA TCGCGTTGGC AGCAGCATCC AGATTAGCGA CACCGGCATC AGCCCCGGTT CGGGAGTCGG GAACCAGCGT CTGGGTATAA ATTTGCAAAC TATGGGTGTA CCGGTAATCG CCATCGGTAT TCCTACCGTC GTCCATGCCG GGGTCATAAT CTTTGAGGCC CTTAATCAGT TACAGCAGGC CTTTCCCAAT GTCAACTTGC AAATTAATCA GGCTCTGGCT CAAAATCTCT CCCGGAATGT CCTTTCGCCC TTTGGCGGCA ATCTGACGGT TACACCCAAA GAGGTTGATG ACCTGGTCCA CAACCTGGCC TGGGTCATTG GCAACGCCCT GAACAGATCC CTGCATACCA ACCTTCTCCG TACCCATGCG GCTATCCCGT TACATTAA
|
Protein sequence | MQRAEFYRLA GVVLDLAIEA HDLLRGATRQ EVPGVREDKE KFPNAIVTTI TILDEAGEQA MGKSRGTYIT IDAPALLAEN PPVHEEIAGL LSQKLNLLLQ NLQVGPTDPV LVVGLGNWEA TPDSLGPKVI NQFTVTRHLL KYAPQSMPPG TRPVSALAPG VLGTTGIETA EIIRGVVEKT GPRVIIAIDA LAAGDLNRVG SSIQISDTGI SPGSGVGNQR LGINLQTMGV PVIAIGIPTV VHAGVIIFEA LNQLQQAFPN VNLQINQALA QNLSRNVLSP FGGNLTVTPK EVDDLVHNLA WVIGNALNRS LHTNLLRTHA AIPLH
|
| |