Gene Information |
Locus tag | Moth_1052 |
Symbol | |
ID | 3831858 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1082781 |
End bp | 1083749 |
Gene Length | 969 bp |
Protein Length | 322 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637828980 |
Product | phosphoesterase, RecJ-like |
Protein accession | YP_429909 |
Protein GI | 83589900 |
COG category | [R] General function prediction only |
COG ID | [COG0618] Exopolyphosphatase-related proteins |
TIGRFAM ID | |
|
|
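The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and not part of the source database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1      # inclusive interval
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1    # drop the stop codon
    return gene_length, protein_length


# Values taken from the record for Moth_1052
print(cds_lengths(1082781, 1083749))  # (969, 322)
```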
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000000000287381 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000000192246 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTGATGA GTGAGATCGG GCAGATAGCG GCTATTCTGG CAACCGCCCG GGAAGTGGCG GTTGCCACCC ATATTATACC GGATGGCGAT TGCCTGGGGT CGATGCTCGG CCTTACCCTG GCCCTCCGGA AACGGGGCAC CAGCGTAATA GCTATTAATG CCGACCCGGT TCCGGAGATG TTTCAGTACC TGCCTGGCCA GGAAACCATC ATCGACCCGG ACCAGGTAAC CTCTATGCCG CCGTTACTGG TCATGGTCGA CTGCACTGAT ATGGAACGGG CCGGTAAGGG TTTTAGCAAC TGGCAACAGC GGGTAGAGAA AATAATTAAT ATCGATCATC ACGTCAGCAA CACCCGTTTC GGCCACCTGA ACCTGGTTGA CAGCCGGGCG GCGGCCACGG CGGAATTGAT TTACGCCGTC CTAGAACAAA TACCGGCAAC CTTTACGCCG GAGGTAGCAA CCTGCCTTTA TACAGCCCTG GCCACTGATA CCGGCTCTTT CCAGTATGAA AATTGTACGG CCAGGACCCT GCGCCTGGCA GCCAGCCTGC TGGAGAAAGG GGCCGATATG CCCTTAATCC GGGAGCACCT CTGGGAGAGT AAGCCCTTAA ACAGCATCCG CCTTCTGGCG GCTACCCTAC CCACCCTGAC TCTGGCCTAT GAAGGTCGGG TGGCCTGGAT GACAGTATCC AGGGCAGCCC TGGAGGCCAA TGGCGCCAGG CCGGAACACG CCGAGGGCCT GGTAAATTAT CCTCGCAGTA TTGCCGGCGT AGAAGTAGGC ATGCTTTTCC GGGAATTGCC GGATGGCAAG GTCAAAGTAA GCCTGCGTTC GAAAAAAATT GTCGACGTCA ACAGGGTGGC GGCATTATTT GGCGGCGGCG GTCACCGCCG GGCCGCTGGC TGTACCCTTG ACGGCGATCT GGATACAGTA GTCGCCAGGG TTGTGGCTGC AGCCGGTGAG GCCCTGTAA
|
Protein sequence | MLMSEIGQIA AILATAREVA VATHIIPDGD CLGSMLGLTL ALRKRGTSVI AINADPVPEM FQYLPGQETI IDPDQVTSMP PLLVMVDCTD MERAGKGFSN WQQRVEKIIN IDHHVSNTRF GHLNLVDSRA AATAELIYAV LEQIPATFTP EVATCLYTAL ATDTGSFQYE NCTARTLRLA ASLLEKGADM PLIREHLWES KPLNSIRLLA ATLPTLTLAY EGRVAWMTVS RAALEANGAR PEHAEGLVNY PRSIAGVEVG MLFRELPDGK VKVSLRSKKI VDVNRVAALF GGGGHRRAAG CTLDGDLDTV VARVVAAAGE AL
|
| |
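The protein sequence can be re-derived from the gene sequence, and the GC content recomputed, with a few lines of plain Python. This is a minimal sketch: the helper names `clean`, `translate`, and `gc_percent` are hypothetical, and the codon table is written out by hand. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in the permitted start codons), so the standard assignments are used here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks stop
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS on the + strand, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from the record
print(translate("ATGCTGATGA GTGAGATCGG GCAGATAGCG"))  # MLMSEIGQIA
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed in the record, and `gc_percent` on the same input can be compared against the reported GC content.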