Gene Information |
Locus tag | Moth_1025 |
Symbol | |
ID | 3832645 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1053899 |
End bp | 1054984 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637828953 |
Product | DNA processing protein DprA, putative |
Protein accession | YP_429882 |
Protein GI | 83589873 |
COG category | [L] Replication, recombination and repair [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake |
TIGRFAM ID | [TIGR00732] DNA protecting protein DprA |
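The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record; coordinates assumed 1-based and inclusive.
start_bp = 1053899
end_bp = 1054984

# Inclusive interval length.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, so it adds no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed gene length of 1086 bp and protein length of 361 aa.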
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00128425 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.00000000696536 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGGAGGAAA AACCATTCTG GGTAGCCCTG CAGCAGACAC CCGGCCTGGG AGCCCGCCGG GTCCTGCAAC TGGTCAAATA CTTTGGCGGT GCCCGGGCTG CCTGGGAGGC TCCGGAAGGG GAACTCTTAA CCCTGGAGGG CCTGGGTAAA GGAGCAGTTT CCCTGCTAAA TTGGCGCCGC CAGGTACATC CGGAGAAAAT AATGGTTTCC TTGGCTGCTG CCGGCATCGG GGTCATAACC ATCGAAGAAG AAGTCTATCC TCCGGAATTG AAACGTATTT ACGACCCGCC CCCGGTTCTT TACTGGCGGG GCAGCCGGTT GCCGGGGGAA GGCTTTAAGA TAGCCATTGT TGGTACCCGC CGGGCGACGG CCTACGGTCT GAAGGTGGCC GAAGAACTGG CGGCCGGCCT GGCGGAGGCT GGCGTAGGGG TAGTCAGTGG CCTGGCCCGG GGTATTGATG CTGCTGCCCA TAGGGGCGCC ATCAAGGGCG GGGGATTGAC CTGGGGCATC CTTGGCTGCG GCGTTGATAT AGTTTACCCG CGGGAACACC GGGAACTCTA CCGCCAGGTT ATGGAACACG GGGCAATTAT CTCGGAGTTT CCCCCAGGGA CGCCGCCGGA TGCCGGCCAT TTTCCGGCCA GGAACAGGAT TATCAGCGGC CTGACGGCTG GGACAGTGGT AGTCGAGGCG GCGGCCAGGA GCGGCGCCCT CATAACTGCC GACCTGGCCC TGGAGCAAAA CCGCGATGTT TTTGCGGTCC CAGGTCCCAT CACCAGCCGT TATAGCCAGG GCCCACACGA TTTGATTAAG CAAGGAGCCA AGTTAGTAAG CGGCGTAGCA GATATTTTGG AAGAATATGA GCCCCGGTCG CTATGGAGTT TACCCCGGGA AGAAACTCGG GCCTCTGTAA CCCTGAACGC TATCGAGGAG AAGGTGTTGG CCGTTTTGGA GGCGACCCCA TCTCACCTGG ATGTCATTAT GGCGGCCACT GGTTTACCGG CCGGTGAACT TAATACAGCT TTAATCATGT TGGAGATGAA GCAATTGATC CGGCGGTTGC CGGGAGGTTT TTATGTGCGT TGCTAA
|
Protein sequence | MEEKPFWVAL QQTPGLGARR VLQLVKYFGG ARAAWEAPEG ELLTLEGLGK GAVSLLNWRR QVHPEKIMVS LAAAGIGVIT IEEEVYPPEL KRIYDPPPVL YWRGSRLPGE GFKIAIVGTR RATAYGLKVA EELAAGLAEA GVGVVSGLAR GIDAAAHRGA IKGGGLTWGI LGCGVDIVYP REHRELYRQV MEHGAIISEF PPGTPPDAGH FPARNRIISG LTAGTVVVEA AARSGALITA DLALEQNRDV FAVPGPITSR YSQGPHDLIK QGAKLVSGVA DILEEYEPRS LWSLPREETR ASVTLNAIEE KVLAVLEATP SHLDVIMAAT GLPAGELNTA LIMLEMKQLI RRLPGGFYVR C
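The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. This is a minimal illustration, not the pipeline used to produce the record: it assumes that translation table 11 shares the standard codon-to-amino-acid assignments (it differs from the standard code mainly in its permitted start codons), and the helper names `translate` and `gc_content` are hypothetical. It also shows how the space-separated 10-base blocks in the record can be handled.

```python
# Codon table built from the standard amino-acid string in TCAG order.
# Assumption: table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and uppercase the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: not part of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from the record.
print(translate("ATGGAGGAAA AACCATTCTG GGTAGCCCTG"))  # MEEKPFWVAL
```

The first ten residues produced here match the start of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the protein sequence and GC content fields to be checked in the same way.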