Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1917 |
Symbol | |
ID | 3830841 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1987003 |
End bp | 1988676 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637829850 |
Product | hypothetical protein |
Protein accession | YP_430760 |
Protein GI | 83590751 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.00808951 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATAGTT ATCAGGAGAT ACTTAAGGAC GCGCCGCTGG ATTTCCTACA GAAGCTGGCC GGGAATCTGA ATTTGGCGAC AAAGGGTGGA AAAAGAGCAG GCAGGGGTTC AGAGGACGGC TGGCAGGCAT TATACAATAA ATTGGTGGCG TATTATAGTT CCCCGGACAA CCTGGAAACC CTCTGGCAGA AAATCGGGCC TTCAGGCCAA CTGGCGCTGG AAACAATCCA TTTTAGCGAA CCTTATTACG ATGAGGTGAG CAGGGTCCAT GAGAGGCTCA ACAAAATCCT CGGAAAAAGG GCGGCAAGGG AGGCGCGGGA ACTTCTCCTG GGTTGGGGCC TGATATTTTT AAGTGAAAAC GAATACGGAC TGGATTATTA CGATCTACCT CTAGAAGTAC GTAAATTTAT CAACTATAAA GTACTTCCCC TGCTGGTAAA AAAAGACGGC ATCCCGCTAC CGGAACAAAG GGAAAACCAC GGTCTCTTTT TCTGGCTGGA TTTTTATATT TTACTGGCCG GGGTCCTGCA ACGCGAGGTC AGGGTGACCC AGACCGAGCG TGTTTTTTAT AAAAGGGACC GCAAGAAGAT CATGCTCTGC CAGCACTACC CTGATGACGA AAGCCGCTAC CTTTTGCTGG AAGAGGTGGC CTGGGCGAAT GACTTTTTAG TTGAAAAAAA TGGTTGCGCC CGGTTGAGCG CGAAAACTTG GCAATGGTTG CAGCTACCGC GCTATAAACA ATGGCTTACA TTTGTCGACT GGGTAATAAA CCGTTATTTC CAATGCCGTA ATATTTGGGC CACCCATATC CTGGGATTTC TGTTAACATT ACCGCCGGAG AAATGGCTCT CCTTGCCAGC CCTTTACCAG CTAATCCATA AGTATAACAC GCCCTCCCAG GCGAATTATA TCCTGGCGAA TAACAAGGAG TTGTTGCAAA GATTTCTCTG GCTGGGCCTG ATTGAGGTTG CCGGCGGTTT GGAACAAGGA TGCATCAGGA TAACCGATCT TTTCCGCCGC TACTTTAATG TTTTGCTCCA GCACGACCAA GAAGTAGAAG AAACGGACGG GGAGGTTTTC AGGGAGGCGA TAGAGGGTTT CTTTCCGGAA GCATCCAGTT TTATCGTGCA ACCCAATTTT GAAGTCATCG CCCCCATGGA ACTCTCCCCC AATCTGTTCA TGCAGCTAAG CACCTTTACC GATCTAGTCA GCGCCGACCG CATGTTTATT TTCAGCCTCA ATGAGAAAGC ATTTTACCGG GGTTTTACCA GGGGCCGGCA ACCGGAAGAG ATGCTAAAAT TCCTGCAGGA ACACAGCAAG TATGAATTAC CGCCCAATGT TCTTACAACG GTGGAGGAAT GGGCGGCAAA AATGGGGAAG GTCTCCTTAG TGAAAGGGGT GCTGGTGCGT TGCCAGAAGG AAGAACTGGC GGAACAGGTG AAAGCCCTGC TGGAAGCCAG GGGGTGGCTG ATCGAGGCCA TAACGCCGCA GGTCTTCCTG GTGCCGGAGA ATAGGGGTGA AGAATGCCTG GAGTTGCTGG AGAAACAGGG CTTTATGCCC CATCCCCAGC TGATTGCTTT GCGGGCGGGA GGGGAGGACG ACGTCGACCT CGATGAAAGC CCGAATACCT TGCTGGCCCG GTTTATAGAG GCTGCCTTAA AAAAAAGAGG GTAA
|
Protein sequence | MDSYQEILKD APLDFLQKLA GNLNLATKGG KRAGRGSEDG WQALYNKLVA YYSSPDNLET LWQKIGPSGQ LALETIHFSE PYYDEVSRVH ERLNKILGKR AAREARELLL GWGLIFLSEN EYGLDYYDLP LEVRKFINYK VLPLLVKKDG IPLPEQRENH GLFFWLDFYI LLAGVLQREV RVTQTERVFY KRDRKKIMLC QHYPDDESRY LLLEEVAWAN DFLVEKNGCA RLSAKTWQWL QLPRYKQWLT FVDWVINRYF QCRNIWATHI LGFLLTLPPE KWLSLPALYQ LIHKYNTPSQ ANYILANNKE LLQRFLWLGL IEVAGGLEQG CIRITDLFRR YFNVLLQHDQ EVEETDGEVF REAIEGFFPE ASSFIVQPNF EVIAPMELSP NLFMQLSTFT DLVSADRMFI FSLNEKAFYR GFTRGRQPEE MLKFLQEHSK YELPPNVLTT VEEWAAKMGK VSLVKGVLVR CQKEELAEQV KALLEARGWL IEAITPQVFL VPENRGEECL ELLEKQGFMP HPQLIALRAG GEDDVDLDES PNTLLARFIE AALKKRG
|
| |