Gene Information |
Locus tag | Moth_1420 |
Symbol | |
ID | 3832248 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1464718 |
End bp | 1465587 |
Gene Length | 870 bp |
Protein Length | 289 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637829356 |
Product | hypothetical protein |
Protein accession | YP_430276 |
Protein GI | 83590267 |
COG category | [S] Function unknown |
COG ID | [COG2014] Uncharacterized conserved protein |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.691077 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.507757 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAATGG CGGGAGTTAG CTATCAACCG GGTACGATCC TGCGGGAAAC CATACATTCT ATCCGCAACA TCCTTGGCGA TTCCCTGGAT GATTTAACAG TGGAGCGGGT GGTTATCGGG GTATTCTATA CGGGTGTAAA GCTGAGTAAC GGCCAGGGCG GCTTGTGTTT TACACCTATA AAAGCAATTC CCGGGGCTGT ATGCTGCCCC AGTTCTGCCA GGGCTATGCC CGCCTCGGGA GAGTTGAGGG GCCGGAAGGC GACGGCGTTC CTTGAAGGGA TGTTCGCCGA TCAGGCTTTG CGGAGGGCCC TGGGGATAGC CGTACTCAAT GCCCTGTCGG CCACCTGTTG GCAGGTGCGA CCGCCCATGA ACTATACTCT TAAAACAGGC GCCAATGCCC TGGACCAGGT AATAATACCA GGTGAGGGTC AGGTAGTGGT TGTTGGCGCT CTGGCTCCTT TCCTTAAGGT CCTAAAAAGG CAGGACTGCC GGTTTACTAT TCTTGAACTA GATCCTGCCA CCCTTAAAAA GGATGAATTA CCGTTTTATC GACCGCCAGA GGATGCCCCG GAAGTGATAC CTTGGGCCGA CCTCCTGATT ATCACCGGGA CTACCCTGAT CAACGATACC TTAGAAGGTT TGTTGAGCGT TGTTAAGCCC GGGGCACAGG TGGTGGTAGT TGGACCAACG GCGAGCATGC TCCCCGATGC TTTCTTCCGC CGGGGTGTCA ACCTCTTGGG AGGCACCCTG GTTACCAAAC CGGACGAGTT ACTGGATGTT CTGGCGGAAG CCGGCTCCGG GTACCATTTC TATGGTCGCG CGGCTGAGAT GATGGTTTTA CGGCTTTCGG ATCATGGCGG CACAATTTAA
|
Protein sequence | MVMAGVSYQP GTILRETIHS IRNILGDSLD DLTVERVVIG VFYTGVKLSN GQGGLCFTPI KAIPGAVCCP SSARAMPASG ELRGRKATAF LEGMFADQAL RRALGIAVLN ALSATCWQVR PPMNYTLKTG ANALDQVIIP GEGQVVVVGA LAPFLKVLKR QDCRFTILEL DPATLKKDEL PFYRPPEDAP EVIPWADLLI ITGTTLINDT LEGLLSVVKP GAQVVVVGPT ASMLPDAFFR RGVNLLGGTL VTKPDELLDV LAEAGSGYHF YGRAAEMMVL RLSDHGGTI
|
| |
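The fields in this record can be cross-checked against one another. The sketch below is illustrative and not part of the database entry: the helper names (`span_length`, `translate`, `gc_fraction`) are hypothetical. It assumes inclusive 1-based coordinates and assumes the gene sequence is listed in coding orientation, since it begins with ATG. The codon table uses the amino-acid assignments that translation table 11 shares with the standard code.

```python
# Values copied from the record above.
START_BP = 1464718
END_BP = 1465587
GENE_LENGTH_BP = 870
PROTEIN_LENGTH_AA = 289

# Amino-acid assignments in TCAG codon order. Translation table 11 uses the
# same assignments as the standard code; it differs only in start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def span_length(start, end):
    """Length of an inclusive, 1-based coordinate range (assumed convention)."""
    return end - start + 1


def translate(dna):
    """Translate a coding-orientation DNA string; spaces are ignored."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna):
    """Fraction of G and C bases in a DNA string; spaces are ignored."""
    dna = dna.replace(" ", "").upper()
    return (dna.count("G") + dna.count("C")) / len(dna)


# Coordinates agree with the listed gene length.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP

# 870 bp is 290 codons; the last one is a stop, leaving 289 residues.
assert GENE_LENGTH_BP // 3 - 1 == PROTEIN_LENGTH_AA

# First 30 nt and last 9 nt of the gene sequence, copied from the record.
first_codons = "ATGGTAATGG CGGGAGTTAG CTATCAACCG"
last_codons = "ACAATTTAA"
assert translate(first_codons) == "MVMAGVSYQP"
assert translate(last_codons) == "TI*"
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content listed above.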