Gene Information |
Locus tag | Moth_1107 |
Symbol | |
ID | 3833073 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1133645 |
End bp | 1134595 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637829035 |
Product | hypothetical protein |
Protein accession | YP_429964 |
Protein GI | 83589955 |
COG category | [R] General function prediction only |
COG ID | [COG5006] Predicted permease, DMT superfamily |
TIGRFAM ID | |
|
|
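The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the usual convention for records of this kind, but not stated here), and that the unspliced CDS ends with a single untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1133645
end_bp = 1134595

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and its final codon is a stop codon
# that does not contribute an amino acid to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 951 316
```

Under these assumptions the result agrees with the listed Gene Length (951 bp) and Protein Length (316 aa).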
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAATAA ACCGGGAGGG ATATATGAAA AAAGCGAAAG ATAGTTCAGG GGAAGGACGG GGTCTTTTCC TGGCGGTCCT GGCGGCCGCC GCCCTGGGCC TGGAAGGAAT ATCTGCCAAG CTGGCCTACG CCGGTGGCGC CAATATCTTG AGTATCCTGG CCATACGATT CTTAGCAGCA GGCATTCTTT TCTGGGGCAG CCTGATAGTT TTTCCCCTCG ATTGGAAACT GAACCTGGGT ACCATGGTAC GTTTAACCGT CCTGGCCCTG GGAGGCCAGG CGACCACTAT TTTATTGCTA TTCTATGCCT TTGAGCGCAT TCCGGCAACG GTAGCCATGT TATTCTTCTA CCTTTACCCG GTGATTGTTA GCCTCCTAGC TACCGTTTTT CTAAAAGAAA CCCTCACCCG GGCCAAAATC GGCGCCCTGG TCCTCGCCTT TACAGGGCTT GCAATCATCC TTGGTGTCCC TACCGGCAAT CTGGAAATAT GGGGTATTGT CACAGCCCTT CTGGCCGCTT GCACCAATGG TATATATATG GTCGGCCAGA CGGGCTTATT GAAAACGATA GAACCACGGG TTTTTAACGC CTATGCAACC CTGACTATAG GCGTGGCCTA CTTTATTCTG GCCATAGTTA CCGGTACCTT CAGTCTTGCT TTTAATAGCC AGGCTATCCT GGCTATTGCC ACCTTGAGCT TGATTTGTAC TCTACTGGCA TATACGGCCG TGGCCTGGAG CCTGAAATAT ATCGGCGCCT CCCGGGCGGC CATTATTTCC ACCCTGGAGC CGGTGGTTAC CGCTGTGCTG GGCTTCCTGA TCTTGGGGGA GAGACTGCAT CCTATCCAGC TTCTGGGAGG GGCCTTGATC CTGGCTGGAG TAACGGTGCA ACAGGTGCTA ACGTCGAAGG ATAGCGGTGA AGGTCATGGT TATGGTATAA TTAAGCAATA A
|
Protein sequence | MGINREGYMK KAKDSSGEGR GLFLAVLAAA ALGLEGISAK LAYAGGANIL SILAIRFLAA GILFWGSLIV FPLDWKLNLG TMVRLTVLAL GGQATTILLL FYAFERIPAT VAMLFFYLYP VIVSLLATVF LKETLTRAKI GALVLAFTGL AIILGVPTGN LEIWGIVTAL LAACTNGIYM VGQTGLLKTI EPRVFNAYAT LTIGVAYFIL AIVTGTFSLA FNSQAILAIA TLSLICTLLA YTAVAWSLKY IGASRAAIIS TLEPVVTAVL GFLILGERLH PIQLLGGALI LAGVTVQQVL TSKDSGEGHG YGIIKQ
|
| |
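The gene and protein sequences above are printed in space-separated blocks of ten characters, so they need to be joined before their length or composition can be checked. The following is a minimal sketch of that cleanup with a simple GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two ten-base blocks of the gene sequence, used here as a short example.
fragment = "ATGGGAATAA ACCGGGAGGG"
seq = clean_sequence(fragment)

print(len(seq))          # 20
print(gc_fraction(seq))  # 0.55
```

Applying the same two helpers to the complete gene sequence would give values that can be compared with the Gene Length and GC content fields.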