Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0259 |
Symbol | |
ID | 3833222 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 266099 |
End bp | 267439 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637828195 |
Product | hypothetical protein |
Protein accession | YP_429137 |
Protein GI | 83589128 |
COG category | [S] Function unknown |
COG ID | [COG0391] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01826] conserved hypothetical protein, cofD-related |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 65 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGATGGCT TAAAATGGCT CTACCCGGGG CTTAAAATTA AACGTTGGCT GTTGCTGGCA GTCCTGGGTT TGCTGCTGCT TGTTTCTGGT CTAACGGTTA TCTTGGGGAT AACCCTGCTG GCTTCGGCGG AAAAAGGAGT TACCTGGTTT ATCCTTCATA CCCTGGGTGG CCTGGGCTCG CCCCTGCTGG CTGGTCTTTT GGCCATGGCC CTGGGAGCGG TCTTTATTGG GGTGGCCGTC CGGAATCTGG CCCGTTCGGT TATCCAGGTT CTCTTGCCCG GTCATACCGC CAATCCCTGG CAGGTTTTTT ACCGGCGCCA GTACCTGGCC CGGGGCCCCC ACCTGGTGGC CATCGGCGGG GGGACGGGGC TGGCCGTCCT CTTGCGGGGT TTAAAAAACT ATACCCGCAA CCTGACGGCC ATCGTCACCG TGGCCGATGA CGGGGGAAGT TCCGGTCGCC TGCGCCAGGA ATTGAGCATC CCGCCCCCGG GGGATATCCG CAATTGCCTG GTGGCCCTGG CCGATACGGA AAGCCTCATG GAGGATCTCT TCAGCTACCG GTTCCGCCAG GGCGAGGGCC TGGCCGGTCA CAGCCTGGGG AATCTCCTCC TGGCGGCCAT GACGGATATG GCCGGTGATT TTGACCGGGC CATCCAGGAA CTGGCCCGGG TCCTGGCGGT AGGGGGGCGG GTCATCCCCT CGACGACCAC CCATGTCGTC ATGGGTGCCG AACTGGCCGA TGGCAGCACC GTCCTGGGTG AAAGCAATAT CCCCCTGGCC GGCAAACCCA TTAAAAGGGT GTTTTTAAAA CCGGCTGACT GCCGGCCGCC GGCGGCGGCC CTGGAAGCCA TTGCCCGGGC CGACGCCGTG ATAATCGGCC CGGGGAGTCT GTATACCAGC GTCCTGCCAA ACCTGCTGGT GCCGGGTATT GTCGAGGCCC TGCGGGATAC CCCGGCACCG GTCTTTTATG TTTGCAACAT CATGACCCAG CCGGGAGAAA CGGACGGTTA CACGGTGGCC GACCACCTGC GGGCCCTCAT CGACCACTGC GGCCAGGGGA TAATAGATAC GGTAATCGCC CACAGCGGCC CCATTTCCCG GGCCGCCCGG CGGCGTTACG GCGAAAAGGG AGCCCGGCCG GTCCTGATTA ACAGCCCGGC AATCGCCAGG ATGGGGGTAG AGCTGCGCCG CGGCTGGCTG GTAGACGAGA CCCATGTCGT CCGCCATCAC CCCGAACGAT TGGCCAGCCT GGTCATGGAA GAGGTTTACC GGCACCAGGC CCGCGGCCGG CGGCGTTTTT TTTACCTGGT ACGGGAGAGA TTTCGCACCC TGGCCCGGTA G
|
Protein sequence | MDGLKWLYPG LKIKRWLLLA VLGLLLLVSG LTVILGITLL ASAEKGVTWF ILHTLGGLGS PLLAGLLAMA LGAVFIGVAV RNLARSVIQV LLPGHTANPW QVFYRRQYLA RGPHLVAIGG GTGLAVLLRG LKNYTRNLTA IVTVADDGGS SGRLRQELSI PPPGDIRNCL VALADTESLM EDLFSYRFRQ GEGLAGHSLG NLLLAAMTDM AGDFDRAIQE LARVLAVGGR VIPSTTTHVV MGAELADGST VLGESNIPLA GKPIKRVFLK PADCRPPAAA LEAIARADAV IIGPGSLYTS VLPNLLVPGI VEALRDTPAP VFYVCNIMTQ PGETDGYTVA DHLRALIDHC GQGIIDTVIA HSGPISRAAR RRYGEKGARP VLINSPAIAR MGVELRRGWL VDETHVVRHH PERLASLVME EVYRHQARGR RRFFYLVRER FRTLAR
|
| |