Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2153 |
Symbol | |
ID | 3833002 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 2255942 |
End bp | 2256940 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637830075 |
Product | hypothetical protein |
Protein accession | YP_430985 |
Protein GI | 83590976 |
COG category | [S] Function unknown |
COG ID | [COG5464] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01784] conserved hypothetical protein (putative transposase or invertase) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 64 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.101261 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCCGGAAA ACGCAGGGCA GCTCCGGCCT CCCCACCACC CCCACGACAA GGGTTACAGG CAGCTTCTTT CCGACAAGAG AGTTTTCCTG GAATTGCTGA AAACCTTCGT CCGGGAAACC TGGGTAGAGG CTATCGACGA AAAAGATCTC ATCCTGGTGA ACAAATCCTA CGTCCTCCAG GATTTCAGCG AGAAAGAAGC CGATATCGTT TACCGGCTTA AGACGAAAGA TAGAAACGTC ATCTTTTACG TCCTGCTGGA GCTGCAGTCA ACGGTAGACT ACCTGATACC CTTCCGGCTG CTGCTCTATA TGGTCGAGAT CTGGCGGGAA ATCTACACCA ACACCCCGCA GCACGAGCGG GAGAGCAAGC ATTTCCGCCT GCCGCCCATC ATCCCGGCGG TGCTCTACAA CGGGGCCGGA TCCTGGACGG CGGCGCTCTC CTTCAAAGAA ATGCTGGACA GTTACCAGGA TTTCAGCGGG CATCTCCTGG ACTTTCATTA CCTGCTGTTT GATGTCAACC GTTACAGCGA AGAAGAGCTG ATCAGAGCGG CAAACCTGAT CGCCGGTGTC TTTCTCCTGG ACCAAAAGAT GCGGCCGGAA GAGCTGGTTG GACGGCTGCA GAAACTGGCG GGGGTCTTAA GGCGGCTCAC GCCTGACGAG TTCCGCCATA TCACCAGCTG GCTGAAGAAC GTCGTCAAGC CCAGAATGCC TGAAGGTTTT AAGGAAAAGG TTGACCGCAT CCTGGACGCA AGCAATCCCT GGGAGGTGGA ACGGATGATT TACAATCTGG AATTAACCCT GGAAGAGATG CAACAGCAGG CTTTGTTAAA AGGCTTAAAA GAAGGCGAAC AGAAGGGGAA ATTGGAAGGG AAATTGGAGG GCAAACGAGA AGTAGCCCGG AACCTGCTAC TGCTCAACGT CGATATAGAG ACCATTGTTA AAGCTACGGG GCTTACTCCG GACGAGATCG CCGTGTTAAA GAAACAGCTG GAACAGTGA
|
Protein sequence | MPENAGQLRP PHHPHDKGYR QLLSDKRVFL ELLKTFVRET WVEAIDEKDL ILVNKSYVLQ DFSEKEADIV YRLKTKDRNV IFYVLLELQS TVDYLIPFRL LLYMVEIWRE IYTNTPQHER ESKHFRLPPI IPAVLYNGAG SWTAALSFKE MLDSYQDFSG HLLDFHYLLF DVNRYSEEEL IRAANLIAGV FLLDQKMRPE ELVGRLQKLA GVLRRLTPDE FRHITSWLKN VVKPRMPEGF KEKVDRILDA SNPWEVERMI YNLELTLEEM QQQALLKGLK EGEQKGKLEG KLEGKREVAR NLLLLNVDIE TIVKATGLTP DEIAVLKKQL EQ
|
| |