Gene Information |
Locus tag | Moth_0409 |
Symbol | |
ID | 3832091 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 413769 |
End bp | 414779 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637828346 |
Product | hypothetical protein |
Protein accession | YP_429286 |
Protein GI | 83589277 |
COG category | [S] Function unknown |
COG ID | [COG5464] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01784] conserved hypothetical protein (putative transposase or invertase) |
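The coordinate and length fields above agree with each other, as a short check shows. This is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the CDS length counts the terminal stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 413769
end_bp = 414779

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes the stop codon, which is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1011 336
```

Both values match the listed Gene Length (1011 bp) and Protein Length (336 aa).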
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.650641 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00708611 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGCCGGAAA ACGCAGGGCA GCTCCGGCCT CCCTACCACC CCCACGACAA GGGTTACAGG CAGCTTCTTT CCGACAAGAG AGTTTTCCTG GAATTGCTGA AAACCTTCGT CCGGGAAACC TGGGTAGAGG CTATCGACGA AAAAGATCTC ATCCTGGTGA ACAAATCCTA CGTCCTCCAG GATTTCAGCG AGAAAGAAGC CGATATCGTT TACCGGCTTA AGACGAAAGA TAGAAACGTC ATCTTTTACG TCCTGCTGGA ACTGCAGTCA ACGGTAGACT ACCTGATACC CTTCCGGCTG CTGCTCTATA TGGTCGAGAT CTGGCGGGAA ATCTACACCA ACACCCCGCA GCACGAGCGG GAGAGCAAGC ATTTCCGCCT GCCGCCCATC ATCCCGGCGG TGCTCTACAA CGGGGCCGGA TCCTGGACGG CGGCGCTCTC CTTCAAAGAA ATGCTGGACA GTTACCAGGA TTTCAGCGGG CATCTCCTGG ACTTTCATTA CCTGCTGTTT GATGTCAACC GTTACAGCGA AGAAGAGCTG ATCAGAGCGG CAAACCTGAT CGCCGGTGTC TTTCTCCTGG ACCAAAAGAT GCGGCCGGAA GAGCTGGTTG GACGGCTGCA GAAACTGGCG GGGGTCTTAA GGCGGCTCAC GCCTGACGAG TTCCGCCATA TCACCAGCTG GCTGAAGAAC GTCGTCAAGC CCAGAATGCC CGAAGATTTT AGGAAAAAGG TTGACCGCAT CCTGGACGCA AGCAATCCGT GGGAGGTGGA ACGGATGATT TATAACCTGG AATTAACCCT GGAAGAGATG CAACAGCAGG CTTTGTTAAA AGGCTTAAAA GAAGGCGAAC AGAAGGGGAA ATTGGAAGGG AAATTGGAAG GAAAATTGGA GGGCAAACGA GAAGTAGCTC GAAACCTGCT ACTGCTCAAC GTCGATATAG AGACCATTGT TAAAGCTACG GGGCTTACTC CGGACGAGAT CGCCGTGTTG AAGAAACAGC TGGAACAGTG A
|
Protein sequence | MPENAGQLRP PYHPHDKGYR QLLSDKRVFL ELLKTFVRET WVEAIDEKDL ILVNKSYVLQ DFSEKEADIV YRLKTKDRNV IFYVLLELQS TVDYLIPFRL LLYMVEIWRE IYTNTPQHER ESKHFRLPPI IPAVLYNGAG SWTAALSFKE MLDSYQDFSG HLLDFHYLLF DVNRYSEEEL IRAANLIAGV FLLDQKMRPE ELVGRLQKLA GVLRRLTPDE FRHITSWLKN VVKPRMPEDF RKKVDRILDA SNPWEVERMI YNLELTLEEM QQQALLKGLK EGEQKGKLEG KLEGKLEGKR EVARNLLLLN VDIETIVKAT GLTPDEIAVL KKQLEQ
|
| |
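The gene sequence begins with TTG while the protein sequence begins with M. This is consistent with translation table 11, where TTG is an accepted initiator codon and an initiator codon is read as methionine. The sketch below illustrates this on the first 30 bases of the gene. It is an illustration only, not the pipeline that produced this record. The function name `translate_cds` and the constants are hypothetical. The codon table is the standard one, which table 11 shares for amino acid assignments.

```python
# Sketch: translate a CDS using NCBI translation table 11 conventions.
BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)
# Codons that table 11 accepts as initiators.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            # An initiator codon is translated as Met regardless of its usual meaning.
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate_cds("TTGCCGGAAA ACGCAGGGCA GCTCCGGCCT"))  # MPENAGQLRP
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to the same function should reproduce the whole protein, assuming the sequence is given 5' to 3' on the coding strand as shown.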