Gene Information |
Locus tag | Moth_2242 |
Symbol | |
ID | 3831288 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2342228 |
End bp | 2345674 |
Gene Length | 3447 bp |
Protein Length | 1148 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637830162 |
Product | hypothetical protein |
Protein accession | YP_431072 |
Protein GI | 83591063 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.297499 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAACC GGGATATTTT CCAGCGTGAC CCTGTGGTTT CAAAACTGCT TAATGACGGC GTGGCTACAG TTACTGAAGC GGCCACATCC AAAGAGATTG AAACCCTTCG CTATGAACTG GAGCATTTTG TTTGCGAGGG GCAGTATAAA GACGGACTTA TACGCATTCT TGAGTCTTAC TTGAGCAATG CGGATGCTAC AAGCCAGCCG GCGGTCTGGG TAAGCGGTTT TTTTGGCAGC GGTAAATCAC ATCTTCTGAA GATGCTCCGT CACTTATGGG TTAACACCAA ATTTGAATCG GATGGAGCGA CTGCCCGTGA ACTGGCAAGG CTACCTGTGG AAGTAAAAGA CCTGTTAAAA GAATTGGATA TATTAGGGAA AAGATGTGGC GGGCTGCATG CTGCGGGGGG AACGCTGCCA TCCGGGGGAA GGAAAAGTCT TCGTCTGGCA GTGCTCAGCA TTATTTTTCG CTCGAAAGGT CTGCCCGAAT CTCTTCCCCA AGCACAATTT TGCCTTTGGC TTCAAAAAAA TGGCATCTAT ACTCGGGTTA AAAAGGTTGT GGAAGATTCA GGCAAGGTTT TTCAACAGGA ACTCCGTCAT TTGTATGCCA GTCCGGTATT AGCCAGGGCC CTGTTGGAGG CTGACCCTGA TTTTGCCTCC GATCTCAAAC AGGTGCGGGC CACCATCCAG GCCCAGTTTC CTGACGTAGA CGACGTCTCG ACATCCGAAT TTATCCAGAT CATTTGTGAT GTCCTTTCTG TAAACAGGCA ACTTCCCTGC ACGGCGATTG TACTCGACGA AGTCCAGCTC TTTATAGGTG ACAGTACGGC TCGTTCCTAT GAAGTACAAG AAGTAGCGGA AGCCCTTTGC AAGCAGCTTG ATAGCCGGAT ACTGCTTATA GGGGCTGGCC AGACTGCTCT GAGCGGCAAT CTTCCGCTGC TCCAGCGCTT GAAAGATCGT TTTACCATAC CTGTGGAACT CTCTGATACG GATGTTGAAA CTGTAACCCG CCGGGTGGTG CTGGCTAAAA GGGCAGATAA GCGCAAGGCC ATCGAAGAAA TATTAACTGC TTACGCTGGT GAAATTGATC GACAGCTTGC CGGTACCCGC ATTGCACCCA GAAGTGAAGA CCGGGCGATT ATCGTCGAAG ACTATCCCCT TTTACCTGTC CGCCGCCGGT TCTGGGAGCA TGTCCTGCGA GCTGTTGATA TTCCTGGAAC CAGTAGCCAG CTGCGCACGC AGCTCCGTAT TGTTCATGAT GCTGTCCGGG AGATTGCTGA AAAACCCCTG GGTACAGTCG TGCCCGCTGA TTTCATATTT GACCAGCTAC AGCCGGACTT GCTTCGCACC GGGGTTCTTT TACGGGAAAT CGATGAAACC ATTCGCAATC TGGACGATGG GACTCCGGAA GGTTTATTGG CCAGGCGTAT TTGTGCCCTG GTATTCCTCA TCCGCAAACT CCCCCGTGAA GCCGTCGTTG ATATTGGTAT CCGTGCTACC GCGGAGACGC TGGCCGATCT TCTGGTAAGC GACCTGGCCG GCGATGGTCC CGCCCTGCGC AGGGAGATTC CCCGGGTTTT AGAGAAACTG GTTAAGGAAG GAAAGCTTAT CAAGGTGGAC GAGGAATACA GCCTTCAGAC CCGGGAGAGC AGCGAATGGG ACCGGGAATT TCGTAACCGG CAAACCCGGT TGAACAATGA TCTTGCGGCT TTGGCCAGTA AACGTTCGGC CCTGTTGAAT TCGGCCTGTA TGACCGCCCT TGGCAACATT AAACTCATCC AGGGAAAGGA CAAAGTGCCG 
CGTAAGTTGG CAATTCACTT TGGCAGCGAG CCGCCCGAAA CTAAAGGCCA CGAAATTCCA GTTTGGGTCC GCGACGGCTG GGGAGAAAAC GAGAATACCG TGGTGGCCGA TGCCCGGGCC GCCGGCAACG ATAGCCCGAT TATTTTCGTC TATATCCCCA AAGCAAATGC CGAGGACCTG CAAAGAACAA TTATTGAATA TGAGGCGGCT AAAGCGACCC TCGAATTCAA GGGAACCCCT ACAAATCCAG AGGGGCAGGA AGCCCGCGAC GCGATGTCCA CACGTATGAA GACGGCCGAG GCTACCCGGG ATGAGATTAT TAAAGAGGTT ATTAATGCCG CCAGGGTGTT CCAGGGTGGG GGCCAGGAGC GTTTTGAGCA TTCCATGAAA GAAAAAGTGG AGGCAGCCGC CGAAGCGTCC CTGGACCGTC TCTTTCCCCA CTTCCGGGAT GGTGATGATG ATCAATGGCC GGCTGTTATC AACCGGGCCA AGAGCGGCGA CGAAGCCGCC CTTCAGGCCA TAGGCTGGAA TGATGCTCCC GAAAAACACC CTGTCTGTGC GGCCATTCTT GCTGAAATCG GATCCGGTAA GACAGGCAAG GAAATCAGGG ATATCTTTAT GAAAACACCT TATGGTTGGG GGCAGGATGC TGTCGACGCC GCTTTGATTA CGCTTTTTGC CACCGGGCAT CTTCGCGCTG TATATAAAGG GGTCCAGCTT GACCGTGGCC AATTAGATCA AGCTAAAATC CCGGCTACCG ACTTCCGGGT GGAAACGGTA ACCATAGATG TCCATAGCCG CATGAAGCTG CGCAAGCTTT TTCAAACAGC TGGCTTTAAC TGCAAGGCGG GGGAGGAATC TTCCGTAGCC GGGCAATTTC TTGCCAAGCT TATGGACCTG GCCGATCGTG CCGGCGGCGA TCCACCAATG CCTGAGCGCC CATCTAAGAC CCATTTGGAA ACAATGCGGG GACTAGCTGG TAACGAACAA CTTGCAGCGA TACTGGAACA GTTTGACACC CTGGCCCAAC AACTGCAGAA TTGGTCAGCA TTGGCGGACT TGGCGGCAAA GCGCAGACCT GCGTGGGATA GACTTCAGAT CCTGTTGAGA CACGCCCATG GCCTTCCTGA AGCGGAAGAT CTACAGAGTC AGGCCCATGC TGTGCGCGAC GAACGGCGCC TGTTAGCAGA GCCCGATCCG GTTCCGGACA TTTACCAGGC AGTCGCCAGG GTGCTTCGTA CCGCTGTCAG GCAGGCTTAC GCCAACTTTG AAATGGTATA TAACCGGGAG AAAGCGGCAC TGGAGGCAAA TGCGAACTGG CAGAAACTAT CTCCGGAACA GCAGCAAAAA ATTCTTACGG CTGAAGGGAT TGCCAGCGTT CCCCGGCTTT CCATAGAAGA TGACGACGCC TTGCTTCAAA GCCTCCAGGA AACCCCTCTG TCAGGCTGGA AAACCAGGAC AGATGCCCTG CCCCAGCAGT TCAGTAATGC GGCCATGGCA GCTGCGAAGC TACTGGAGCC AAAAACGCAG AAGGTAAAAC TGACCAGCAG CACCTTGAGA ACGAAAGACG AAGTCAGGGC CTGGGTAGCC GGAGTTGAGA GAGAGCTTTT AGAAAGGATT AAAAACGGCC CCGTCGTGAT CTCGTAG
|
Protein sequence | MKNRDIFQRD PVVSKLLNDG VATVTEAATS KEIETLRYEL EHFVCEGQYK DGLIRILESY LSNADATSQP AVWVSGFFGS GKSHLLKMLR HLWVNTKFES DGATARELAR LPVEVKDLLK ELDILGKRCG GLHAAGGTLP SGGRKSLRLA VLSIIFRSKG LPESLPQAQF CLWLQKNGIY TRVKKVVEDS GKVFQQELRH LYASPVLARA LLEADPDFAS DLKQVRATIQ AQFPDVDDVS TSEFIQIICD VLSVNRQLPC TAIVLDEVQL FIGDSTARSY EVQEVAEALC KQLDSRILLI GAGQTALSGN LPLLQRLKDR FTIPVELSDT DVETVTRRVV LAKRADKRKA IEEILTAYAG EIDRQLAGTR IAPRSEDRAI IVEDYPLLPV RRRFWEHVLR AVDIPGTSSQ LRTQLRIVHD AVREIAEKPL GTVVPADFIF DQLQPDLLRT GVLLREIDET IRNLDDGTPE GLLARRICAL VFLIRKLPRE AVVDIGIRAT AETLADLLVS DLAGDGPALR REIPRVLEKL VKEGKLIKVD EEYSLQTRES SEWDREFRNR QTRLNNDLAA LASKRSALLN SACMTALGNI KLIQGKDKVP RKLAIHFGSE PPETKGHEIP VWVRDGWGEN ENTVVADARA AGNDSPIIFV YIPKANAEDL QRTIIEYEAA KATLEFKGTP TNPEGQEARD AMSTRMKTAE ATRDEIIKEV INAARVFQGG GQERFEHSMK EKVEAAAEAS LDRLFPHFRD GDDDQWPAVI NRAKSGDEAA LQAIGWNDAP EKHPVCAAIL AEIGSGKTGK EIRDIFMKTP YGWGQDAVDA ALITLFATGH LRAVYKGVQL DRGQLDQAKI PATDFRVETV TIDVHSRMKL RKLFQTAGFN CKAGEESSVA GQFLAKLMDL ADRAGGDPPM PERPSKTHLE TMRGLAGNEQ LAAILEQFDT LAQQLQNWSA LADLAAKRRP AWDRLQILLR HAHGLPEAED LQSQAHAVRD ERRLLAEPDP VPDIYQAVAR VLRTAVRQAY ANFEMVYNRE KAALEANANW QKLSPEQQQK ILTAEGIASV PRLSIEDDDA LLQSLQETPL SGWKTRTDAL PQQFSNAAMA AAKLLEPKTQ KVKLTSSTLR TKDEVRAWVA GVERELLERI KNGPVVIS
|