Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2241 |
Symbol | |
ID | 3831287 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2340019 |
End bp | 2342214 |
Gene Length | 2196 bp |
Protein Length | 731 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637830161 |
Product | hypothetical protein |
Protein accession | YP_431071 |
Protein GI | 83591062 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 45 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0974985 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGATGC CGGTACTATC AGTAGAACAG CGCAATAAAC TTGAACGTAC GGTTGTCGAG GCCCGGGATG TTGCCGAAGC CGGGGCCAAG GCGGCCCTGG AGGCCCTGGC CGTGCACCAT CACGAGCCTT ACAGCCATAT GACCCCAGAA CAGCGCCGGC TTCGCAATCA CCTCCGGGCC CGGGCGCGCC AGCTGGGGGA CAGGCAGGAC CAAAGGGGAA AAATGGAAAT CACCCACCTG ATCCAGGAGT GGGCTTACGA GCACTGGCAC CGCATGCTCT TTGCCCGTTT CCTGGCGGAA AACGACCTCC TGATCGAGCC GGAGATGGGG GTGGCTGTCA GCCTTGAGGA GTGTGGGGAA CTGGCCCAGG AGGAAGGTAC CGATCTCTGG ACCCTGGCCA GCCGTTTTGC CCAGCAGATG TTGCCGCAGA TCTTTCGCCC CGATGACCCG GTGTTGCAGG TCACCTTTGC GCGCGAATAT CAACTGAAGC TGGAGCAGTT GCTCAACGAC CTGGAGCCGG GTATCTTTAA AGCCAGCGAT GCCCTGGGAT GGGCCTACCA GTTCTGGCAG AGCAAGCGCA AGAAACAGGT GAACGAATCG GGCAATAAAA TCGGCGCCGA TGAATTGCCG GCTGTTACCC AGCTCTTTAC AGAGCCTTAT ATGGTGAATT TCCTCATTCA CAACACCATT GGCGCCTGGT ACGCGGGCAA GGTGCTGGCG GAGAACCCGC AGCTTGCCGG GCAGGCAGAA AGTGAAGAAG AGCTTCGCAG GGCGGTTGCG TTGCCCGGTG TGACTTGGGA TTACCTACGC TTCGCCCGTA GCGGCGATGG GGAGGGGCCG TGGCGTCCCG CTGCCGGGAC CTTCGAGGGC TGGCCGCAGC GGGCAGCGGA GCTTAAAATC CTGGACCCCT GCTGTGGTTC GGGCCATTTC CTGGTAGCGA CATTTTATCA TTTAGTTCCG ATTCGAATGG CCGAGGAAGG CCTTACGGCC CGGGAGGCAT GTGACGCCGT CCTGAGTGAT AACCTCCACG GCCTGGAGAT CGATGAGCGC TGCACGCAGA TCGCCGCCTT TGCCCTGGCG CTGGCGGCGT GGACCTACCC CGGCGCCGGT GGCTACCGCC CCTTGCCCGG GTTGCGAATA GCCTGTTCCG GCATCGCTCC CAATACGAAA AAGGAAAACT GGCTGGCCCT GGCGGGGGAT GACGAGCGCC TCCGTAACGG CATGGCCCGG CTTTATGACC TTTTCCGGGA AGCGCCTATT CTAGGAAGCC TTATCGATCC TGGTTCCGCT CTAGAAAACA ATCTGATAGA AGCCGGGTTT GATGAACTCC GTCCCTTGCT GGAAAAAGTA ATGTCCGCGG AGAAAGACGA TTACGAGCAG CACGAGTTAG GGGTTGCCGC CTGTGGTATC GCTGATGCAG TTCAAATTCT TAGTGGTCGT TATCATCTAG TGATTACCAA CGTGCCGTAT CTTGCCCGCG GAAAACAGGG AGCCGGACTC AAGAATTACC TTGATACCCA TCATGCCCGT TCGAAACAAG ACCTGGCCAC TGCCTTTATC GAGCGCAATC TAATGTTGTG CCTAGAAGGT GGTTCTACTG CTTTAGTTAC TCCTCAGCAG TGGCTGTTTC AAACTGGGTA TTCGAAGATT CGTAAGAAGC TGCTTAGAGA TTTACGATGG GAACTCTTAG CTATACTTGG TGAACACGCA TTCCGAAGCT CTGAAGCTGC AGGGGCATTT CCTGCAATAT ATGTTTGTTC TAAATTGCTT CCGAAAGACA ATCATCTTTT CTTTAGTATT AACGTTTCAG AGAGGTCTTC TGCCTGTGAC AAAGACTTAT ATCTCCAACA CGCTTCCTTA AACAAAATGC AACAGATAAG TCAACTTAAA AACCCGGAAC ACATTATCAT TACAAACACG TTTTCTGGGT TTAGAGTCTT TGGTGATTAT GCCGATTGTT ACCAAGGCAT ATCAACAGGC GATAATCCTA GATTTTGTAT AAAGTTTTGG GAACTCCCAT CTATTTTACC AGGTTGGGAA AGGTTTCAAA CACCACCAGA GGAAACCCAG TACTATGGTG GGCGAGAGCA TCTAATACGT TGGGAAGAAC GTGCTAGTAT TCAAGAGCTA GGTGCTATTC GAGGGAAAAA TGCTTGGGGA AAATGGGGCA TTGTCATAGG TGTTGATTGT CAATAG
|
Protein sequence | MPMPVLSVEQ RNKLERTVVE ARDVAEAGAK AALEALAVHH HEPYSHMTPE QRRLRNHLRA RARQLGDRQD QRGKMEITHL IQEWAYEHWH RMLFARFLAE NDLLIEPEMG VAVSLEECGE LAQEEGTDLW TLASRFAQQM LPQIFRPDDP VLQVTFAREY QLKLEQLLND LEPGIFKASD ALGWAYQFWQ SKRKKQVNES GNKIGADELP AVTQLFTEPY MVNFLIHNTI GAWYAGKVLA ENPQLAGQAE SEEELRRAVA LPGVTWDYLR FARSGDGEGP WRPAAGTFEG WPQRAAELKI LDPCCGSGHF LVATFYHLVP IRMAEEGLTA REACDAVLSD NLHGLEIDER CTQIAAFALA LAAWTYPGAG GYRPLPGLRI ACSGIAPNTK KENWLALAGD DERLRNGMAR LYDLFREAPI LGSLIDPGSA LENNLIEAGF DELRPLLEKV MSAEKDDYEQ HELGVAACGI ADAVQILSGR YHLVITNVPY LARGKQGAGL KNYLDTHHAR SKQDLATAFI ERNLMLCLEG GSTALVTPQQ WLFQTGYSKI RKKLLRDLRW ELLAILGEHA FRSSEAAGAF PAIYVCSKLL PKDNHLFFSI NVSERSSACD KDLYLQHASL NKMQQISQLK NPEHIIITNT FSGFRVFGDY ADCYQGISTG DNPRFCIKFW ELPSILPGWE RFQTPPEETQ YYGGREHLIR WEERASIQEL GAIRGKNAWG KWGIVIGVDC Q
|
| |