Gene Information |
Locus tag | Moth_1452 |
Symbol | |
ID | 3831338 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1495326 |
End bp | 1496564 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637829385 |
Product | hypothetical protein |
Protein accession | YP_430305 |
Protein GI | 83590296 |
COG category | [C] Energy production and conversion |
COG ID | [COG0247] Fe-S oxidoreductase |
TIGRFAM ID | |
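The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1495326, 1496564)
print(length)                  # 1239
print(protein_length(length))  # 412
```

Both values match the Gene Length (1239 bp) and Protein Length (412 aa) rows of the record.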
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.275657 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTTTTT CACCCGCCAT TGAAGAGAGT CTTTTTCCAC CCCTTTATCA AAATTTACAT GAAAAAATAA CCGGTGCCCT GGGAGAAGCT ACCCTACGAT CTTGCCTTAC CTGCGGCGTC TGCAGCGGCG GCTGTCCCAC CGGCGATATA GGCGCACCTG TTGATCCCCG TAAAATCGTC CGCCTGCTCC TATGGGGGAT GGAGGACAAA GTCCTGGCAT CGGACATGAT CTGGCTCTGC ACCATGTGTG GCCGCTGTAC GGTTTACTGC CCGGTAGGTG TGAATATGGG CGACCTGGTC CGGGCCCTGC GCAGCCACCT GGCGGAGGAA GGCCGGGTCC CCGAAAATTT ACAAAAAGTT GTTGATCTAG CAGTTACTTC TGGTAATAAT ATGGGGATCA GCCGGGAGGA TTATCTCGAT ACCCTGGACT GGATGCAAGA AGAACTCCAG GCTGAGTTCG GCCCCCGGGC GGAAATACCG GTAGATAAAA AAGGCGCCAG GGTAATGTAC GTTATCAACC CGCGGGAAGC AAAGTTTTTC CCTCTATCCA TCCTGGCGGC CGCCAAGGTG TTTTATGCCG CCAGCGAGAG CTGGACCCTC TCCAGCCGCT CCTGGGACGC GACCAACTAC GCCCTTTTCT CCGGGGACGA CAAAGCCGGC GCTATCCTGG TGCAGCGCCT GGCGGATGAG GTGGAGCGCC TGGGCTGCCA GGAGTTGATC ATGACCGAGT GCGGCCATGC CTTCCGCGCC ATCCGCTGGG GGCCCGAACG CTGGCTGGGG CATAAACTCC CCTTCCCCGT ACGCAGTATT GTCCAGCTAA TGGCCGAATA CCTGGATGCA GGCCGTATCC GCCTGGACCC CTCCCGCAAC AGCGAGCCGG TAACCTATCA TGACCCCTGC AATCTAGGCC GCAAGGAAGG TATCTTTGAA GAACCGCGGC GGGTACTGCA GGCAGCGGTC ACCGATTTTC GCGAAATGAC GCCGAACCGT GAGAATAACT ACTGCTGCGG CGGCGGTGGC GGCATGCTCT CTTTGAGCGA GTTCGGCCAG GAACGTCTGG CCAAAGGCAA GGTTAAAATA GAGCAAATTC AGCGCACCGG GGCCGGGATA GTGGCTACTC CCTGCCACAA CTGTGTTGAT CAATTAAATG ACCTTTGCCG TCATTATCAT CTCAATGTTA AAGTTAAGAA CCTGGTCGAA TTGGTAGCCG ATGCCCTGGT AATCGCTGGT AAGGAGTGA
|
Protein sequence | MPFSPAIEES LFPPLYQNLH EKITGALGEA TLRSCLTCGV CSGGCPTGDI GAPVDPRKIV RLLLWGMEDK VLASDMIWLC TMCGRCTVYC PVGVNMGDLV RALRSHLAEE GRVPENLQKV VDLAVTSGNN MGISREDYLD TLDWMQEELQ AEFGPRAEIP VDKKGARVMY VINPREAKFF PLSILAAAKV FYAASESWTL SSRSWDATNY ALFSGDDKAG AILVQRLADE VERLGCQELI MTECGHAFRA IRWGPERWLG HKLPFPVRSI VQLMAEYLDA GRIRLDPSRN SEPVTYHDPC NLGRKEGIFE EPRRVLQAAV TDFREMTPNR ENNYCCGGGG GMLSLSEFGQ ERLAKGKVKI EQIQRTGAGI VATPCHNCVD QLNDLCRHYH LNVKVKNLVE LVADALVIAG KE
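The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched as follows. This is a minimal illustration rather than the pipeline that produced the record: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard one, whose amino acid assignments translation table 11 shares (table 11 differs in its permitted start codons). The sequences in this record are printed in space-separated blocks of 10, so whitespace is stripped first.

```python
# Standard codon table in NCBI order (bases T, C, A, G at each position).
# Translation table 11 uses these same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def clean(seq: str) -> str:
    """Remove the block-of-10 spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        residue = AMINO_ACIDS[index]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
first_blocks = "ATGCCTTTTT CACCCGCCAT TGAAGAGAGT"
print(translate(first_blocks))  # MPFSPAIEES
```

The translated prefix agrees with the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` should allow the GC content and protein sequence fields to be checked in the same way.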
|
| |