Gene Information |
Locus tag | Moth_0697 |
Symbol | |
ID | 3832698 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 726414 |
End bp | 728000 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637828629 |
Product | AIR synthase related protein-like |
Protein accession | YP_429559 |
Protein GI | 83589550 |
COG category | [O] Posttranslational modification, protein turnover, chaperones; [S] Function unknown |
COG ID | [COG0309] Hydrogenase maturation factor; [COG1992] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01379] thiamine-monophosphate kinase |
|
|
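The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 726414
end_bp = 728000

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # prints: 1587 528
```

Under these assumptions the result agrees with the reported Gene Length (1587 bp) and Protein Length (528 aa).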
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00000561591 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000000203748 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATCGGCA AAGTAGATGA CGCTTTCTTC CGGCAGGCTA TCCTGCCCCA TACGGGAGCA GGGGATCCTG AGGTGGTAGT CGGGCCGCGC ATGGGAGTGG ACGCGGCGGT ACTTAAAATA GGGGAGGAGT ACCTGGCCGT CGCAGAGGAC CCCATTTTTC CTGGACCGAC GACTTCCCCC GATGACTTCG GCTGGATCAC CGTCCATATC GGCGCCAGCG ACGTCGCCGT CATGGGTATC AAGCCCCGTT TTATGACCTA TTCCCTCTTG CTGCCGCCGG GGACGCCGGA GGACTACATC GCCGGGCTGG TCCGCAGCAT CAGCACCTAT GCCCGGGAGC TGGGCATTAC TATCGTCGGC GGTCATACCG GTTTCTACGG GGCCGTGACC ATACCTACCA TTGGCGGAAT TACCGTCTGG GGCCGGGGTC GGGAAGTAGT CACCCCCGCT GGGGCCCGGG TCGGCGACGC CGTAATTATT ACTAAAGGGG CGGCCATTGA AGCGGCAGCC CTGGTGGCCT GCGAGCTGGG TGAGAAGCTC CTGGCTGCCG GGATCTCCCC GGACCTGGTG GCAAGGGCTA AAAAGCGGTT GCGGGAGATG TCCGTGGTGG CTGAAGCCGG TATTGCCGTA GAAGTCGGCG GTGTACATGC CATGCACGAT GCCACGGAAG GGGGCCTGGC GCGGGGCCTC TGGGAGGTGG CCGAAGCTTC CGGTGTGGGT TTAAGGATCG AACGCGCCCG GGTACCGGTA CCTGCCGATA TCCGAGCAGT TTGTGACTAT ATTGGCCTTA ACCCTTACGA AGTAATCAGC GAGGGCACCC TGGTGCTCAC CTGCGCGCCG GAAAAGGCTG ACGCCATGCT GGCGGCCTTT AAAGAAGCCG GCATCGAAGC GGCGGTTATC GGCCGGGTAG TACCAGCGGG CGCAGGCCGC GCCTGGCTGG AGGATGACGG CCGGGAAGAG CAGCTCCTGC CGCCGGCGGT GGACCGCTTC TGGGAGGTCT TTTTTAACGC CCTGGCCCTA AAAAACGATA CCCGTACTCC GGCGGAAGTG GCCCTGTGCC GGGAACTGGG ACAGGCCGTC AGGGAGCTCG AGGAAGCTAA CGTTGCCGCC CTCATCCCCG AGATCGGCGC CAACCTGGCC TATTGCTTGC CGGAGGCAAA AGAACTCCGG GACATCGCTG CTATACCCGG CCGCCTGCTG CGTTTTAAGG GGCGAGTGGC AACCCTGGGT GAGCCGGAGA TGGGCTGTTC CCACCACATG GGCGGCACCA TCCTGGTGGT GCGGGAGTTC TTCCCGCAGG CACGCTGCGT CATCAACCTC CGCAACAACG CCCGGGTGCG TCAGGCCTGC GCCGATCTGG GTTATAAGGT TGTCAGCATG CCCGTGCCCC CGGACTACCG CCAGACGGAC GATGATTTCT ATACCGACCT GCGCCGGACC ATGGCGGCCT GCCGGGAACT TCCTGACGTA ATTGAAATAC CCGATCGCAT CAACCTGGAG CGCCTCATCC TGGTCCTGGG CCGGAACCCC GGTGAAATCG TCAGCAAGGT AACCTCCCTG GCCACCAGGG TGGCGGAATT GGAGTAG
|
Protein sequence | MIGKVDDAFF RQAILPHTGA GDPEVVVGPR MGVDAAVLKI GEEYLAVAED PIFPGPTTSP DDFGWITVHI GASDVAVMGI KPRFMTYSLL LPPGTPEDYI AGLVRSISTY ARELGITIVG GHTGFYGAVT IPTIGGITVW GRGREVVTPA GARVGDAVII TKGAAIEAAA LVACELGEKL LAAGISPDLV ARAKKRLREM SVVAEAGIAV EVGGVHAMHD ATEGGLARGL WEVAEASGVG LRIERARVPV PADIRAVCDY IGLNPYEVIS EGTLVLTCAP EKADAMLAAF KEAGIEAAVI GRVVPAGAGR AWLEDDGREE QLLPPAVDRF WEVFFNALAL KNDTRTPAEV ALCRELGQAV RELEEANVAA LIPEIGANLA YCLPEAKELR DIAAIPGRLL RFKGRVATLG EPEMGCSHHM GGTILVVREF FPQARCVINL RNNARVRQAC ADLGYKVVSM PVPPDYRQTD DDFYTDLRRT MAACRELPDV IEIPDRINLE RLILVLGRNP GEIVSKVTSL ATRVAELE
|
| |
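The relationship between the gene sequence and the protein sequence can be spot-checked by translating short in-frame excerpts. The following is a minimal sketch, not the database's own method: the helper names `translate` and `gc_fraction` are hypothetical, and the codon table is built from the standard amino-acid assignments, which translation table 11 shares for elongation codons. Only the first 30 bases and the last 15 bases of the gene are used here, so the GC value computed for an excerpt is not expected to match the whole-gene figure of 63%.

```python
from itertools import product

# Standard amino-acid assignments in TCAG codon order; translation
# table 11 uses the same assignments for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Drop the spaces used for 10-base grouping and upper-case the bases."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = clean(dna)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return sum(base in "GC" for base in seq) / len(seq)


# Excerpts copied from the Gene sequence field (first 30 and last 15 bases).
gene_start = "ATGATCGGCA AAGTAGATGA CGCTTTCTTC"
gene_end = "GCGGAATT GGAGTAG"

print(translate(gene_start))  # prints: MIGKVDDAFF
print(translate(gene_end))    # prints: AELE
```

Both results match the first ten and last four residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the reported GC content.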