Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0626 |
Symbol | |
ID | 3832522 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 650478 |
End bp | 651590 |
Gene Length | 1113 bp |
Protein Length | 370 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637828569 |
Product | hypothetical protein |
Protein accession | YP_429499 |
Protein GI | 83589490 |
COG category | [S] Function unknown |
COG ID | [COG3323] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR00486] dinuclear metal center protein, YbgI/SA1388 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.743702 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGCAA AGTGCGGCGA GATCATAGCC ATTATGGAAG CCCTGGCCCC GCCGGAACTG GCTGCCGGCT GGGACAACGT CGGCCTCATG CTCGGCTCGC CTGAGGCGGA GGTCAGACGC GTCCTGGTAT GCCTGGACGT AACCCCGTCG GTAGCCGCCG AGGCTGCCGC CCGGGCCGTT AACCTGATTA TCAGCCACCA CCCCCTCTTC TTCCGGCCGG TGAAGAACCT GCGCTTTGAC GAGCCCGTGG GAGAACTGGT GCGGCGCCTC CTCCAGGATA ACATCATGGT CTACTCGGCC CACACCAATA TGGATAGCGC CGACCTGGGG GTCAGCTACC ACCTGGCCTC CAGGCTGGAG CTGGAGGACA TCCGGGTCCT GGTCCCCACC CACCGTGAGA AGTATTACAA GCTCGTCACC TTCGTCCCCG AAGACCACGA AAAGGTCGTT CGCGAAGCCC TCACCCGGGC CGGAGCCGGC TGGATCGGCA ACTACTCCGA CTGCACCTTC CGGGTGGCCG GTACCGGCAC CTTCATGCCC CTGGCCGGCA CCCGTCCCTA TACCGGTGAA GAGGGCAAAC TGGCGGAGGT CAAAGAGTAC CGCCTGGAGA CCATCATCCC CACCGGCCGG CTGCCGGAGG TCCTGCGGGC CCTGCTGAAA GCCCACCCCT ACGAGGAAGT GGCCTATGAC GTGTACCCCC TGGCCAACGA AGGACCGGCC CAGGGCATCG GCCGCACCGG CGTGCTGCCC CAGGCCGTCA CCCTGGAGGA ATTCGCCCTG CGGGTGAAGG AGTCCCTGGG GGCCGGCCGG GTCAACCTGG TGGGCGACCG GGAGCGTAAG GTCAAAAGGG TGGCCGTCTG CGGCGGCGCC GGCAGCGACG TTATGGCCGC CGCCCGGGAT GCGGGGGCGG AAGTCCTGGT CACCGGGGAC CTCAAGTACC ACGAAGCCCG CACGGCCCAG GCCATGGGCC TGGCCGTCGT CGACGCCGGC CATTTCGCCA CCGAAAGGCT GATTGTCCCG GCCCTGGTGA CCTATCTCCA GGAACAGTTG CAGGAGCGCG AGGTGATGGT CCTGGCCTCC CAACAGGAAC AGGAACCCTG GTACGCATTA TAA
|
Protein sequence | MAAKCGEIIA IMEALAPPEL AAGWDNVGLM LGSPEAEVRR VLVCLDVTPS VAAEAAARAV NLIISHHPLF FRPVKNLRFD EPVGELVRRL LQDNIMVYSA HTNMDSADLG VSYHLASRLE LEDIRVLVPT HREKYYKLVT FVPEDHEKVV REALTRAGAG WIGNYSDCTF RVAGTGTFMP LAGTRPYTGE EGKLAEVKEY RLETIIPTGR LPEVLRALLK AHPYEEVAYD VYPLANEGPA QGIGRTGVLP QAVTLEEFAL RVKESLGAGR VNLVGDRERK VKRVAVCGGA GSDVMAAARD AGAEVLVTGD LKYHEARTAQ AMGLAVVDAG HFATERLIVP ALVTYLQEQL QEREVMVLAS QQEQEPWYAL
|
| |