Gene Information |
Locus tag | Moth_1372 |
Symbol | |
ID | 3832295 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1417828 |
End bp | 1418805 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637829308 |
Product | hypothetical protein |
Protein accession | YP_430228 |
Protein GI | 83590219 |
COG category | [S] Function unknown |
COG ID | [COG3580] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
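
The length fields above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS length counts the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length, assuming the CDS ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(1417828, 1418805)
print(length)                     # 978
print(protein_length_aa(length))  # 325
```

Both results agree with the Gene Length (978 bp) and Protein Length (325 aa) fields.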
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAGTGA AGGTTAAGCA GTGGCGATTG GGGGTGCCTC GCGCCCTCTT TTATTATTAC TACGCCCCCT GGTGGGAAGC CTTCCTCCAG GCCCTGGGCG CCCAGGTGGT AGTTTCTCCG CCCACCACCA GGGAGATAAT GGACCTGGGG ATAAGCCTGG CCGTTGCCGA AGCCTGCCTG CCGGTGAAGG TTTATTACGG CCATGCCGCC TGGCTGGCGC CACGGGTGGA GGCCCTCTTT GTCCCCAGGC TGGTCAGCGT GGAGCAAAAG AGCTTTATCT GCCCCAAACT CATGGGTCTG CCCGATATGT TGCGGGCAGC TATTAAAGAC TGCCCGCCGG TGATCGACGT CACGGTCGAC ATGTCGCGGC GGCCGGAGGA AGGCCTCAAG GCCGCCATCC GGGATACGGC CCGGGCTATC GGTTCCAGGG GGAGGGAAGT TTACCGGGCC GGGGAGATAG CCCGGGCCAG GTACCGGGAT TGGCTGGCTG GGCAGCAGGA GCAAGGCAGG GAATTCTCTC CTGGGAAGAA GGAAGGCATA ACCCCCGGGG GTCAGCCTCC CCTGGCGGTG GGGCTGGTTG GCCATAACTA CCTGCTCCAC GATCGCTATC TGGGCATGGA TATTGCCGGT AAAGTTACGC GCCTTGGTGG CAGGGTGATC TTACCGGAAA ATTTTGCACC TGAACTGGGG GAAGCCGCCT GCCGCCGGTT GCCCAAGCGC CTTTACTGGA CCCTTGGTCG CAAGATCATG GGTGCAGCCC TGCACCTGAT GGAGCAGGAG GAAGTCGCCG GTTTGATTCA CCTTACAGCC TTCGGCTGCG GTCCCGATTC CCTGGTGGGC GACCTGGCCG AGCGCTATGC CCACCGGCAT GGTAAGCCTT TTTTACTATT AACCTTGGAT GAGCATACCG GCGAGGCTGG CGTGGAAACC AGACTGGAGG CCTTTATGGA TATGCTGGCC CGGAGGCGTC CGGCATGA
|
Protein sequence | MVVKVKQWRL GVPRALFYYY YAPWWEAFLQ ALGAQVVVSP PTTREIMDLG ISLAVAEACL PVKVYYGHAA WLAPRVEALF VPRLVSVEQK SFICPKLMGL PDMLRAAIKD CPPVIDVTVD MSRRPEEGLK AAIRDTARAI GSRGREVYRA GEIARARYRD WLAGQQEQGR EFSPGKKEGI TPGGQPPLAV GLVGHNYLLH DRYLGMDIAG KVTRLGGRVI LPENFAPELG EAACRRLPKR LYWTLGRKIM GAALHLMEQE EVAGLIHLTA FGCGPDSLVG DLAERYAHRH GKPFLLLTLD EHTGEAGVET RLEAFMDMLA RRRPA
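
The sequences are printed in space-separated blocks of 10 characters, so whitespace has to be removed before any computation. The following is a minimal sketch of how the GC content and the translation could be recomputed from the Gene sequence field. It assumes the field is already given in coding orientation (it starts with ATG although the gene is on the minus strand). The amino acid assignments are those of the standard genetic code, which translation table 11 shares; table 11 differs in its permitted start codons, which this sketch does not model. All names are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code; same
# assignments as translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalize to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon in frame 0, stopping at the first stop."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the Gene sequence field above.
first_blocks = "ATGGTAGTGA AGGTTAAGCA GTGGCGATTG"
print(translate(first_blocks))  # MVVKVKQWRL
```

The output matches the first block of the Protein sequence field. Passing the full Gene sequence string to `gc_percent` and `translate` would give values to compare with the GC content and Protein sequence fields.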
|
| |