Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0161 |
Symbol | |
ID | 3831873 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 161222 |
End bp | 162274 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637828100 |
Product | ATP:guanido phosphotransferase |
Protein accession | YP_429042 |
Protein GI | 83589033 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3869] Arginine kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCTGA ACCTGGAGCG CAACAGCAAA TGGATGGAGG GCTCCGGCCC CCAGGCCGAT ATCGTCATTT CCAGCCGCAT CCGCCTGGCC CGCAACCTTA AAGGCCTGCC CTTCCCCAAC CTCATGAACA GCGACCAGCA GGCCCGGGTA GTCAGCCAGG TGAGCCGGGC CATCCAGGCG CCCATGGTCT TCCAGGCCGT GGGGGAGCTG AAACTCCAGC CCCTGCGGGA ATTGGCGCCG GTGGAGCGGC AAATACTAGT AGAAAAGCAC CTCATCAGTC CCGACCTGGC CGCAGGCGGC GGCGAAAAGG CCGTTGTCCT GCGGGACGAC GAGGCCGTCA GCATCATGGT CAACGAGGAA GACCACCTGC GGCTGCAGTG CCTCCTGCCG GCCATGATGC TCCACGAAGC CTGGCGCCTG GCCGACGCCG CCGACGACGC CCTGGAGAAC GAGCTGGACT TCGCCTTCGA CCAGGAGCGC GGCTACCTGA CCGCCTGCCC GACCAACGTC GGCACGGGCC TCAGGGCTTC CACCATGCTC CACCTGCCGG CCCTGGTCCT GACCAGGCAG GCGGGGCCGG TGCTTTCGGC CCTGACCAAG GTGGGGGTGG CCGTCCGGGG CCTCTACGGC GAGGGCACCG AGGCCCAGGG CAACATCTTC CAGGTGTCCA ACCAGATCAC CCTGGGCCGT TCTGAAGAGG AAATTATCAA CAACCTCTCG GCGGTGACGG TGCGCCTGGC CGATCAAGAA CGGGAGGCCC GGGAGCTCCT GCGCCGCCAG AGCCGCTGGC AGCTGGAGGA CCGGGTTGGC CGGGCCTACG GCGTCCTGAC CAACGCCCGG ATCTTGAGCT CCCAGGAAGC CCTGCAGCTC CTCTCCGACG TGCGCCTGGG GGCGGAGATG AAGATCATCC GCGGCCTGGA CCAGCGCCTG CTGAATCAGC TCATGGTCCG CATCCAGCCG GCCTTCCTCC AGTTTAGCGC CGGCAAAGAG ATGACGCCCA TGGAGCGCGA CGTCCACCGG GCGGCCATGG TCCGGGAATT GCTGGCAGGC TAA
|
Protein sequence | MSLNLERNSK WMEGSGPQAD IVISSRIRLA RNLKGLPFPN LMNSDQQARV VSQVSRAIQA PMVFQAVGEL KLQPLRELAP VERQILVEKH LISPDLAAGG GEKAVVLRDD EAVSIMVNEE DHLRLQCLLP AMMLHEAWRL ADAADDALEN ELDFAFDQER GYLTACPTNV GTGLRASTML HLPALVLTRQ AGPVLSALTK VGVAVRGLYG EGTEAQGNIF QVSNQITLGR SEEEIINNLS AVTVRLADQE REARELLRRQ SRWQLEDRVG RAYGVLTNAR ILSSQEALQL LSDVRLGAEM KIIRGLDQRL LNQLMVRIQP AFLQFSAGKE MTPMERDVHR AAMVRELLAG
|
| |