Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1815 |
Symbol | |
ID | 3830736 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1874554 |
End bp | 1875444 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637829745 |
Product | agmatinase |
Protein accession | YP_430658 |
Protein GI | 83590649 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0010] Arginase/agmatinase/formimionoglutamate hydrolase, arginase family |
TIGRFAM ID | [TIGR01230] agmatinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000000239855 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAACT CCGCTATCTT AGAAGGCTTT ACCAGGGGCC AGGGATTCCT GGCCAGCAGC GACGATTTTG CCGGGGCCCG TGCTGTCCTG GCCGGCATTC CCTTTGATGC CACCACCAGC TTTCGTCCCG GCACCCGCTG GGGCCCCCGG GCCATCCGTT CCGTATCAGA AGTCCTGGAG GAATACAGCC CCTACCTGCA GCGGGAGCTG ACGGGAGTTC CCTTTTACGA CGCGGGAGAC CTCGACCTCC CGCCGGGCAG GGTGGAGGCC AGTTTAGAAC GAATGGAGGC GGCGGCCGAC GCTATTTTTG CCGCCGCCAG GATCCCATTT TTTATGGGGG GGGAGCACCT GGTGAGCTAT CCCCTCATCC GGGCTGCCTA CCGGCACTAT CCCGATCTGG CCGTTCTCCA TTTCGACGCC CATGCCGACC TGCGGGAGGA TTACCTGGGG GAAAGGTTCT CCCATGCCAC CGTCATGCGT CTGGTGGCGG AGGAGATCGG CCCCGATCGC CTCTACCAGT TTGGCATCCG GTCCGGTACC TGGGGCGAAT TCGCCTACGG CCAGGAAGAA ACCAATTTTT TTATGGACGT GATTACCCCG CCCCTGGCCA AACTGGTACC GGAACTGGTC CGGCGCCCCT TGTATGTGAC CATAGATATC GACGTGGTCG ACCCGGCCTT TGCCCCCGGT ACCGGTACCC CCGAACCGGG GGGCTGCCCG CCGGGGGAGA TCTTTAAAGC CATCCAGATC CTGCAGGGGG CCAATGTAGT CGCCTTTGAC CTGGTGGAGG TCTGCCCGGC CTACGACCAG AGCGACATTA CCGCCATCCT GGCGGCCAAG ATCCTCCGCG AGGCCATTCT GGCCTGGGGA GGGAAAAACG ACCAGGCCTG A
|
Protein sequence | MNNSAILEGF TRGQGFLASS DDFAGARAVL AGIPFDATTS FRPGTRWGPR AIRSVSEVLE EYSPYLQREL TGVPFYDAGD LDLPPGRVEA SLERMEAAAD AIFAAARIPF FMGGEHLVSY PLIRAAYRHY PDLAVLHFDA HADLREDYLG ERFSHATVMR LVAEEIGPDR LYQFGIRSGT WGEFAYGQEE TNFFMDVITP PLAKLVPELV RRPLYVTIDI DVVDPAFAPG TGTPEPGGCP PGEIFKAIQI LQGANVVAFD LVEVCPAYDQ SDITAILAAK ILREAILAWG GKNDQA
|
| |