Gene Information |
Locus tag | Moth_0041 |
Symbol | |
ID | 3830907 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 41660 |
End bp | 43069 |
Gene Length | 1410 bp |
Protein Length | 469 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637827973 |
Product | arginine decarboxylase |
Protein accession | YP_428923 |
Protein GI | 83588914 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1982] Arginine/lysine/ornithine decarboxylases |
TIGRFAM ID | |
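The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers for illustration, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(41660, 43069)
print(length, protein_length_aa(length))
```

Under these assumptions the result agrees with the Gene Length (1410 bp) and Protein Length (469 aa) fields.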
Plasmid Coverage information |
Num covering plasmid clones | 44 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000185697 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAAGACCC AAGGGCAGGC GCCTCTATGG GAGGCGCTTT TAAACTATCG TCAGCAAGGA CTTAATTCCT GGCATACTCC CGGTCACAAG GACGGGGCCT ATACCTTGCC CCTGTGGCGG GATTTCCTGG GCGCGGCCCT GGCTCTGGAC CTGACTGAGG TACCTGGCCT GGATAACCTG GCCTGTCCCG GGGGGGCTAT CGCCGCCGCC CAGGAGCGGG CGGCCCGCTT TTATGGCGCA GCCCGGACTT TTTTCCTGGT AAACGGTGCG AGCGCCGGCC TGATGGCCGT CATCCTGGCC ACCTGCCGCC CCGGGGATGA AGTTTTGCTG CCCCGGTACG CCCACCGGGC GGTTTTTAAT GGCCTGATTT TAAGTGGGGC CCGGCCGGTT TACCTGGCCA CGGAATGGCT GGCCAGCCCG GGCCTGCCCC TGGGGGTAGC CCCGGAAAGC CTGGCCGGAA CCTTAAGGGA ACATCCCGGG GCCAGGCTCC TGCTCCTGGT CCATCCTACC TATGAAGGTG TAGTACCCCG GAGCGAAGAA CTAATTGCCC TGGCCCACGC CCACGGCGTG GCGGTCCTGG CCGATGCCGC CCATGGGGCC CACTTTGGCC TGGCTCCGGG CCTGCCACCG TCTCCCCTGG ACCTGGGGGC TGATTTTGTC GTCCAGAGTA GCCATAAAAC CCTGGCGGCC CTGACCCAGG CAGCCATGCT GCACCTGCGG GAGGAGGCAG CGGCAGCAAG GGTGGCGGCA GCCCTGAACC TGCTCCAGAC TACCAGTCCT TCCTACCTCC TGCTGGCTTC CCTGGATACG GCCCGTCTCC TGGCGGAAGA ACGGGGCCGG CAGGACTGGG GTCTGACGGT AGCCCGGGCT ACCGCGGCCC GGGCCAGGCT GGATCGGGCC GGACTACCGC CCCTGGCAAT GGCAGATGTT ACCGGGCCGG CGGCCAGCGG CCTGGATGTA ACCCGCCTGC TCCTGCCTAC GGCGCCCCTG GGCCGGAGGG GAACGGAGGT GGCTGCCACC CTGCGCCGCG CCGGCCAGGA GGTAGAACTG GCGGGAGAGG ATTATATCGT GGTAATCATC ACCCCCGGCG ACGGTGAGGA GAAAATCGAG GCCCTGGTGA CCGCCCTGCT GGCCCTGCCG CGATCCGACA GTAGGCAGCC GGCAGCTTTA GCCCCGGCAC CGCCCGTCAG GATACCCGAG ACGGCTTTGA CGCCCCGGGA GGCCTGGCTT GCCCCCCGGC GGGAGTTGCC CCTGGGGGAG GCTACGGGGA AGATTGCCGC CGAACTCATC TCCCCCTGCC CGCCGGGCCT GGCCCTGACG GTACCGGGGG AGGTTCTGAC GCCGGAGGTC CTCGAGCGGC TCAGGGATTT ACGGGGACCG TCTGGCCGGG TGCTGGTAGT GGATGGCTAA
|
Protein sequence | MKTQGQAPLW EALLNYRQQG LNSWHTPGHK DGAYTLPLWR DFLGAALALD LTEVPGLDNL ACPGGAIAAA QERAARFYGA ARTFFLVNGA SAGLMAVILA TCRPGDEVLL PRYAHRAVFN GLILSGARPV YLATEWLASP GLPLGVAPES LAGTLREHPG ARLLLLVHPT YEGVVPRSEE LIALAHAHGV AVLADAAHGA HFGLAPGLPP SPLDLGADFV VQSSHKTLAA LTQAAMLHLR EEAAAARVAA ALNLLQTTSP SYLLLASLDT ARLLAEERGR QDWGLTVARA TAARARLDRA GLPPLAMADV TGPAASGLDV TRLLLPTAPL GRRGTEVAAT LRRAGQEVEL AGEDYIVVII TPGDGEEKIE ALVTALLALP RSDSRQPAAL APAPPVRIPE TALTPREAWL APRRELPLGE ATGKIAAELI SPCPPGLALT VPGEVLTPEV LERLRDLRGP SGRVLVVDG
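The sequences above are displayed in space-separated blocks of ten characters, so they need light cleanup before any analysis. The following is a minimal sketch of how that could be done, with a GC fraction calculation and a stop codon check. It assumes the pasted text contains only bases and whitespace; the helper names `clean_sequence`, `gc_fraction`, and `ends_with_stop` are hypothetical. The stop codon set (TAA, TAG, TGA) is the one used by translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(grouped: str) -> str:
    """Remove the whitespace between ten-character blocks and uppercase."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a stop codon."""
    return seq[-3:] in STOP_CODONS


# First two and last two blocks of the gene sequence in this record.
head = clean_sequence("GTGAAGACCC AAGGGCAGGC")
tail = clean_sequence("TGCTGGTAGT GGATGGCTAA")
print(head, gc_fraction(head), ends_with_stop(tail))
```

Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared with the GC content field. Note that the gene begins with GTG, an alternative start codon permitted under translation table 11, which is why the protein still begins with M.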