Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1058 |
Symbol | |
ID | 3833322 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1089626 |
End bp | 1090783 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637828986 |
Product | Serine-type D-Ala-D-Ala carboxypeptidase |
Protein accession | YP_429915 |
Protein GI | 83589906 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1686] D-alanyl-D-alanine carboxypeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0287613 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00070784 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAAGCATG TGAAACATGT AGTCCTGATT TTAGCGCTGG TTGTTTTTTG TTTCCTGGGT TCCAGCCCGG TAAAGGCGGC TGATGAACTT CAAATTACGG CGCCGGCGGC CGTTTTGATG GAAGCAAGCA CAGGGCAGGT CCTTTATGAG CGAGGTGCCA GGGAAAAGCG ACCCCCGGCA AGCACGACGA AGATCATGAC GGCCATCCTG GCCCTGGAGT TGGGCCGGCT CGATACCCCT ATTAAGGTAA GCGAAAATGC CGCTACCACC CCGGGGGCCA GTATCTACCT GCAAAAGGGG GAAGTTTTGA CCCTGGGGGA TTTAGTAAAA GGGGCTCTCC TTGAATCCGG TAACGATGCC ACCGTAGCCA TTGCCGAGGG ACTGGCCGGT TCCGAGGCCG GCTTTGCCTT CTTGATGAAC CGCAAGGCCT GGCTCCTGGG AGCCCGTACG ACCCACTTCA ATAACCCCAA CGGTCTCCCT GATCCGGGCC ATTATACCTC AGCCTACGAC CTGGCCTTAA TAGCCCGCTA TGCCCTGGGT AACCCTGTCT TTCGCCGCCT GGTGGCCACA GTTGAAGACC AGATCCCGGG ACCGGATGGG GTACGCCATC TCTATAATAC CAACCGCCTC CTGGAAAGCT ACCCCGGGGC CGACGGGGTC AAGACCGGCA CTACCGCAGC CGCCGGCCAG TGCCTGGTGG CCTCGGCGAC AAGGGGGGGC CGCCAGTTGA TTGCCGTGGT CCTGGGCAGC GCTGATCGTT ACGGCGACAC CAGGACCCTC CTGGATTATG GCTTTAAAAA TTTTTATTGC GAAATGGTGA AAGCCGGGGA ACCCCTGGGC CAGGTATATA TTCCCAACGG AGAAATGACC AGCATCGGCG TGGCCCCGGC TATGGACGCT GGCTTCACGG CACCCCTGGC CCGGAGCGGC CAGCTGGAAA AAAAGGTGCT TTTACCGCGT GCCGCCAGGG CGCCGGTAAG AAAGGGTCAG GAATTGGGGC GGGTGCAGAT ACTTTTCGCA GGCCGTGAGG TCGCGGCCGC TCCCCTGGTG GCCACCGGGG ATGTCAGGGC AATTCCCTGG TGGTCGCGGT TAATCTCCCT GGGTAGTAAT ATTTTTCATG CCGGCAGGTT TTCGGCCCTT CTGGAACGAA ATAGATGA
|
Protein sequence | MKHVKHVVLI LALVVFCFLG SSPVKAADEL QITAPAAVLM EASTGQVLYE RGAREKRPPA STTKIMTAIL ALELGRLDTP IKVSENAATT PGASIYLQKG EVLTLGDLVK GALLESGNDA TVAIAEGLAG SEAGFAFLMN RKAWLLGART THFNNPNGLP DPGHYTSAYD LALIARYALG NPVFRRLVAT VEDQIPGPDG VRHLYNTNRL LESYPGADGV KTGTTAAAGQ CLVASATRGG RQLIAVVLGS ADRYGDTRTL LDYGFKNFYC EMVKAGEPLG QVYIPNGEMT SIGVAPAMDA GFTAPLARSG QLEKKVLLPR AARAPVRKGQ ELGRVQILFA GREVAAAPLV ATGDVRAIPW WSRLISLGSN IFHAGRFSAL LERNR
|
| |