Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0738 |
Symbol | |
ID | 3831130 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 771443 |
End bp | 773122 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637828669 |
Product | peptidase M1, membrane alanine aminopeptidase |
Protein accession | YP_429599 |
Protein GI | 83589590 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0308] Aminopeptidase N |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00291351 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGTCT TAAGCCTCGC CATCCTGGCT TTCTTTTTTT ATCCCCGGCT CGTCCCGGAG CCGGCGGCGA TTGCGACGGA TGCTACTTTC AAACAGGTGG TCCTGGACCC ACCGCCCCTG GAAGCAGACG GGCAGGTACT CGTTTCGGCG GGGGATTTTC TACCCCTGGC CGGCGGCAGC GCCGACTGGT ATAATGGCCA GGGCGTCCTC AGGGGGGCGG GGAGCGCCAG AGTTGTTGCC GGCAGCCCGG TGGCTACCCT CAACGGCGAG CCCCGGCAGC TCAAAGTACC CCCCCGCCTG GTGGGCGATA AACTCTACAT CCCCCTCCAG CTGGCCCTGG CAGTGCTGGG GACTTCGAGT AGCCAGGGGG GATCTGACCT GCCCCTGCCG CCCCTCCGGG GCAGCACAGG GGTCCGTTTT CGTTACTACC CCGTCTACGA CCTCCAGGCC CGTTACGACC CGGCCAGCGG CGAGATTCAG GGAGAACTGC TCCTGGCTTA CCAGAATCCT TTCCTCACTC CTTTGCGGCA GCTCAATTTC AACCTCCCGG CCAATGCTCC CTTCGGTAAT GGCGCCAGCC TGGCGGTCAC CAGGGCGGTC GTAAACAACC GGCCGGTGGC GGTACACTTC AAGGGCAGCC GGTTGGAGGT GCCCCTACCA CGGGCCCTGG CCCCTAGGGA GACCCTTTCC GTGGTCCTTT CCTTTAAGAC TATAGTACCG CCGGGGGATA TGCGCCTGGG CCGGGATGGG AACCTGAGCA CGGTATCCGG CTGGTACCCT ATCCTGGCTC CCCCCACGGA GGATACCTGG GCCGGGGTGG CCGGTACGGC CTATGGCGAC CCTTACTTCG CCGGCGCAGC CTATTACCTG GTGCGGCTCA CCCTGCCATC CGGCTATCAG GTCCTGGCCA GTTCCCGGCT GACGGGACGC CAGGAAAGGG GAGAATGGAC AGACTGGTCT TTTAATAGCG ACCAGCCGGT ACGGGAGTTT GCCTTTACAG CGGCCCCGGA CTGGCAATTC ACCACCCGCC AGGCGGGCAG GGTGCAACTG GTAGTTGCGT CCCGGGGGGA GATGGCGCCG GCGGTCCTGG ATGTCGCCGC CCGCGCTCTG GAGTTTTTCC AGAAGCTCTA TGGACCTTAT CCTTACAGCT ACCTGCATAT AGCCTTTGTC CCCCTGGACA ACCTGGCCGG CATGGAATAC CCTGGCCTGC TGCTCCTGAG CAACCGCAAA CCATATAACC CTGCCGTAGT CGTTCACGAG GTCGCCCACC AGTGGTGGTA CAATCTGGTG GGTAACGACA CCCGGCAGGC GGCCTGGATT GACGAGGGCT TGGCCGAGTA CAGCACCCTC CTGTTTTACC GGCACTTCGA TCCCGGCCTT TATCAGGCCA AACTGGCCGA GATTACCCAA CTCGCTGCCC GCACCGGTGC GCCCATCAAC CTCCCCCTGG AGGAATACGG TAGCGAACAG GACTACCGCC AGGCTGTCTA CAACCGGGGG GCGATGTTCT GGCTGGAACT GGAAAGGATG GCCGGTGAAG AACAATTAAA GGAGGCCCTG GCCTATGTCC AGCGCTATTA CCGTTACGAG ATTATCCCGC CCAGGGCCCT GCTGACAATT ATTACGTATT ATGGCAAGCT AGATTCCAAC AATTTCAGTC CGTTTTTACG GAACAAATAA
|
Protein sequence | MAVLSLAILA FFFYPRLVPE PAAIATDATF KQVVLDPPPL EADGQVLVSA GDFLPLAGGS ADWYNGQGVL RGAGSARVVA GSPVATLNGE PRQLKVPPRL VGDKLYIPLQ LALAVLGTSS SQGGSDLPLP PLRGSTGVRF RYYPVYDLQA RYDPASGEIQ GELLLAYQNP FLTPLRQLNF NLPANAPFGN GASLAVTRAV VNNRPVAVHF KGSRLEVPLP RALAPRETLS VVLSFKTIVP PGDMRLGRDG NLSTVSGWYP ILAPPTEDTW AGVAGTAYGD PYFAGAAYYL VRLTLPSGYQ VLASSRLTGR QERGEWTDWS FNSDQPVREF AFTAAPDWQF TTRQAGRVQL VVASRGEMAP AVLDVAARAL EFFQKLYGPY PYSYLHIAFV PLDNLAGMEY PGLLLLSNRK PYNPAVVVHE VAHQWWYNLV GNDTRQAAWI DEGLAEYSTL LFYRHFDPGL YQAKLAEITQ LAARTGAPIN LPLEEYGSEQ DYRQAVYNRG AMFWLELERM AGEEQLKEAL AYVQRYYRYE IIPPRALLTI ITYYGKLDSN NFSPFLRNK
|
| |