Gene Moth_0123 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0123 
Symbol 
ID3830780 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp120040 
End bp121470 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content59% 
IMG OID637828057 
Productputative aminopeptidase 1 
Protein accessionYP_429005 
Protein GI83588996 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1362] Aspartyl aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.870895 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGAAA GAGAAGAAAG CAGGGGTAAG AAACTACAAA AAGAACTGGG TCTGCCCCGT 
AAGAGCGCCT GGGAGCGCCT GGAGCCAGGT ACCAGGGAAA AGGTCTTTGC CTTTGCCGAG
GGGTATAAGG GTTTCTTAAG CCGGGCCAAG ACCGAGCGAG AGGCCATTCG GGCCGCTCTG
GAGGCTGCCC GGGGTCGCGG CTTTATCTCC CTGAACGATA TTCCCTCCGG CCGGGGGTTG
GATCCCGGCA GCCGCTTCTT TTTAACCTGG CGAGATAAGG TCATGCTCCT GGGGATAGCC
GGTACTGCTC CCCTGGAAGA AGGGATCAGG CTGGTCGGGG CCCATGTCGA CGCCCCGCGC
CTGGACCTGA AACCCATGCC CCTGTATGAA AAGGACGGTC TGGCCATGTT TAAAACCCAC
TACTACGGGG GGATCAAGAA GTACCAGTGG ACCGCCCTGC CCCTGGCCCT CCATGGCGTG
ATCATGCTCC GCGACGGTCG CCGGGTGGAA GTGGTAATCG GCGAAGACGC CGGCGATCCC
ATCTTTAGCG TCAGCGACCT CCTGCCCCAC CTGGCCAAGG AGCAGATGAA AAAGAATATG
GAAGAAGCCT TAAGCGGCGA CGACCTGAAC CTGGTGGTTG GCAGCTTGCC CTTTGAGGGT
GAAGAAGAGC TCAAGGATAA GATCCGCCTG GCTATCCTCC AGCTCCTGAA CCAGCGTTAC
GGCCTGGTGG AAGAAGACTT CATCACCGCC GAACTGGAAC TGGTACCGGC CGGTCCGGCG
CGGGACCTGG GTCTGGATCG TTCCCTGGTA GGGGCCTACG GCCAGGACGA CCGGGTGTGC
GCTTATACCG CCCTGCAGGC CGTCCTGGAA CTGGAGAACC CCGGCCATAC AGCTCTGGTA
CTCCTGGTCG ATAAAGAAGA GATCGGCAGT ACCGGCAACA CCGGCGCCCA CTCCCGCTTC
CTGGAGTATG CCCTGACGGA GCTGGCGGCC CGTATGGGTT CGGCCAGTAT CGTCAATGTC
GGCCGGATAA TGGCCAATTC CCAGGCTATA TCCGCCGACG TAACCGCCGG GGTGGATCCT
ACCTACGAAA ATTACTTTGA TATGTATAAC GCCTCTTTCC TGGGATACGG CGTAGTCCTC
AATAAATACT CAGGCTCCCG GGGTAAATAC GATGCCAATG ATGCTAGCGC CGAGTTCATG
GGCCGGTTAA GGGATATCTT TAACCACAAC CGCGTCATCT GGCAGAGTGG CGAACTGGGT
AAAGTAGACG CTGGCGGCGG GGGCACCATT GCCAAGTTCC TGGCCTATTT CGGCCTGGAT
GTCGCCGATT GCGGTCCGGC CCTCCTCTCC ATGCACGCCC CCCTGGAGAT CGCCAGCAAA
GTAGATATCT ATATGGCTTA CCGGGCCTAT GGCGCCTTCC TTGCCAGTTA A
 
Protein sequence
MAEREESRGK KLQKELGLPR KSAWERLEPG TREKVFAFAE GYKGFLSRAK TEREAIRAAL 
EAARGRGFIS LNDIPSGRGL DPGSRFFLTW RDKVMLLGIA GTAPLEEGIR LVGAHVDAPR
LDLKPMPLYE KDGLAMFKTH YYGGIKKYQW TALPLALHGV IMLRDGRRVE VVIGEDAGDP
IFSVSDLLPH LAKEQMKKNM EEALSGDDLN LVVGSLPFEG EEELKDKIRL AILQLLNQRY
GLVEEDFITA ELELVPAGPA RDLGLDRSLV GAYGQDDRVC AYTALQAVLE LENPGHTALV
LLVDKEEIGS TGNTGAHSRF LEYALTELAA RMGSASIVNV GRIMANSQAI SADVTAGVDP
TYENYFDMYN ASFLGYGVVL NKYSGSRGKY DANDASAEFM GRLRDIFNHN RVIWQSGELG
KVDAGGGGTI AKFLAYFGLD VADCGPALLS MHAPLEIASK VDIYMAYRAY GAFLAS