Gene Moth_1539 details


Gene Information

Locus tag: Moth_1539
Symbol:
ID: 3831925
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1582320
End bp: 1583399
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 62%
IMG OID: 637829471
Product: peptidase M24
Protein accession: YP_430391
Protein GI: 83590382
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
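
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon, and the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
START_BP = 1582320
END_BP = 1583399


def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1080
print(protein_length(length_bp))  # 359
```

Under these assumptions both results agree with the Gene Length (1080 bp) and Protein Length (359 aa) listed in the record.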


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0730424
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTCCAGCA GATTAACCCG GCTGCGGGAG CTCATGGAGC GGGAAGGGAT TACCGCCCTC 
TGGGTGCACC AGGACGAAAA CCGCCGCTAC TTAAGCGGTT TTACCGGCGA CAGCGGCACC
CTACTCATTA CCCCCACAGC CCAGTACCTG TTAACCGACG GCCGCTTTAC AGAGCAGGCC
CGGGAAGAAG CGCCGGACTT TCAAATCATC GACCTGGGCC CGCATCCATG GGAGCAACTG
GGCCAGACCC TGGCCGCCGC CGGCATAGAG AAACTCTTTT TTGAAGCCGA GCACTTGACC
TATGCCACCT ACGAAGAATT CCTAGAGAAA GCCAGGGACT GGCCGCGGCC CGTCAGCCTG
GCGCCGGTGA AGGGCCTGGT AGCCAGGCTG CGCCAGGTGA AAGACGCGGA GGAAATCGCC
GTCCTGGAAA AGGCCATTGC CATAGCTGAC GCCGGCTACA ACCACCTATT AAGTATCCTG
CGTCCCGGCC TTACCGAGCG GGATATAGCC CTGGAACTGG AGTATTTTAT GGGTAAGCAG
GGTTCCAGGG GGCCGTCCTT TACCACCATT ATCGCCAGTG GGCCCCGGTC GGCCCTGCCC
CACGGGGTGG CCTCGGACCG GGTCCTGCAA CCGGGAGACA TGATAGTCAT GGATTTTGGC
GCCGTTTATG GCGGCTACCA TTCCGACCTG ACGCGGACGG TGGCCCTGGC CCCGGTGACA
GCCGAATGGC GGCGCCTCTA TGATATTGTC CTGGAGGCCC AGCAACAGGC CATAGCCGCC
CTTCGCCCCG GGATTCAAGG CAGAGAAGCT GATGCCGTGG CGCGGGAGGC TATTGCCGCT
GCCGGATATG GCGATTATTT CAGCCACGGC CTGGGACATG GAGTCGGCCT GGCCATCCAC
GAAGACCCCA CCCTCTCAAG CCGGAGCGAG GTCAAACTGG CTCCGGGGAT GGTAGTCACG
GTGGAACCGG GTGTTTACCT CCCGGGACGG GGGGGCATCC GCATCGAGGA TGTTGTTCTC
ATCCAGGAGG GAGGCGCTCG GGTCCTCTCC CGCGCCCCCA AAGAGTTTAT TGAGCTGTGA
 
Protein sequence
MSSRLTRLRE LMEREGITAL WVHQDENRRY LSGFTGDSGT LLITPTAQYL LTDGRFTEQA 
REEAPDFQII DLGPHPWEQL GQTLAAAGIE KLFFEAEHLT YATYEEFLEK ARDWPRPVSL
APVKGLVARL RQVKDAEEIA VLEKAIAIAD AGYNHLLSIL RPGLTERDIA LELEYFMGKQ
GSRGPSFTTI IASGPRSALP HGVASDRVLQ PGDMIVMDFG AVYGGYHSDL TRTVALAPVT
AEWRRLYDIV LEAQQQAIAA LRPGIQGREA DAVAREAIAA AGYGDYFSHG LGHGVGLAIH
EDPTLSSRSE VKLAPGMVVT VEPGVYLPGR GGIRIEDVVL IQEGGARVLS RAPKEFIEL
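
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the amino acid assignments of NCBI translation table 11 (identical to the standard code) and treats a table 11 initiator codon in the first position, such as the TTG that opens this gene, as methionine. The names `translate_cds`, `gc_fraction` and `clean` are hypothetical helpers written for this example, and only the first ten codons of the gene are used as input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiator codons of translation table 11 (assumption: read as Met at position 1).
START_CODONS = {"ATG", "TTG", "CTG", "GTG", "ATT", "ATC", "ATA"}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    residues = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            residues.append("M")  # alternative start codons still encode Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
first_codons = "TTGTCCAGCA GATTAACCCG GCTGCGGGAG"
print(translate_cds(first_codons))  # MSSRLTRLRE
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate_cds` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.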