Gene Moth_1813 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1813 
Symbol 
ID3830731 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1872523 
End bp1873680 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content64% 
IMG OID637829740 
Productpeptidase M24 
Protein accessionYP_430656 
Protein GI83590647 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000026363 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAAGC TGGAATACAT AGAATATAAG AATCGCCTGA GGCGTTTCCA GGAGTCCCTG 
CAAGCTCTGG ACCTGGACGG GGCCCTGGTC TACCAGGCCG CCGACCTGTA CTACCTGACA
GGAACGGCCC AGAGCTGCCA CCTTTTCGTT CCGGCTGCCG GGGAGCCCCT CCTCCTGGCC
TACCGGGATT TTGAGCGGGC ACGGGAGGAA TCCGCCTGGC AGGTCAGGCC CCTGGGTAGT
TTTAAGGATA TCCCAGGTCT CCTGGCGGAG GCCGGGTTGA CCGGGCTGCG GCGCCTGGGA
CTGGAGCTGG ACGTCATTCC CTTAAGCCTT TTCCGGCGCT ACGAGGCCCT CTTGCCCGGC
GTTCAGTGGG CCGACATCGG CCAGGTCCTG CGACGGCAAC GAATGGTCAA ATCGCCGGCC
GAACTGGAGG CCCTGCGGTG GTCCGCCGCC AAACACGCAG AGGTCTTCCG TTACATAACT
GCCAGGATCC GGCCTGGTAT GACAGAGCTG GAGATTGCCG CCGAGTTTGA AAGCTATGCC
CGCCGCCTGG GCCACCAGGG CGCCAAGCGC TTCCGGGGTC AGGAGCAGGG CATGATTCCG
GGCCTGGTTG CCGCCGGGGC CAACTCCGCC CGGACCTCCT GCTTCAACCT GCCTCTGGCC
GGCCTGGGCC TCTCACCCCT TTACCCTATG GGAGCCAGCC AGCACGTCTG GGAGGAAGGT
GAACCCCTCC TTATCGACTA CGCCGGGGTT TACGGCGACT ACACCGTCGA CCAGACCAGG
ATCTACCTTG GTAAGGGGGT ACCGGAAGAC TTACGGCAGG CCCAGGAAGT GGCTATGGAG
ATTGCCAGCC GGGTGGCGGA AGAGGCCCGG CCCGGAGTAA CGGCCGGCGC CCTCTACGAC
CTGGCCGTGG CCATGGCGGC CCGCGCTGGT TTGCAGGAGC ACTTTATGGG CTACGGCCGG
CAGGTGACTT ACATCGGTCA CGGCGTCGGC CTGGACCTGA ACGAGTGGCC GGTGATAGCC
AGGGGGGACA AGACTGTCCT GGCCGCGGGC ATGGTCTTCG CCCTGGAGCC CAAGTTTGTC
TTCCCGGGGA TGGGCAGCGC CGGGGTGGAG GATACGTATG TGGTTACTGA TAGGGGAGCG
GAAAAGTTGA CATATTAG
 
Protein sequence
MAKLEYIEYK NRLRRFQESL QALDLDGALV YQAADLYYLT GTAQSCHLFV PAAGEPLLLA 
YRDFERAREE SAWQVRPLGS FKDIPGLLAE AGLTGLRRLG LELDVIPLSL FRRYEALLPG
VQWADIGQVL RRQRMVKSPA ELEALRWSAA KHAEVFRYIT ARIRPGMTEL EIAAEFESYA
RRLGHQGAKR FRGQEQGMIP GLVAAGANSA RTSCFNLPLA GLGLSPLYPM GASQHVWEEG
EPLLIDYAGV YGDYTVDQTR IYLGKGVPED LRQAQEVAME IASRVAEEAR PGVTAGALYD
LAVAMAARAG LQEHFMGYGR QVTYIGHGVG LDLNEWPVIA RGDKTVLAAG MVFALEPKFV
FPGMGSAGVE DTYVVTDRGA EKLTY