Gene Moth_1357 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1357 
Symbol 
ID3832279 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1401602 
End bp1402759 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content57% 
IMG OID637829293 
ProductSerine-type D-Ala-D-Ala carboxypeptidase 
Protein accessionYP_430213 
Protein GI83590204 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1686] D-alanyl-D-alanine carboxypeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTTCAAGA AGTATGGCCC CCGGTTCCTT GCTGCCGGGG TCTTTTTATT TATTTTCCTT 
AGCTTTATTA AACCAGGCCG GGCTGCTGCC CCGGACATTC AGGCCGAGAG TTATGTATTA
ATGGATTTTC GGACGGGCCA GGTCCTTATG GCCAAGAATC CCCATGAGCG GAGGCCCCAG
GCCATTACCA CCAAGATTAC CACGGCCATT TTAGCCCTGG AGCGCGGCAA CTTGAACGAC
CAGGTTATTG CCAGTAAAAA TGCCGCCGAG ACGCCGGAGA GTTCTATTTA TCTCCAGGAA
GGGGAGACCC TGACCCTGGA GGAGCTTCTC TACGCCCTGC TACTGCGCTC GGCCAATGAT
GCCGCCGTGG CCATTGCTGA GCATATCGGT GGCAGTGTTG AAAACTTTGC CCGCATGATG
AATGCGAAGG TCCAGGAGAT TGGGGCCCGG GATACCCACT ATGTTAACCC CCACGGTCTG
ACAGCTCCGG ACCACTACTC CTCGGCTTAC GATCTGGCCC TTATTGGCCG TTACGCCATG
ATGAATCCTA AATTTCGGGA GATTGTCGCT ACCCGCCAGC GGATTATCCC CTGGGCCGGC
AAACCCTGGC CGCGATTGCT GATCAATGAA AATCGCCTGC TATGGGGCTA TTATGCTTAC
CCCGGTGCCG ATGGGGTAAA GAACGGCTAT ACTACCCCGG CCGGGCAGGT CCTGGTGGCC
TCGGCCACCA GGGATAACTG GCGGCTCATC GCCGTGGTCA TGAAGTCCCC CAATATGTAC
CGGGAAACCA GTGCCATCCT CGATTACGGG TTTAATAACT TTCACCAGGT GAAGTTAATG
CCTGCCGGCC AGCAGGTAGC CCTGGCCGGT GTCCGCGGCG GCATAGCAGC AAATATACCT
GCCGTCACGG CCGATGACGT CCTGGTGGTC GAGCCAAAGA ATGAGACCTG GACCTGGCAG
CAGCGGGTGG AACTTAATCC CGACCTCAAT GCGCCGGTCA AGAAGGGGGA TAGGATCGGG
CGGATAATCT TCACTTCCCA CGACCAGGAA GTAAGTGTGG ACCTGATAGC TGCCGGTGAC
GTAGCCCCCC GGCCCTGGTG GATGGGCTTC CTGGAAGCCT TCCTAACAGT CTTCAACCTG
CCCTGGTTGC GGTTCTAG
 
Protein sequence
MFKKYGPRFL AAGVFLFIFL SFIKPGRAAA PDIQAESYVL MDFRTGQVLM AKNPHERRPQ 
AITTKITTAI LALERGNLND QVIASKNAAE TPESSIYLQE GETLTLEELL YALLLRSAND
AAVAIAEHIG GSVENFARMM NAKVQEIGAR DTHYVNPHGL TAPDHYSSAY DLALIGRYAM
MNPKFREIVA TRQRIIPWAG KPWPRLLINE NRLLWGYYAY PGADGVKNGY TTPAGQVLVA
SATRDNWRLI AVVMKSPNMY RETSAILDYG FNNFHQVKLM PAGQQVALAG VRGGIAANIP
AVTADDVLVV EPKNETWTWQ QRVELNPDLN APVKKGDRIG RIIFTSHDQE VSVDLIAAGD
VAPRPWWMGF LEAFLTVFNL PWLRF