Gene Moth_0201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0201 
Symbol 
ID3832274 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp197705 
End bp199084 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content60% 
IMG OID637828137 
Productpeptidase 
Protein accessionYP_429079 
Protein GI83589070 
COG category 
COG ID 
TIGRFAM ID[TIGR02889] germination protein YpeB 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACACGCA AGCTTTCAAC CATTCTCCTT TCCCTGGCCC TCCTGCTGGC TATCGGCTGG 
GGACTGTGGG AGAGGGCCAA CCGGCTAACC CTGGCCAACG CCGTCGAGGC CGGCGGCCAA
CGTGATTTTT ATAATCTCCT GAACTATGTC GAGCAGGCCC AGGTAAGTAT GGGCAAAACC
CTGGCCAGCA GTTCCCCCCG TCAACAGGCC GTCCACCTGA CGGAAGCCTG GAACCAGGCG
GCTGCGGCGC AACACTCCCT GACCCAGCTC CCCACCCCCG GTTTTAAACC GGTGAATACC
AGCAAATTTC TGTCCCAGAC CAGCGATTAC AGTAACTACC TGGCGCAAAA ACTGGCCCGG
GGCGAAGAAA TGACCCCCCA GGAAAGGCAG CAACTGGCCA GCCTGCGGGA AGAAATGGGG
CGTCTGGCCG CCGATTTGCA CCAGACAGAA GGCCAGGTGG CCGGCAAGAC TTTGCGCTGG
AGCAGCTTTT ACGGCTTCAA GATGCCGTCC CTGCCGCGGA CCATAGCCGG CCGGGCCCTT
CCGGTCGAGG CGCGACCCGG CCCCCTGGAT GGTTTCGTCA ATACCGACCG GCGCCTGCAG
ACCCTCCCCA GCCTGAACTA TGACGGCCCC TTTTCCGATC ACCTGGAGAA ACAGCGACCC
CTGGGGTTGG GTGGTGGCGA GGTAACCCAG GCCGAGGCTG AACGACGGGC GATGAACTTC
AGCAACGCCG CCAGTAACAG CAATTACCGG GTCCAGGCCA CTCGCACCAG CAACGGCCGC
ATCCCCACCT TTTCCCTCCG GCTGCTGGAT AGCAGGCGGC CCAATGTGAC GACCCATATA
GACGTCAGCA AGCAGGGCGG CCAGATCGTC TCCCTGTTAA ACACCCGTCC TGTAGGAGCT
CCTACCCTGG ACGCCGCCGC GGCCCTGGAA AAGGCCAGGG CTTTTCTCCA GGCCCAGGGC
TTCACCGGGA TGCAGCCGAC CTACACGGTA CGCACCGATA ACAACCAGGT CATCACCTTT
GCCGCCAAGG AAGGGGATGT CATCCTCTAC CCGGATCAGG TGAAGGTGAA GGTCGCCCTG
GACAACGGCG AAATAACCGG CTGGGACGCC ACACCCTATT ATATGTCCCA CCACAAACGG
GATCTGCCCC GGCCGAAGCT GACACCAGAG CAGGCCCGGG CCAAAATAAA CCCCGGGATC
AAGGTAGAAG GCGTCAGGCT GGCCCTCATA CCCTTGCCCG GGGGGCAGGA GAAGTTGACC
TACGAGGTTA AAACCAAAAT GGACAACACT TATTACCTCA ACTATATTAA TGCCTTGACC
GGTGAGGAAG AAAAGGTCTT GCAGATAATC GACGTACCCG GCGGCCAGCT CACCATGTAG
 
Protein sequence
MTRKLSTILL SLALLLAIGW GLWERANRLT LANAVEAGGQ RDFYNLLNYV EQAQVSMGKT 
LASSSPRQQA VHLTEAWNQA AAAQHSLTQL PTPGFKPVNT SKFLSQTSDY SNYLAQKLAR
GEEMTPQERQ QLASLREEMG RLAADLHQTE GQVAGKTLRW SSFYGFKMPS LPRTIAGRAL
PVEARPGPLD GFVNTDRRLQ TLPSLNYDGP FSDHLEKQRP LGLGGGEVTQ AEAERRAMNF
SNAASNSNYR VQATRTSNGR IPTFSLRLLD SRRPNVTTHI DVSKQGGQIV SLLNTRPVGA
PTLDAAAALE KARAFLQAQG FTGMQPTYTV RTDNNQVITF AAKEGDVILY PDQVKVKVAL
DNGEITGWDA TPYYMSHHKR DLPRPKLTPE QARAKINPGI KVEGVRLALI PLPGGQEKLT
YEVKTKMDNT YYLNYINALT GEEEKVLQII DVPGGQLTM