Gene Moth_0500 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0500 
Symbol 
ID3832823 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp517103 
End bp518146 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content50% 
IMG OID637828434 
Productpeptidase S1 and S6, chymotrypsin/Hap 
Protein accessionYP_429373 
Protein GI83589364 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000058985 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATCAA TCAATGTGAA CACTGTGCTG GCCGTTATGC TGAGCGGTTT GTTAATCATG 
GCCGGGGGTT TTGGTCTGGC CAGGATTTAC GATAACATGC ATAATCAAGT GGTTGCAGAG
GCATCTAAAC TCGGTCCTAT CGAGGAGAAC CACTCTGAGT CGGCCAAGTC CCAGGATTTA
AAAAGCATTA TTGCCGAGAA TCAAAAGCTA GTAGTGTGTC TAGAGGTTGA GCTTGATTAC
GGTAAGGTAC AGGGTTCCGG ATTTCTTTAT AACCGGCAGG GGGATGTAAT AACCAATGCC
CACGTGGTGG GTGACGCCAG AACTTGCCGG GTCAAATTGT CCGATGGCAC AGTTTATCAA
GGCACAGTTA TCGGGCGTGG GGAACAAATT GATGTGGCTC TGGTCAGGGT GCCGGAACTG
GCTGGTAAGG AACCAATGAA AATTGCCCGG GATAGAATGG CAGAAGTGGG AGATGAAGTG
ATCGCCCTGG GCAGTCCCCT TGGATTACAA AATACCGCCA CCACAGGAAT CATCAGCGGT
GTCAACCGGG ATTTAGATAT CGGGGATTAT CATTATAAGG GGCTATACCA AATTTCTGCC
CCCATCTCGC ATGGCAGTAG CGGAGGACCG TTACTGGATC GACATACCGG CGAGGTGCTG
GGTGTTAATT CCGCCGGGGC CGAGGGAGAA AATATCGGCT TTAGCATACC CATCACCCAG
GTTTTACCCC TGGTGGAAAA CTGGTCAAAA AACCCCAGTA CAGCACCGGT GCAGCCAACC
ACCGGTACTG TCGGGAAAAT TTCCGAGAAA GAATTAGCCA CTGCTGCATC CTACCTGGTT
GAGGACTTTT ACACTTGTGT CAATAACGGG GATTATGTAG GGGCCTATGC TCTGCTGGGA
AGTGACTGGC AGGCCAAACA GCCCTATGAG AAGTTCCGGG CCGGCTATTT AAATACACTT
TCAGTCTCAG ACGCATTTTT AAAACCAGGC CGCCATTCGG CAACTCCTCA ATTACTTGAC
TGGAATGCCA GTGTCGCTCC TTGA
 
Protein sequence
MKSINVNTVL AVMLSGLLIM AGGFGLARIY DNMHNQVVAE ASKLGPIEEN HSESAKSQDL 
KSIIAENQKL VVCLEVELDY GKVQGSGFLY NRQGDVITNA HVVGDARTCR VKLSDGTVYQ
GTVIGRGEQI DVALVRVPEL AGKEPMKIAR DRMAEVGDEV IALGSPLGLQ NTATTGIISG
VNRDLDIGDY HYKGLYQISA PISHGSSGGP LLDRHTGEVL GVNSAGAEGE NIGFSIPITQ
VLPLVENWSK NPSTAPVQPT TGTVGKISEK ELATAASYLV EDFYTCVNNG DYVGAYALLG
SDWQAKQPYE KFRAGYLNTL SVSDAFLKPG RHSATPQLLD WNASVAP