Gene Moth_2417 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2417 
Symbol 
ID3832168 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2540004 
End bp2541182 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content58% 
IMG OID637830336 
Productpeptidase S1 and S6, chymotrypsin/Hap 
Protein accessionYP_431242 
Protein GI83591233 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0358901 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGCTG TCGCTCCCCG AGGAAACAAG GGGGTTGCCT TCTTGAAGCG CTTTTTTCCC 
GGCGGTTTGA GGCGCTACCG GATACTCCTT CTGGTCTCAT TAGTTTTTTT GAGTTCTATC
CTGGGGGCGG GAGCCGCCCT CTATTTCGCA CCGCGATTGA TGTTCCGCCC CCTGCCACCA
GCGTCCCCAC CCCAAGGCTT GATTAACCAA CCCGTAGGCT TGCCGGCCCA GTATACCGCC
GATTCACCGG TAGTCACTAT TGCCCGGCAG GTGGGCCCTT CCGTCGTGGG GGTTCAGGCC
ATGACCGGTA CCAATTATAC TGGCGACGGC GTGGTAAAGC AGGGTTCGGG AGTAATCTTT
GATACCACTA ACGGCTATAT TGTTACCAAT AACCACGTTA TTGCTGGCGC CGGTCGGATA
ACAGTCAGCC TGGACCGGGA GCAAACCTAT CCGGCGACCC TGGTAGGTGC CGATGAGCGT
AGCGATCTGG CGGTATTAAA GGTCCAGGGG CCCAATCTCC CCCAGGCGCG CCTGGGGGAT
TCCAGCACCC TGCAGGTAGG GGAAACTGTA GTAGCCATTG GTAACCCCCT GGGACGGGAG
TTTGCCCGGT CGGTAACCGT AGGGGTAATC AGCGCTTTGA ACCGGGAGGT AACAGTTCCC
GGGTCCCGGG GTGTGGAGAT AACCCTCCGT GTCCTCCAGA CCGATGCCCC AATTAACCCT
GGCAACAGCG GCGGCGCCCT GGTTAACCTG CGAGGTGAGA TTATCGGCAT CAACAGCGTC
AAGATTGCCG CCAGTGGTGT CGAGGGAATG GGTTTCGCCA TTCCCATAAA CGACGTCCGG
CCCATTATTG ACCAGATAAT CACCCGCGGC TATGTCACCC ACCCCTTCCT GGGAGTTTAT
AACCTCCAGG AGATTACCCC GGAGATGGCC CAGTGGTATA ATATACCTGT AGGCGTCTAT
GTCGGGGGTG TCTTCAAGGA TGGCCCGGCA GCCAAGGCCG GCCTGCAGGT AGGAGATGTC
ATCACCGCCG TAGAGAACCA GAAAGTCGCC ACCTATGATG ACATCCAGCG CCTGATCAAT
AAGAAATCCC CGGGGGATCA GGTGACGGTG ACTATCCGGC GCCTCAAGTC GCCAAACCCG
GTCAACTACA CTATAACCCT GGGGGAATTA CCTAAATAG
 
Protein sequence
MAAVAPRGNK GVAFLKRFFP GGLRRYRILL LVSLVFLSSI LGAGAALYFA PRLMFRPLPP 
ASPPQGLINQ PVGLPAQYTA DSPVVTIARQ VGPSVVGVQA MTGTNYTGDG VVKQGSGVIF
DTTNGYIVTN NHVIAGAGRI TVSLDREQTY PATLVGADER SDLAVLKVQG PNLPQARLGD
SSTLQVGETV VAIGNPLGRE FARSVTVGVI SALNREVTVP GSRGVEITLR VLQTDAPINP
GNSGGALVNL RGEIIGINSV KIAASGVEGM GFAIPINDVR PIIDQIITRG YVTHPFLGVY
NLQEITPEMA QWYNIPVGVY VGGVFKDGPA AKAGLQVGDV ITAVENQKVA TYDDIQRLIN
KKSPGDQVTV TIRRLKSPNP VNYTITLGEL PK