Gene Cthe_1038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1038 
Symbol 
ID4811332 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1241562 
End bp1242587 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content44% 
IMG OID640106456 
Productgermination protease 
Protein accessionYP_001037463 
Protein GI125973553 
COG category 
COG ID 
TIGRFAM ID[TIGR01441] GPR endopeptidase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000076763 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCATTTGG CAAAAAGGAT AAGGGAGGAT TACAGGGTGG AAAGGAATAT AAGAACGGAT 
CTTGTGCTGG AAGCTCATGA GCTTTTGAAG GAAAACGAGT TTAAAAACGA AAGAAGGGAA
CCGCCGGGAG TCGATGTTGA AAACGACGGT ACGGAAGATA TAAGAATAAC CAGGGTGAGG
GTAACGTCAC CCACCGGTGA GGCGGCTATC GGAAAGCCGA TGGGTAATTA TATCACCCTT
GAGGTGCCCA GGCTCAAGGA AAACGACCAG GAATTGTACG AAGAGACTTG CAAAGCTCTT
GCCAAAGAAC TGACACGTGT ATTGAATCTT AAAGACGACT CAACAATTCT GGTTATAGGA
TTGGGTAACT GGAATGTCAC ACCGGACGCA TTGGGGCCGA AAGTCGTTTC AAGGCTTATG
GTCACAAGGC ATTTGCTTGA GTATGTTCCT GATCAGGTTG ATGAAGGGGT AAGACCGGTG
TGTGCGGTAT CTCCCGGCGT GTTGGGTATT ACCGGTATAG AGACGGGTGA GATTGTAAGA
GGAATTGTTG ACAGGGTAAA ACCCGATGTT GTAATTGCGA TAGATGCTTT AGCTTCCAGA
AAAATGGAAA GAGTGAATAC CACTATTCAG ATTGCGGATA CCGGAATTTC CCCGGGTTCG
GGAGTCGGCA ACAAAAGAAT GGAGCTTTCC AGAGAAACTT TGGGAGTTCC GGTTATTGCA
ATCGGAGTCC CGACCGTGGT GGATGCGGCA ACCATGGCAA ATGACACAAT TGATCTCGTT
ATAGACAACC TTATTAGAGA AGCAAAAGAA GATTCGCATT TTTACAATAT GCTTAAAAAT
ATTGACAGAA ATGAAAAATA TCAATTGATA CAAGAGGTGT TGCAGCCCTA TGTGGGCAAC
CTTGTGGTAA CTCCGAAAGA AATTGACGAT GTTGTTGACA GAATTGCAAA AGTAATTGCT
AACGGTCTTA ATATTGCGCT TCACCAAGGT ATTACATTAA ACGATGTCAA CCGGTATGTC
CAGTAG
 
Protein sequence
MHLAKRIRED YRVERNIRTD LVLEAHELLK ENEFKNERRE PPGVDVENDG TEDIRITRVR 
VTSPTGEAAI GKPMGNYITL EVPRLKENDQ ELYEETCKAL AKELTRVLNL KDDSTILVIG
LGNWNVTPDA LGPKVVSRLM VTRHLLEYVP DQVDEGVRPV CAVSPGVLGI TGIETGEIVR
GIVDRVKPDV VIAIDALASR KMERVNTTIQ IADTGISPGS GVGNKRMELS RETLGVPVIA
IGVPTVVDAA TMANDTIDLV IDNLIREAKE DSHFYNMLKN IDRNEKYQLI QEVLQPYVGN
LVVTPKEIDD VVDRIAKVIA NGLNIALHQG ITLNDVNRYV Q