Gene Cthe_2387 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2387 
Symbol 
ID4811039 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2853597 
End bp2854868 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content45% 
IMG OID640107800 
Productgamma-D-glutamyl-{L}-meso-diaminopimelate peptidase I. metallo peptidase. MEROPS family M14C 
Protein accessionYP_001038782 
Protein GI125974872 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2866] Predicted carboxypeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCAAGTAC TAAGACTTGG CTCCGTCGGA CCTGATGTAA AACTGGCTCA AAGCCTGCTT 
AATAAAATTG GCTATCCGGT CGGGGCTGTG GACGGAATAT ACGGAACACG AACCCGGCAG
GCGGTAATTG CTTTTCAGAG AAACAACGGA CTTGTGGCCG ACGGAATTGT GGGACCGGCA
ACGTGGAGCG TCTTTGAACA ATTTCTGAGA GGATACGCAA TTTACTATGT CAGACCCGGA
GATACTTTAT ATAATATTGC CGGAAGGTTT TATACATCGG TCAACTCCAT AGTAACAGCC
AATCCGGGAA TAAACCCCAA TGTCATAAAT ATAGGGCAAA GGTTGGTGGT TCCCTACGGA
ATAGATGTGG TATTTACCGA CATTGACTAT ACTTATGAAA TAATGGAAAG GGATATCCAG
GGTCTTAAAG CAAGATATCC GTTCCTGGAG ACAGGAGTCG CGGGCACCAG TGTCCTTGGA
CGAAATCTTT ATTATCTGAG ACTCGGAACC GGTCCAAGAC AGGTATTTTA CAATGCCGCC
CATCATGCCA TTGAATGGAT AACCACCGTG CTTCTTATGA AATTTGCGGA AAACTTCCTG
AAAGCATATT CCACCGGCAG CAGGATTCGT GGTTATAATG TAAGGGAAAT ATGGAATCAA
AGCAGTATAT ACATTGTACC CATGGTAAAT CCGGACGGAG TGGATCTTGT CCTTAACGGA
CTTAGCCCGA CAAACCCGTA TTATGCGGAC CTGCTGCGCT GGAATACCAC AGGCAGACCG
TTTTCCCAGG TGTGGAGTGC CAATATCAGG GGAGTTGATT TAAACAGAAA TTATCCGGCA
AGTTGGGAAG AAGCAAAAGC GCAGGAAGAA GCATTGGGTA TATTCGGTCC TGGCCCCACA
AGATACGGAG GACCGTATCC TCTTTCAGAG CCCGAGTCAT CCGCCATGGT GAGCTTTACA
AGAACTCATG ATTTCAGGCT TGCCCTGGCA TACCATTCGC AGGGAAGAGT AATATACTGG
AACTATTTAA ATCTTGCTCC ACCTGAGTCC CTGACAATTG CAAATGCTTT TGCAAGGGTA
AGCGGATACA TTGTTTTGGA TGTCCCTTAC GAGGCTGCCT ATGCCGGATA CAAGGATTGG
TTTATACAGG AGTACAGAAG ACCCGGATTC ACTATTGAAG TGGGATTGGG GCAAAATCCT
CTTCCCATAT CCCAATTTAA TACTATTTAT AATGATAATG AAGAAATTCT GCTTCTTGCA
TCTTTAATTT AA
 
Protein sequence
MQVLRLGSVG PDVKLAQSLL NKIGYPVGAV DGIYGTRTRQ AVIAFQRNNG LVADGIVGPA 
TWSVFEQFLR GYAIYYVRPG DTLYNIAGRF YTSVNSIVTA NPGINPNVIN IGQRLVVPYG
IDVVFTDIDY TYEIMERDIQ GLKARYPFLE TGVAGTSVLG RNLYYLRLGT GPRQVFYNAA
HHAIEWITTV LLMKFAENFL KAYSTGSRIR GYNVREIWNQ SSIYIVPMVN PDGVDLVLNG
LSPTNPYYAD LLRWNTTGRP FSQVWSANIR GVDLNRNYPA SWEEAKAQEE ALGIFGPGPT
RYGGPYPLSE PESSAMVSFT RTHDFRLALA YHSQGRVIYW NYLNLAPPES LTIANAFARV
SGYIVLDVPY EAAYAGYKDW FIQEYRRPGF TIEVGLGQNP LPISQFNTIY NDNEEILLLA
SLI