Gene Cthe_2452 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2452 
Symbol 
ID4809831 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2925365 
End bp2927173 
Gene Length1809 bp 
Protein Length602 aa 
Translation table11 
GC content41% 
IMG OID640107866 
Productoligopeptidase F 
Protein accessionYP_001038847 
Protein GI125974937 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1164] Oligoendopeptidase F 
TIGRFAM ID[TIGR00181] oligoendopeptidase F 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGGAAT CAAAAACAAA TGCACTTCCA AAAAGAGATG AAATAGACAG TAAATACAAA 
TGGAAGCTTG AACATATATA TGCCGGTATT GACGATTGGG AAAGAGATTT CAGCAAAGTA
AAAGAATATA TATCCCAAAT AGTTAAATTC AAAGGTACAT TGGGCAAAGA TTCAAACACA
CTCTTAGAAT GCTTAAAACT CAGCAATGAA CTGATGTCCA CCAATGACAG GGTCTTTGTG
TACGCCCGTA TGAAAAAGGA TGAAGACAAC TCAAATTCCA CATACCAGAG CCTGGCCGAC
AGAGCATCCG CCCTTATGAC CGAAGCTTAC GCAGCCACAT CTTTTATTGT GCCTGAAATA
CTCACCATCC CTGAGGAGAA ATTAAACAAA TATCTTGAAG AAAACAAAGA CCTTCAGCTG
TACCGTCAGT TTTTCCGCGA AATTTTGCGT CAAAAAGAGC ATGTGCTTTC GGAAAAAGAA
GAGGAATTGC TGGCCCTTGC CTCAGAAATG GCCGGATCTC CGAGGGAAAT TTTTACCATG
TTCAACAATG CAGACATTAA GTTTCCCTTT ATAAAAGATG AAGACGGGGA AGAAGTGGAA
CTTACCAAGG GCAGATACAT TAAATTTCTT GAAAGCAAAG ACAGGAGAGT CCGCAAAGAT
GCATTCCAGG CACTTTACAG CACTTATGCC AAATTCAAAA ATACCATTGC GGCTTCACTT
GTCGGAAGTA TCAAAGCCTC CAAATTTTAC GCAACTGCCG CCAAATACGA TTCATCCCTT
GAAGCTTCTT TAGATGCTGA CAACATAAGC GTGGACGTGT ATGACAACTT AATTGAAACG
GTAAATAAAA ACCTTCATCT TCTCCACAGG TACCTGAAAC TCAGGAAAAA GGCTTTAAAG
CTTGACGAGC TTCATATGTA TGACCTGTAT GTTCCAATTG TCGAGGAATC AAAAAAGAAC
ATTCCCTATG AGGAAGCTTT AAAGATGGTG GAAGCAGGAC TTCGCCCTTT GGGGGAAGAA
TATATCTCGC ACCTTAAGGA AGGCTTTACA AACGGCTGGA TAGATGTGTA CGAAAACCAG
GGCAAAACCA GCGGCGCATA TTCATGGGGA GCATACACAA CGCATCCATA TGTCCTCTTA
AACTACCAGG GCACAATAAA TGACGTGTTC ACCATAGCTC ATGAAATGGG CCATGCTCTC
CATTCGTATT ACACCAACAA AACCCAACCC TATGTTTATT CGGAATATAA AATCTTTGTT
GCAGAAGTGG CATCAACAGT GAACGAAGCC CTGCTTATGA ATTACCTTCT TGACAAAACG
AAAGACAAAA CGGAAAAAGC CTATCTTCTG AATCATTATC TAGAACAGTT CCGTGGCACT
GTTTACAGGC AGGTTATGTT TGCCGAGTTT GAAAAAACAG TACACATGAA ACATAAAAAC
GGAGAACCTT TGACAGCGGA TATCTTAAGC AACATATACT ATGATCTTAA CAAAAAATAT
TTTGAAGCTG AGGTAAATGT GGACGAGGAA ATATCCATGG AATGGGCAAG AATTCCCCAT
TTTTACACCA GCTTCTACGT TTACAAATAC GCCACAGGCT TTTCATCCGC AATCGCCATA
TCGGACATGA TCCTAAAAGA AGGACAGCCT GCAGTGGACA GATACATCAA ATTCTTAAAA
AGCGGAAGCT CCGATTATCC ACTGGAACTT CTTAAAATTG CCGGAGTCGA CCTTTCCACG
CCAAAACCGG TGCAGGACGC CCTGGATGTG TTTGAAAAAA TCCTTGGGGA ACTTGAAGCG
CTTATATAG
 
Protein sequence
MAESKTNALP KRDEIDSKYK WKLEHIYAGI DDWERDFSKV KEYISQIVKF KGTLGKDSNT 
LLECLKLSNE LMSTNDRVFV YARMKKDEDN SNSTYQSLAD RASALMTEAY AATSFIVPEI
LTIPEEKLNK YLEENKDLQL YRQFFREILR QKEHVLSEKE EELLALASEM AGSPREIFTM
FNNADIKFPF IKDEDGEEVE LTKGRYIKFL ESKDRRVRKD AFQALYSTYA KFKNTIAASL
VGSIKASKFY ATAAKYDSSL EASLDADNIS VDVYDNLIET VNKNLHLLHR YLKLRKKALK
LDELHMYDLY VPIVEESKKN IPYEEALKMV EAGLRPLGEE YISHLKEGFT NGWIDVYENQ
GKTSGAYSWG AYTTHPYVLL NYQGTINDVF TIAHEMGHAL HSYYTNKTQP YVYSEYKIFV
AEVASTVNEA LLMNYLLDKT KDKTEKAYLL NHYLEQFRGT VYRQVMFAEF EKTVHMKHKN
GEPLTADILS NIYYDLNKKY FEAEVNVDEE ISMEWARIPH FYTSFYVYKY ATGFSSAIAI
SDMILKEGQP AVDRYIKFLK SGSSDYPLEL LKIAGVDLST PKPVQDALDV FEKILGELEA
LI