Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2452 |
Symbol | |
ID | 4809831 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2925365 |
End bp | 2927173 |
Gene Length | 1809 bp |
Protein Length | 602 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640107866 |
Product | oligopeptidase F |
Protein accession | YP_001038847 |
Protein GI | 125974937 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1164] Oligoendopeptidase F |
TIGRFAM ID | [TIGR00181] oligoendopeptidase F |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGAAT CAAAAACAAA TGCACTTCCA AAAAGAGATG AAATAGACAG TAAATACAAA TGGAAGCTTG AACATATATA TGCCGGTATT GACGATTGGG AAAGAGATTT CAGCAAAGTA AAAGAATATA TATCCCAAAT AGTTAAATTC AAAGGTACAT TGGGCAAAGA TTCAAACACA CTCTTAGAAT GCTTAAAACT CAGCAATGAA CTGATGTCCA CCAATGACAG GGTCTTTGTG TACGCCCGTA TGAAAAAGGA TGAAGACAAC TCAAATTCCA CATACCAGAG CCTGGCCGAC AGAGCATCCG CCCTTATGAC CGAAGCTTAC GCAGCCACAT CTTTTATTGT GCCTGAAATA CTCACCATCC CTGAGGAGAA ATTAAACAAA TATCTTGAAG AAAACAAAGA CCTTCAGCTG TACCGTCAGT TTTTCCGCGA AATTTTGCGT CAAAAAGAGC ATGTGCTTTC GGAAAAAGAA GAGGAATTGC TGGCCCTTGC CTCAGAAATG GCCGGATCTC CGAGGGAAAT TTTTACCATG TTCAACAATG CAGACATTAA GTTTCCCTTT ATAAAAGATG AAGACGGGGA AGAAGTGGAA CTTACCAAGG GCAGATACAT TAAATTTCTT GAAAGCAAAG ACAGGAGAGT CCGCAAAGAT GCATTCCAGG CACTTTACAG CACTTATGCC AAATTCAAAA ATACCATTGC GGCTTCACTT GTCGGAAGTA TCAAAGCCTC CAAATTTTAC GCAACTGCCG CCAAATACGA TTCATCCCTT GAAGCTTCTT TAGATGCTGA CAACATAAGC GTGGACGTGT ATGACAACTT AATTGAAACG GTAAATAAAA ACCTTCATCT TCTCCACAGG TACCTGAAAC TCAGGAAAAA GGCTTTAAAG CTTGACGAGC TTCATATGTA TGACCTGTAT GTTCCAATTG TCGAGGAATC AAAAAAGAAC ATTCCCTATG AGGAAGCTTT AAAGATGGTG GAAGCAGGAC TTCGCCCTTT GGGGGAAGAA TATATCTCGC ACCTTAAGGA AGGCTTTACA AACGGCTGGA TAGATGTGTA CGAAAACCAG GGCAAAACCA GCGGCGCATA TTCATGGGGA GCATACACAA CGCATCCATA TGTCCTCTTA AACTACCAGG GCACAATAAA TGACGTGTTC ACCATAGCTC ATGAAATGGG CCATGCTCTC CATTCGTATT ACACCAACAA AACCCAACCC TATGTTTATT CGGAATATAA AATCTTTGTT GCAGAAGTGG CATCAACAGT GAACGAAGCC CTGCTTATGA ATTACCTTCT TGACAAAACG AAAGACAAAA CGGAAAAAGC CTATCTTCTG AATCATTATC TAGAACAGTT CCGTGGCACT GTTTACAGGC AGGTTATGTT TGCCGAGTTT GAAAAAACAG TACACATGAA ACATAAAAAC GGAGAACCTT TGACAGCGGA TATCTTAAGC AACATATACT ATGATCTTAA CAAAAAATAT TTTGAAGCTG AGGTAAATGT GGACGAGGAA ATATCCATGG AATGGGCAAG AATTCCCCAT TTTTACACCA GCTTCTACGT TTACAAATAC GCCACAGGCT TTTCATCCGC AATCGCCATA TCGGACATGA TCCTAAAAGA AGGACAGCCT GCAGTGGACA GATACATCAA ATTCTTAAAA AGCGGAAGCT CCGATTATCC ACTGGAACTT CTTAAAATTG CCGGAGTCGA CCTTTCCACG CCAAAACCGG TGCAGGACGC CCTGGATGTG TTTGAAAAAA TCCTTGGGGA ACTTGAAGCG CTTATATAG
|
Protein sequence | MAESKTNALP KRDEIDSKYK WKLEHIYAGI DDWERDFSKV KEYISQIVKF KGTLGKDSNT LLECLKLSNE LMSTNDRVFV YARMKKDEDN SNSTYQSLAD RASALMTEAY AATSFIVPEI LTIPEEKLNK YLEENKDLQL YRQFFREILR QKEHVLSEKE EELLALASEM AGSPREIFTM FNNADIKFPF IKDEDGEEVE LTKGRYIKFL ESKDRRVRKD AFQALYSTYA KFKNTIAASL VGSIKASKFY ATAAKYDSSL EASLDADNIS VDVYDNLIET VNKNLHLLHR YLKLRKKALK LDELHMYDLY VPIVEESKKN IPYEEALKMV EAGLRPLGEE YISHLKEGFT NGWIDVYENQ GKTSGAYSWG AYTTHPYVLL NYQGTINDVF TIAHEMGHAL HSYYTNKTQP YVYSEYKIFV AEVASTVNEA LLMNYLLDKT KDKTEKAYLL NHYLEQFRGT VYRQVMFAEF EKTVHMKHKN GEPLTADILS NIYYDLNKKY FEAEVNVDEE ISMEWARIPH FYTSFYVYKY ATGFSSAIAI SDMILKEGQP AVDRYIKFLK SGSSDYPLEL LKIAGVDLST PKPVQDALDV FEKILGELEA LI
|
| |