Gene Information |
Locus tag | Cthe_2413 |
Symbol | |
ID | 4808128 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2881865 |
End bp | 2882878 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640107826 |
Product | NLP/P60 |
Protein accession | YP_001038808 |
Protein GI | 125974898 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0791] Cell wall-associated hydrolases (invasion-associated proteins) |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000144369 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTTCAA AAAGTGCAGG AATGAAAAAG ATATTGCTGG TTGCGGTGAT TTTAGGCATA TATGGACTCT TTATGTGCGG TTGCGGAAAA GTGTCCCAAA AAGTGAGCAG CAACATACCC AATGGCAGCA ACGGGGCTGA AGAAGATGTT GCAAAGCCGA AAAGCAGTGA CGGCAAAAAT TTGGATATGG AGAACAAATA CAAAAACAAG GCGGTTGTAT TGGAAACAGT GGTTGATATT TTCAGAGAGC CTGATATAAA CTCGGAAAGG GTTACCCAGG CGATTTTCAA CCAGCCTGTT GAAGTGATTG AGGAAAAGGG CAGCTGGACA AAAGTGAAAG TTGTGGACGG ATATACCGGG TGGCTTAAGT CGAAGTTTAT TGACAGGGAT TGTACAAGCA TTATGGAGGA AAAGTATACT GACAGAGCCG TGATTACGGG AAAGACAAAA AAAGTTTATT CCTCCGCGGG AGGAGGAGTG ACGCTGAAAG ATGTTGTAAT GGGAACTGAG CTTTTTATAA AGGAAAAAAA GGACCGTTAT TACGAGGTTG CTCTGCCCGG AGGAATTACG GGATGGATTG ATACAAAAGA TACAATAAAA GTTCCGTCGG GAAGTCCGAT ACCGAAGACA TCCGCACAGG ATTTTGTGGC TACTGTGAGT AAATTCACAG GTACGCCGTA CCTGTGGGGA GGAGTCAGCA GCTGGGAAGG TGTTGACTGT TCGGGGCTTG TGTATATTTG CAGCAGAATA AACGGAGTGG ACCTGCCCAG GGATGCCGAC ATGCAGTTTG AATTTATAAA AACCGGGGTT GGAAGCGTGG AAGAATTAAA AGCCGGGGAC TTCCTCTTTT TCAGCTCCAG TGAAGAGCTT AAAGATGTAT CCCATGTAGG GGTTTATGTC GGAGACGGCA AGTTCATACA AGCGGCAAAA TCAAAGGGAA TGGTTGTTGA AACCGGGCTT GATACGGAAT ATTACCTTAA AAGGTTAAAA GGCATTAAAA GAATATTTGA ATAA
Protein sequence | MFSKSAGMKK ILLVAVILGI YGLFMCGCGK VSQKVSSNIP NGSNGAEEDV AKPKSSDGKN LDMENKYKNK AVVLETVVDI FREPDINSER VTQAIFNQPV EVIEEKGSWT KVKVVDGYTG WLKSKFIDRD CTSIMEEKYT DRAVITGKTK KVYSSAGGGV TLKDVVMGTE LFIKEKKDRY YEVALPGGIT GWIDTKDTIK VPSGSPIPKT SAQDFVATVS KFTGTPYLWG GVSSWEGVDC SGLVYICSRI NGVDLPRDAD MQFEFIKTGV GSVEELKAGD FLFFSSSEEL KDVSHVGVYV GDGKFIQAAK SKGMVVETGL DTEYYLKRLK GIKRIFE
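The length and GC fields in this record follow from the coordinates and the gene sequence, so they can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the source database: the function names are hypothetical, coordinates are assumed to be 1-based and inclusive, and the final codon is assumed to be an untranslated stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the last codon is a stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """GC content in percent; spaces and other non-ACGT characters are ignored."""
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found in sequence")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# Values taken from the record above (Start bp, End bp).
length_bp = gene_length(2881865, 2882878)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Passing the full "Gene sequence" string to `gc_percent` gives a value to compare against the reported GC content; the 10-base grouping spaces in the record do not need to be stripped first.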