Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0298 |
Symbol | |
ID | 7407615 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 343477 |
End bp | 344439 |
Gene Length | 963 bp |
Protein Length | 320 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 643714688 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_002572211 |
Protein GI | 222528329 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000146885 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCACAAAC ACAAGAAGCT TTTTTTACTG TCAGGTGCTA TCTTTGCAAT ATTATCATCA ATACTTGCTG CCAATCTCTA CATGAGTAAT AATCCTTCTA CACAAAACCC TCAACAAGCC TATAAAAAGC AAATTGTTCC CTGGAGCTAC AAAAAACTTG GTATCACTAA AATGTGGAAA TTTACAAGAG GCAAAAATGT TAAAATTGCT ATTCTGGATT CTGGTATTGA CCTGAACCAT CCTGACTTAA AAAGTGCAAA TATTATCAAA ACTATTAACT TTATTGAGCC AAACAAACCC GCATCAGATG AAACAGGACA TGGAACTTTT ATCGCAGGTA TAATCGCAGC TCAAAATAAC AACTTTGGTA TTGTCGGCAT TGCACCTGAT GCTGAAATTT TCATCTTAAA AATCTTAAAT AAAAAACTTG AAGGAAAAGT TGACCTCGTT GTACGTGCTC TTGACTTTTG TATAAAAAAC AAGATTAACA TCGTAAACAT GAGTTTTTCT ACCTCATCTG ATAATCCAAA ACTCAGAAAA GCTGTTTCAA AAGCAGCAAA ACATAAAATA ATCATTGTTG CCTCGGCAAG AAATTCATTT GGTTCAAAAG CAGGCTTTCC TGCATCATAC CCCGAAGTTA TATCTGTTGC TTCTGTCAAC TGCAAAAATC AAATATCTCA GTTTTCTTCT CAAGGCAAAA TTGATTTTTG CTCTTATGGT GAAAATATTT TGTCCACAGC TCCAAACAAT AGCTACAAAC TCTCAAGTGG AAACTCTGTG GCTGCTGCAC ACCTGACAGC AATTATCGCT CTTATCTTAA GCAAACCAGA AAAGTGGGGC TTGCCCCCTA AACACAGCAT AAACAAAGAT AAAATCTATA ATGTATTGCT AAAACTTTCT GAAGACCTCG GTGAAAAAGG TAAAGATAAT ATATTTGGCT TCGGTCTTGT GAAATTTAAA TAA
|
Protein sequence | MHKHKKLFLL SGAIFAILSS ILAANLYMSN NPSTQNPQQA YKKQIVPWSY KKLGITKMWK FTRGKNVKIA ILDSGIDLNH PDLKSANIIK TINFIEPNKP ASDETGHGTF IAGIIAAQNN NFGIVGIAPD AEIFILKILN KKLEGKVDLV VRALDFCIKN KINIVNMSFS TSSDNPKLRK AVSKAAKHKI IIVASARNSF GSKAGFPASY PEVISVASVN CKNQISQFSS QGKIDFCSYG ENILSTAPNN SYKLSSGNSV AAAHLTAIIA LILSKPEKWG LPPKHSINKD KIYNVLLKLS EDLGEKGKDN IFGFGLVKFK
|
| |