Gene Information |
Locus tag | Athe_0438 |
Symbol | |
ID | 7407515 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 497322 |
End bp | 500795 |
Gene Length | 3474 bp |
Protein Length | 1157 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643714825 |
Product | S-layer domain protein |
Protein accession | YP_002572343 |
Protein GI | 222528461 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
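The coordinates and lengths in the table above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the unspliced CDS ends in a stop codon that is not counted in the protein length. The helper names `gene_length` and `protein_length` are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 497322
END_BP = 500795
REPORTED_GENE_LENGTH = 3474     # bp
REPORTED_PROTEIN_LENGTH = 1157  # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 3474 1157
```

Under these assumptions both computed values agree with the reported Gene Length and Protein Length.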
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAAA GTTGGGGTAA GAAACTTTTT GCTTTATTGA GCTTACTGAG TTTATTATTA AGCTTTTTGG TAAATACTTC ATTTTCTCAA AACCTTTCAT ACTATCAGCA AGCAGCACAG GTTTTAAAGG AAAAAGGAAT AATGACAGGT GACACAAAAG GAAATTTGAA TCTTGACAAA CCTCTCAAAC GTTCAGAGAT TTCCAAAATG ATTATCATGC TGCTTGGCAA AAAGCCTTTA GCTGATTTTT ATGCAAATCA AAAAAAATCT TCTTTTAAAG ATGTTAAGAC AAATTACTGG GGACTTGGCT ACATAGAAGC AGCAAAAGCA ATAGGATTGA TTTCAGGGTA TACAGATGGT ACTTTCAAGC CAGAACAGTA CTTAAAAGTT GAAGAGTTGA CTGCTATAGT TGTAAGAGCA CTTGGTGTAA AGGAGTCTGA GCTCAAAGGC AAGTGGCCAC TAAATTATAT CCAGAAAGCA TATTCGATGA ATATTTTTTA TGGAATAGAA TCCGAAATTG ATATAGGAAA GCTTGTCACA AGAGGGCAGA CGGCAGCTAT ACTTTACAAT GCGTTTTTGA ATGAAAGTCT GAAAGCTGCA AAACCTGTCG GGCTTGAAAT AATTGACCTG CAAACTTTAA AGGTAACATT TGATAAGGAG CTTTCTTCAA TTGTTAAATC TGACTTTTCG TTTGATGGTG GGCTTTCTGT TTTGGACGCA AAGTTTGCAG ACTCAAGCAA AAAGGCTGTT GAGATAAAAA CATCTTTGCA GCAAGAAGGG AAAGAGTACA CGCTTTTTTA CAAAGGGCAA ACTACAACTT TAAAGTTTGT GGCAAAGACA ATGCCTTTTT CTTTTGCAGA GGATATCAAA ATAGAGAGTT TAAAGAAAGT GGATTTAAAG TTTACAAAGC CGATTTCAAA GAGCCAGCAG GATAACCTGC CGATAAAAAT TTATGTCAAT GGCAAAGAAG TTACAGATGT GAAAAAGTTT ATTTCAAGCG ATTATAAAAC TGTGAGTATA ATCTTCCCAA ACAAACTAAA TCAAACTGAC AAATTGATGG TTGAGATTTC AAACCTTCTT TCAGAAACAG GTCAGAGCTT GACTCTTACA AAGGAACTAA CAGTTATTGA TGCAACCCAG CCAAAGGTTG TGGATTTTAA GGTGGTCAAT AGCAAAAGGT TTAAGATTAT TTTTTCTGAG CCTATGAACA TTGATTCGAC AAGTACTTAC AAGGTATGCG ATTTGTCTTC GGTTGGGGCA AACATTAGAA TTGATTCAAA TTATGCATAT GCCAAGCTCA CACCAAAACA TCAAGAGAAT GCTATTGACG TTGAACTTTT ATATCCTCTT GCTGATGGGA ATCACACAGT TGAAATAACA GAAGCGAAAG ACTTTGCAGG ATACAAAGCC CCTGACTTTA AAGCTACATT TACAACAGTG CTTGAAAAAA ATCCGCCAAA GCTTGTGTCC TTAGACCTTG TTTCAAATAA TCAAATAAGG CTTGTGTTTG ATGAAGAGAT AAGAAGCTTA GATGGTTTAA TTCCAACAGG CGAGTATGAG GTTTATCAAG CACAGGACAG TACCAACCAT GCAATTGGCG CAAAAATAAC ACTTCTTTCA GATGAAAAAA CAATTGACAT TCAGTTAAAT CCACAATTGA AGCTTGATAG CAGGGCACTT GTTTCATTTG AAGTGAGGTT CCGATACGTT GAAGACCTTC TTGGCAACAA GGTATCAGAT TGGGTATCAG TAACATCCAA AGCTCAGGAT GATACTACAA AACCGGCAGT CAAAAGTGTT 
GAGGTTTTGG ATGGGAATAT AATTAAGGTA ACATTTACTA AAAATGTAAA TGCTACTGAT AAGGTTCAAA GCTTTTCCCT TCTTTCAGGA GATGGCACAC AAATAGTGGA AGCTTATGCA AAAAGCGTAA AACCGCTTAA GGAGGAAGAT AATTCAACAT TTGCTGTTGA GTTTTCGACA CTTGCAGCAA TAAATGGTGG AAGATATACT TTGAAGATTT TAGGCATCTG CGATACATCT GTGAGAGAAA ATGTTATGGA TACTGTTTCA TTTGCGATAG ATGCTAAAGA CACTCTTGCA CCAACTATCA CAGCTGCAAT TGCGAAGTAT GATTCTTCTT CGGATGTAGA CAAAATAGAC ATATTTTTCT CAGAACCTAT GGATGTTGAA AAATTAAAAA ATTTGAGCAG TTATTTTGTA GGGGCATCAA GTGCAACAAT TCCACTTTCG AGCGTAAAAG GAGCAAAAAT TGATTATATT TCACCAAACG GCGACAGAAT CACCCTTTTA ATTCCAGGGG CAGACGATTC AACGCCTGGC AGATGGAGTC AATCTGGAGC TGTAGTTGAC AAATTGGCAG CTCCTACACT CACTGACAAG GCAGGCAATT TTATAGCAAA TGCTACAATA GCAATGCCAC TTTCTGTGTC GGCAAACTTT AGAGGAATAT CTGCGCAAGA CATTGAGGTT GTTGCAGTGG ACAAAAATAC AATTGAAATA AGAAGCTTGA ATGGATACAT CTTTGCATCG TTTGACCCTG CAGCGATAAT GTTCAGAAAC GCATATTCGA CTTCAAGTTT AAATGGAAAT CCTGACAACG ATAAAGTGGT TAGCCTTGGA ATTGTAAGTT ATACAATTTC TCAAGATAAA AAGACCATTA CTTTAAAAAC ATCAATTTCT TTGACCTCAA GCGCTATGGC TGATACAAAT GATTCCGGTC AAGATGCTGA ACAGCTGAAA ATATTTACAG TAAATTCGAA TATAAAAGAC CAGTTTGAAC AGAACCTTGT AATTCTGCCA ACATTTGATA TTAACTTTTA TCCGTCGATA TTACTCAAAG ACAAAATTTC TCCACAGCAG ACAGGGGTTT CTGTTGGAAG TGGAAATCAA TCAGACACAA TAGCTATTAC ATTTGATGAG CCAGTTTTTG CCTTGCCAGG TATAAATAGT ACAGTTCTGG CTGCAGGAAT TGAGCTAAAG GTTAGTGGTA CTACTTTAAT TCCAGATGTT GATTACACAG CTTACGTCCA AAACGGTATA GTTTATGTGA AAGTCAAAAA GTCGGGGATT GTGGATAGCA AGGTAAGCTT GGAGATAAAA AGACCAGATT TAATAGTAGA TAGCAATGGA AATCCTTCTA TTGTCTTGAA AGCTCAAACT GTTGAGCATG TGACAGAAAG GACTTCCCCT GATGTCACAG CAGAGTTTTC TTCGACTGAT ACAAGAAAAG TCAAGCTCAC ATTTTCTGAA CCTATGGATG CTTCTACACT AATTGCTCAA AACTTCTCAT GCGTTGCAGG CGGTAATATC ATAAGTTTTG TAAAATCTTC TGATAACAGA GTTATTGAGA TCACATTCAC AAACCCACTC CCGGCAGGAA GCATTGTGAA TATATCACCG AATGTCAAGG ACTTGGCTGG AAATTCGGTG TCAGTTCAGG CTGTGAGAAA ATAG
|
Protein sequence | MNKSWGKKLF ALLSLLSLLL SFLVNTSFSQ NLSYYQQAAQ VLKEKGIMTG DTKGNLNLDK PLKRSEISKM IIMLLGKKPL ADFYANQKKS SFKDVKTNYW GLGYIEAAKA IGLISGYTDG TFKPEQYLKV EELTAIVVRA LGVKESELKG KWPLNYIQKA YSMNIFYGIE SEIDIGKLVT RGQTAAILYN AFLNESLKAA KPVGLEIIDL QTLKVTFDKE LSSIVKSDFS FDGGLSVLDA KFADSSKKAV EIKTSLQQEG KEYTLFYKGQ TTTLKFVAKT MPFSFAEDIK IESLKKVDLK FTKPISKSQQ DNLPIKIYVN GKEVTDVKKF ISSDYKTVSI IFPNKLNQTD KLMVEISNLL SETGQSLTLT KELTVIDATQ PKVVDFKVVN SKRFKIIFSE PMNIDSTSTY KVCDLSSVGA NIRIDSNYAY AKLTPKHQEN AIDVELLYPL ADGNHTVEIT EAKDFAGYKA PDFKATFTTV LEKNPPKLVS LDLVSNNQIR LVFDEEIRSL DGLIPTGEYE VYQAQDSTNH AIGAKITLLS DEKTIDIQLN PQLKLDSRAL VSFEVRFRYV EDLLGNKVSD WVSVTSKAQD DTTKPAVKSV EVLDGNIIKV TFTKNVNATD KVQSFSLLSG DGTQIVEAYA KSVKPLKEED NSTFAVEFST LAAINGGRYT LKILGICDTS VRENVMDTVS FAIDAKDTLA PTITAAIAKY DSSSDVDKID IFFSEPMDVE KLKNLSSYFV GASSATIPLS SVKGAKIDYI SPNGDRITLL IPGADDSTPG RWSQSGAVVD KLAAPTLTDK AGNFIANATI AMPLSVSANF RGISAQDIEV VAVDKNTIEI RSLNGYIFAS FDPAAIMFRN AYSTSSLNGN PDNDKVVSLG IVSYTISQDK KTITLKTSIS LTSSAMADTN DSGQDAEQLK IFTVNSNIKD QFEQNLVILP TFDINFYPSI LLKDKISPQQ TGVSVGSGNQ SDTIAITFDE PVFALPGINS TVLAAGIELK VSGTTLIPDV DYTAYVQNGI VYVKVKKSGI VDSKVSLEIK RPDLIVDSNG NPSIVLKAQT VEHVTERTSP DVTAEFSSTD TRKVKLTFSE PMDASTLIAQ NFSCVAGGNI ISFVKSSDNR VIEITFTNPL PAGSIVNISP NVKDLAGNSV SVQAVRK
|
| |
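As a rough illustration of how the gene and protein sequences relate, the following sketch translates a nucleotide string codon by codon. It is not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which does not matter here because the gene begins with ATG). The function names `clean`, `translate`, and `gc_percent` are hypothetical helpers, and only the first 30 and last 15 bases of the gene sequence are used so the block stays short.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks and the final 15 bases of the gene sequence above.
gene_start = "ATGAATAAAA GTTGGGGTAA GAAACTTTTT"
gene_end = "GCTGTGAGAAAATAG"

print(translate(gene_start))  # MNKSWGKKLF
print(translate(gene_end))    # AVRK
```

The two translated fragments match the first block and the last four residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` should give a value comparable to the reported GC content, although the exact rounding used by the database is not stated here.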