Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_2295 |
Symbol | |
ID | 7407714 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2426577 |
End bp | 2429801 |
Gene Length | 3225 bp |
Protein Length | 1074 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643716659 |
Product | S-layer domain protein |
Protein accession | YP_002574138 |
Protein GI | 222530256 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAAAA AGTTTTTAGC AGTCATTTTG TTACTTTGCT TTGTGGTTGT AAACTTTGGT TTGTCTTCTG CTTTTGCTGC ATACAAAGAC ATTCCATCAA ATGCAAGTTA CAAGCAGGCA GTAGAAAAGT TAAATAAGCT TGGGATTCTT GTTTACAAGG ACTATTTCAA GCCAAATGCT GCTGTCTTAC GCGGTGAGTT TGCAGCTGCG ATTGTAAAGA TTTCAAACGT AGAGGATGAG GTGAATCTGC TAAAAGGATA TTCTCAGTAT CCAGATATAA AGCCAAACAC CACACTTTGT GGATATGTCA ACTGGGCGGT AAAGAAAAAA TACATGACAC CAATGGCAGA CAATAAGTTC CATCCAAATG ACCCGCTTAC CTTTGCCCAG GCAACAACTG CTATTGTGAG GATGCTTGGG TATTCCGACT CAGATCTTTC TGGCATCTGG CCGCAGAATT ATATCGACAA AGCATCAGAG CTTGGGCTTA TAAAAGGAAT AAACCTTTCT GCGTCGCAAA AGGTTCCGCG CTGGGCTGCG GCACTGATGC TATCAAGGCT TCTTGATACT TATGTCAAAA GCGGTGGAAA TCAAGCTCAA TCTGGCCAGT CAGCTTTAAG TGGAGTATCA GCTTCATCCC AAAGCAACGG CACAAAATTT TCTGAGTATG TTGGGCTTTA CAAATCGTAT GTTGTGCTTG ATACCGGCAA AACTTCTTCA AAGCTTCTTC CAAATGAGGT TTTGACAGAC AGCGGAGTGC TTGTGAATGC AACAAAGACG CAACTTGAAG TTGGGAAAAA GTACATGCTT CAGGTTGATA CCAATAAAAT CACGAAAGTG TTTGGCACTG AAGCTGATTC TTTCCAGATT GTCAGCACAA AGGTAAGCAG CAGGACTGTA TATTACAAGG AAAGTGGAAA GACAAAATCA ATAACTTTGC CATCTTCGGC AACATACTAT TACAACGGCT CAAAGCAGAG CTATGATGCA ATAGAAAATG TACTTAAACC GAACCAGAAA ATAAGCTTTA TCTATTCTGA AGATAGGAGC AAAGTGGACT ATATTGTAAT TAAAGACATA TATGCACAAG AGGTTTATGG AAACTACGAT GAGGTGCTGA TTTTGGCAAC TCCTAAAACA TCATCGTCGT TAGAGGCAAA CCAGGTTCAG ACAGACAAGG GAATATACTT TGTTGCATCT TCAATAAAAC CTGAAAACCT TGAAATTGGA GCAAAGTATG GAGTGTATAT AAAAGATGAT ACAATCACTG CAGCTTTGCA AAAAGTGTGG GTATCAGAAA AGTTTACAAT TACAAATATA GATGATTACA CACTTGATGC TGCTCAAAAC GGCAAAACAC AAAGGATTCA GCTGACAAGC AAACCTTTAT ATTACTATCA AGGAACAAAA CAGAGCTATG AAAATTTACC AAACATTTTA AAAGAAGACC AGATACTCTA TGTATCAAAA GACCCTGACA CAGGCAAGGT TATGGCATAT GTTATTCAAG ACCCATACGG CACCCAGTAT GGAAACTATA TTGAGGCAAT AATCCTGCAG GATGCACTTT TAAACCCTGC TTTAGAAAAC AATCAGGTTT TAACAGACAA AGGTATATTT TATTTACCTA ATATAAACAC AAAACTGGAG ATTGGCTCAA AGTATGGTGT TTATGTTAAG GATGACAAGA TAACATTAGT TGTAAAAAAA TTAAATACTG TGAATTTGTA TGAGATTACA GATGTTGTTA GTGATACAAA TGTAAAGTTA AAATCATCAA AAGGCCAGGA GAACATAATC CTGCCACAAA AACCTGTTTA CTACTATAAC GGCAATAAAA TTAGCTACAG TGATTTGAAA AACGCGTTAA AATCAGGTCA AAAAATCTAT TTTGGATACT CAAAGGATGG GAAAACTTGC GAGTATATTG TTCTTCAGGA CCCATATTCA TCTGAGTACG GTGCATATAC TGAGGTTATC GTTCTGGCAG ATGCGGTTGT ATCTGATAAG TTATCTACAA ACGAGGTTTT AACAGACAAA GGGATATACG CGGTAAAGTC CACAGCAGGC AAGCTTACAG TTGGTGCAAA ATATGGGGTA TATATCAAAG ATGATACAAT CACAAAGGTT GTAAAGAAGC TCAACAGCGT CGATACAGCA GAGATTACAG AGGTTATAAG CGATACAAAT GTTGTTCTCA AAAAAGGCAG CACAAGCAGC TCTACTTTCC TGCCACAAAA ACCTGTTTAT TATTACAATG GAAGTAAGGT TGACTACAAT AGTTTGAAAA ACATAATAAA ATCAGGGCAA AAGATTTATT TTGGATACAA CGCAGCAGGA AATTCCTATG AGTATGCAAT AATTCAAGAC CCATACTACG ACAGCTATGG AAAATATGTA GAGACAGTGA TTTTGGGGAC ATATTCAACT ACAAAAGGGC TTGATGTGAA TGAAATTTTG ACAGACCAGG GAATTTTAAC ATTGCCAGAA AACCAAAATG TGAATTTAGA ACTTGGTGCA AAATATGGGT TTTACATCGA CCAAGACAAT CAGATAACCC TTGTGTACAA AAAACTAAAT TCAACCGAAG GTGTGACAGT ACTTTCTGCC ATTTCAAACA AAGTGACAGT TGACAAGGGT GGTAGTCAAC TTGATATGAT ATTGCCTCAA AATATAACAT ATTACTATAA TGGTTCAAAG ATTGATTTTT CAACAGCACT TAGCAGACTT CAGATGTCAA CATCGCTTGT GTTTGGACTG TCAAGTAAGA AAAAAGGATA TGACTACTGT GTAATATTTG ACCCTGTATA CAGCAAGCCA TACATTGCAA ATGAGCAGAC ATACTTAACG CTAAAAGCAG GTGATCTGGA TATAAGTGGT AGTAGCAAAG TTATAAAAGA TGGGGATGTT GTGGACTACA GCTATATTCA GAAAAACGAT GTTGTATATG CTGTGACAGA TATCTGGGGC GGCAACAAGT TCATACTTGT TGTAGATAGC AAAGTTGAGG GTTACGTCAA GAGCTACCAA CCAACAAGGT TTACACCAAA GTCTATTGTT GTGAACGTAT ATGACCAGGT ATCAGGAAAG CTTGTAGACA AGACATATGA GGTGAGCGAA GACTTTGACC CATCTGTGCT TTTGGCAGAT ACATTTAAAG TTGGTCAGAG AGTGTACCTC ATCTTAGGAT ACGATGGCAA GGTTGTGAGC ATTGTAAATC CGTAA
|
Protein sequence | MRKKFLAVIL LLCFVVVNFG LSSAFAAYKD IPSNASYKQA VEKLNKLGIL VYKDYFKPNA AVLRGEFAAA IVKISNVEDE VNLLKGYSQY PDIKPNTTLC GYVNWAVKKK YMTPMADNKF HPNDPLTFAQ ATTAIVRMLG YSDSDLSGIW PQNYIDKASE LGLIKGINLS ASQKVPRWAA ALMLSRLLDT YVKSGGNQAQ SGQSALSGVS ASSQSNGTKF SEYVGLYKSY VVLDTGKTSS KLLPNEVLTD SGVLVNATKT QLEVGKKYML QVDTNKITKV FGTEADSFQI VSTKVSSRTV YYKESGKTKS ITLPSSATYY YNGSKQSYDA IENVLKPNQK ISFIYSEDRS KVDYIVIKDI YAQEVYGNYD EVLILATPKT SSSLEANQVQ TDKGIYFVAS SIKPENLEIG AKYGVYIKDD TITAALQKVW VSEKFTITNI DDYTLDAAQN GKTQRIQLTS KPLYYYQGTK QSYENLPNIL KEDQILYVSK DPDTGKVMAY VIQDPYGTQY GNYIEAIILQ DALLNPALEN NQVLTDKGIF YLPNINTKLE IGSKYGVYVK DDKITLVVKK LNTVNLYEIT DVVSDTNVKL KSSKGQENII LPQKPVYYYN GNKISYSDLK NALKSGQKIY FGYSKDGKTC EYIVLQDPYS SEYGAYTEVI VLADAVVSDK LSTNEVLTDK GIYAVKSTAG KLTVGAKYGV YIKDDTITKV VKKLNSVDTA EITEVISDTN VVLKKGSTSS STFLPQKPVY YYNGSKVDYN SLKNIIKSGQ KIYFGYNAAG NSYEYAIIQD PYYDSYGKYV ETVILGTYST TKGLDVNEIL TDQGILTLPE NQNVNLELGA KYGFYIDQDN QITLVYKKLN STEGVTVLSA ISNKVTVDKG GSQLDMILPQ NITYYYNGSK IDFSTALSRL QMSTSLVFGL SSKKKGYDYC VIFDPVYSKP YIANEQTYLT LKAGDLDISG SSKVIKDGDV VDYSYIQKND VVYAVTDIWG GNKFILVVDS KVEGYVKSYQ PTRFTPKSIV VNVYDQVSGK LVDKTYEVSE DFDPSVLLAD TFKVGQRVYL ILGYDGKVVS IVNP
|
| |