Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_2342 |
Symbol | |
ID | 7407761 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2484219 |
End bp | 2487251 |
Gene Length | 3033 bp |
Protein Length | 1010 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643716706 |
Product | S-layer domain protein |
Protein accession | YP_002574185 |
Protein GI | 222530303 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAAAAA ATAAGATGAA AATAATAACC TTATATAGAT TAATAAGCTG GGTAGTTTTG ATTTTGTTTT TTTTGATGTT AATTTCCGCA AATATCCAAG CTGCTCAGGA GTATGTAGCA AGCAGCAGCT ACAGCAAGAC GGCAAAAAAT ACTCAAATTG TGAACTTGCC TCAAAATGCC TTGTCGAGAG AAGCGTTTTT GTTTTTGTCA GCTTTTGGCT TTTTCAATAC AGTTGAAAAA GCAAAGATAA ATCCTTCTGA TATTCTGTCA AAGGAAGAGG CGCTGTCTGT AATTTTAAAC AGCGTTGGCA AACAGCAGGA TGCTTTTGTG AGGGCTGAAA AGCTGGAACT AAAAAGACCA TCAGGTCAGA AGCTTGTAAA GCCATACAAC TATCTTTATC TTGGATATAT TCAGCTTGCA TATGATATGA AAATTTTGTC CAAAAAAGAG TATCAGGATG CGATATCTCA AGTGCAGCCA AGTGAAAAAG AACATGAGAA GATGACCCAG CAGCTGATAA AGAAAAATGA TGATACCATT GCAAAAGCTG TGTATGAAGG AAGACCATAT TCGTATGACG ATTTGATATT TGTAAGGTCT GCTCCTGCTA CCCGTCAGGA AGTTTGCTTA TGGGTGGTAA AGGCATTCAA AATACCTTTT ATTTATGAGA ATTTGGCTAA AACTTACCCT GATTATGACA GGATTGACAG TAAGTTTTTA AGTTCAATTA ATACTCTTTT GAAAAATGGT GCTCTTGTTG GAAGATCTGA TGGATATCTT CATCCAGACG ACTACATAAC ATATGAAGAG CTGGCATTTA TCCTCGGGAG TCTCAAACCA AATATCCTGA GCGCAAATGG GTTAAAAGAG GTAAAGCTTG AGGTAAAAGA TATCCAGAAG TTCAGCAGTG GCAAAACCAT TTTGGTGTGT GAGGATGAAA GTGGCAATAG CATAAATATT ACAGTCAACC CTGGCAAACA GGATTTTGGA GTAATAGCAA ATGGTAATTT TTTGAGCTCT TCCTATCTTC AGAAAGGGGA CTATGTAGCC TTTTATGTAA ATGGCAAAAA CGAGGTTGTG CTGGCCAGCA TTTTGCAAAG GCCTGGTCAG GAAAATATAA AGGGTGTAAT CTCCAGAATT GATGCTAAAA AGATGATATT TTCGGTAAAG CTTCCAGATG GCAAAGTTTA CAATTTGAGT TTAAGTCCAA AAGCTACAAT TTATGATTCA AACTCTGGAA AGAATTTAAA CTTTTCAGAA TTGAATGTAG GAAATCTTGT ACAGGTTAAA GTCCAAAAAG ACAGAGCAGA TGCCATAACA TTGCTTTCGC TTGACAGTCC TGAAATTGAA AGAGTACAGG GGATAATCTC AAGAATAACA AAGGACAGAA TTGTACTCAG TCAAAATGGA CAGCTAACAG AGTATCTTCT TTCACCAGAT ACAGTCTACA TTGATAAAGG TGATTTTTCA AGAGTTCTTA TAAAGAATGA TTTTTATGAG GGTATGAAAG TTTTGGCAGG TACGGCTTGC GGGTATGTAC AGTATCTCAG CACCACATAT GATGAGAAAT CTGAAGATAT AGTTTCCGGT ATTTTGCACG AGGTAGATTC AAACTTGGGG TATCTTGAGA TTTACAACCA ACAAGGTGAA AAAAAGAGTT ACAGATTTTC AAAAAAGCTT GGGCTAAAAG TGAAAAAGGA TGGTCAAAAT GCTTCTTTAG ACAGCCTTTT ACCAGGCGAT GTTGTTTTCC TCTATTTTAG CGGCGATTTT GTGCGCACAG TCACAGCAAG TTCAAACCTG CAGCAGAAGG TTGCAAAGAT TGAAAATGTT GTAAGAAGCC TTGCAAGCGG TCTTCCACAA AAAATAATTG TCAATATCGA TGGGAGAATA TATGGACCAT ATGAAATAAA TGACAACGTT GATATTGTAA AAAATGGCAT TTCTGCAGCT TTAAAGGATA TTATGCCAGG TCAGTATGTG AAGCTCACAG GGAGCTTCTT TGGAAACTCA GCGTACATCC GCAGAATAGA GATATCAGGG AATGAATATG TAAAAAATAT CTACATCGCA AAAGGAACGG TCAATGGAAA TGCTTTATAT CTTTCTGACA TCCAGATTTT AAGAAACAAC GCATTTGAAC CGCTGTATAC ATGGCTTTCT TTCCAGATTC CATCTGATTT GAAATTCATT TTAGATGGCA GCTCTCTTGC ACCACTTTCT AAGTTGCAAA ACTTACCTGT TGTAGTTGTA ACAAAGGAAA GATTTTCACA GGAGGTTTTG GACACCATTG TGGCAATATC AAGAGGTTCT TTTACCAAGA TACAGGGCGA GGTGAGCATG ACATCTTCAA ATTCGGTGGT GATATCTGGA AATAGGTATT CAATAGGAAA CAAAACGTAC ACCGTTGCAA ACGGGCTTTT GATCCCTGCA AGTTTCAAAG TTGGGGATGA AATAATTGGT ATTGCAAGCC AAGGAAGCCT TGTTTTAGCA AAGCAACAGG AAGGTATTTC AAAACCCATA TTTGTTCGAG GGCAGGTACA AGACATTTCA GAGCTTGAAT ACATCACTGT GAAAGACTAT GTCTATTTAG ATGGTCAGAG AGGATGGCAA TATGTTCCGA GCAAGCTAAC TCTTTTTTAT GATACACAGA CAGTTATGTG TGATGTATAT GGACTTGGCA GTCCGAAAGA GATATTGAAT CTGAAAAACA AGAGTGTGTA TATCATTCAT AATGGCAAAT ACGCAAATGT TATAATTGAT ACAAGCTTTG GTGGATACAT TGTCACAGGC GTGGTTGGCA AGGACATGAA GATTCTAAAT GCCCAGTACA ACGATATGAT GACTCAGACA TGGAACAGAA TTGATAAAAG TTTTGTCCTT GATACCACCC AAGCAGTTTT GATAGACGCA GGAGGGAACT TAACAGCTCA GATGCCACAA TTTGGCGACA GGGTTTTGTT ACTTGTTCCT CAGAGCAGTT TTGACATCTC AAAATCGGTA TTAACGCCAT CGATTGTGCT TGTAAATTAC TGA
|
Protein sequence | MIKNKMKIIT LYRLISWVVL ILFFLMLISA NIQAAQEYVA SSSYSKTAKN TQIVNLPQNA LSREAFLFLS AFGFFNTVEK AKINPSDILS KEEALSVILN SVGKQQDAFV RAEKLELKRP SGQKLVKPYN YLYLGYIQLA YDMKILSKKE YQDAISQVQP SEKEHEKMTQ QLIKKNDDTI AKAVYEGRPY SYDDLIFVRS APATRQEVCL WVVKAFKIPF IYENLAKTYP DYDRIDSKFL SSINTLLKNG ALVGRSDGYL HPDDYITYEE LAFILGSLKP NILSANGLKE VKLEVKDIQK FSSGKTILVC EDESGNSINI TVNPGKQDFG VIANGNFLSS SYLQKGDYVA FYVNGKNEVV LASILQRPGQ ENIKGVISRI DAKKMIFSVK LPDGKVYNLS LSPKATIYDS NSGKNLNFSE LNVGNLVQVK VQKDRADAIT LLSLDSPEIE RVQGIISRIT KDRIVLSQNG QLTEYLLSPD TVYIDKGDFS RVLIKNDFYE GMKVLAGTAC GYVQYLSTTY DEKSEDIVSG ILHEVDSNLG YLEIYNQQGE KKSYRFSKKL GLKVKKDGQN ASLDSLLPGD VVFLYFSGDF VRTVTASSNL QQKVAKIENV VRSLASGLPQ KIIVNIDGRI YGPYEINDNV DIVKNGISAA LKDIMPGQYV KLTGSFFGNS AYIRRIEISG NEYVKNIYIA KGTVNGNALY LSDIQILRNN AFEPLYTWLS FQIPSDLKFI LDGSSLAPLS KLQNLPVVVV TKERFSQEVL DTIVAISRGS FTKIQGEVSM TSSNSVVISG NRYSIGNKTY TVANGLLIPA SFKVGDEIIG IASQGSLVLA KQQEGISKPI FVRGQVQDIS ELEYITVKDY VYLDGQRGWQ YVPSKLTLFY DTQTVMCDVY GLGSPKEILN LKNKSVYIIH NGKYANVIID TSFGGYIVTG VVGKDMKILN AQYNDMMTQT WNRIDKSFVL DTTQAVLIDA GGNLTAQMPQ FGDRVLLLVP QSSFDISKSV LTPSIVLVNY
|
| |