Gene Athe_0438 details


Gene Information

Locus tag: Athe_0438
Symbol:
ID: 7407515
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 497322
End bp: 500795
Gene Length: 3474 bp
Protein Length: 1157 aa
Translation table: 11
GC content: 37%
IMG OID: 643714825
Product: S-layer domain protein
Protein accession: YP_002572343
Protein GI: 222528461
COG category:
COG ID:
TIGRFAM ID:
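
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are illustrative and do not come from the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(497322, 500795)
length_aa = expected_protein_length(length_bp)
print(length_bp, length_aa)  # 3474 1157
```

Both values match the Gene Length and Protein Length fields of this record.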


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAAAA GTTGGGGTAA GAAACTTTTT GCTTTATTGA GCTTACTGAG TTTATTATTA 
AGCTTTTTGG TAAATACTTC ATTTTCTCAA AACCTTTCAT ACTATCAGCA AGCAGCACAG
GTTTTAAAGG AAAAAGGAAT AATGACAGGT GACACAAAAG GAAATTTGAA TCTTGACAAA
CCTCTCAAAC GTTCAGAGAT TTCCAAAATG ATTATCATGC TGCTTGGCAA AAAGCCTTTA
GCTGATTTTT ATGCAAATCA AAAAAAATCT TCTTTTAAAG ATGTTAAGAC AAATTACTGG
GGACTTGGCT ACATAGAAGC AGCAAAAGCA ATAGGATTGA TTTCAGGGTA TACAGATGGT
ACTTTCAAGC CAGAACAGTA CTTAAAAGTT GAAGAGTTGA CTGCTATAGT TGTAAGAGCA
CTTGGTGTAA AGGAGTCTGA GCTCAAAGGC AAGTGGCCAC TAAATTATAT CCAGAAAGCA
TATTCGATGA ATATTTTTTA TGGAATAGAA TCCGAAATTG ATATAGGAAA GCTTGTCACA
AGAGGGCAGA CGGCAGCTAT ACTTTACAAT GCGTTTTTGA ATGAAAGTCT GAAAGCTGCA
AAACCTGTCG GGCTTGAAAT AATTGACCTG CAAACTTTAA AGGTAACATT TGATAAGGAG
CTTTCTTCAA TTGTTAAATC TGACTTTTCG TTTGATGGTG GGCTTTCTGT TTTGGACGCA
AAGTTTGCAG ACTCAAGCAA AAAGGCTGTT GAGATAAAAA CATCTTTGCA GCAAGAAGGG
AAAGAGTACA CGCTTTTTTA CAAAGGGCAA ACTACAACTT TAAAGTTTGT GGCAAAGACA
ATGCCTTTTT CTTTTGCAGA GGATATCAAA ATAGAGAGTT TAAAGAAAGT GGATTTAAAG
TTTACAAAGC CGATTTCAAA GAGCCAGCAG GATAACCTGC CGATAAAAAT TTATGTCAAT
GGCAAAGAAG TTACAGATGT GAAAAAGTTT ATTTCAAGCG ATTATAAAAC TGTGAGTATA
ATCTTCCCAA ACAAACTAAA TCAAACTGAC AAATTGATGG TTGAGATTTC AAACCTTCTT
TCAGAAACAG GTCAGAGCTT GACTCTTACA AAGGAACTAA CAGTTATTGA TGCAACCCAG
CCAAAGGTTG TGGATTTTAA GGTGGTCAAT AGCAAAAGGT TTAAGATTAT TTTTTCTGAG
CCTATGAACA TTGATTCGAC AAGTACTTAC AAGGTATGCG ATTTGTCTTC GGTTGGGGCA
AACATTAGAA TTGATTCAAA TTATGCATAT GCCAAGCTCA CACCAAAACA TCAAGAGAAT
GCTATTGACG TTGAACTTTT ATATCCTCTT GCTGATGGGA ATCACACAGT TGAAATAACA
GAAGCGAAAG ACTTTGCAGG ATACAAAGCC CCTGACTTTA AAGCTACATT TACAACAGTG
CTTGAAAAAA ATCCGCCAAA GCTTGTGTCC TTAGACCTTG TTTCAAATAA TCAAATAAGG
CTTGTGTTTG ATGAAGAGAT AAGAAGCTTA GATGGTTTAA TTCCAACAGG CGAGTATGAG
GTTTATCAAG CACAGGACAG TACCAACCAT GCAATTGGCG CAAAAATAAC ACTTCTTTCA
GATGAAAAAA CAATTGACAT TCAGTTAAAT CCACAATTGA AGCTTGATAG CAGGGCACTT
GTTTCATTTG AAGTGAGGTT CCGATACGTT GAAGACCTTC TTGGCAACAA GGTATCAGAT
TGGGTATCAG TAACATCCAA AGCTCAGGAT GATACTACAA AACCGGCAGT CAAAAGTGTT
GAGGTTTTGG ATGGGAATAT AATTAAGGTA ACATTTACTA AAAATGTAAA TGCTACTGAT
AAGGTTCAAA GCTTTTCCCT TCTTTCAGGA GATGGCACAC AAATAGTGGA AGCTTATGCA
AAAAGCGTAA AACCGCTTAA GGAGGAAGAT AATTCAACAT TTGCTGTTGA GTTTTCGACA
CTTGCAGCAA TAAATGGTGG AAGATATACT TTGAAGATTT TAGGCATCTG CGATACATCT
GTGAGAGAAA ATGTTATGGA TACTGTTTCA TTTGCGATAG ATGCTAAAGA CACTCTTGCA
CCAACTATCA CAGCTGCAAT TGCGAAGTAT GATTCTTCTT CGGATGTAGA CAAAATAGAC
ATATTTTTCT CAGAACCTAT GGATGTTGAA AAATTAAAAA ATTTGAGCAG TTATTTTGTA
GGGGCATCAA GTGCAACAAT TCCACTTTCG AGCGTAAAAG GAGCAAAAAT TGATTATATT
TCACCAAACG GCGACAGAAT CACCCTTTTA ATTCCAGGGG CAGACGATTC AACGCCTGGC
AGATGGAGTC AATCTGGAGC TGTAGTTGAC AAATTGGCAG CTCCTACACT CACTGACAAG
GCAGGCAATT TTATAGCAAA TGCTACAATA GCAATGCCAC TTTCTGTGTC GGCAAACTTT
AGAGGAATAT CTGCGCAAGA CATTGAGGTT GTTGCAGTGG ACAAAAATAC AATTGAAATA
AGAAGCTTGA ATGGATACAT CTTTGCATCG TTTGACCCTG CAGCGATAAT GTTCAGAAAC
GCATATTCGA CTTCAAGTTT AAATGGAAAT CCTGACAACG ATAAAGTGGT TAGCCTTGGA
ATTGTAAGTT ATACAATTTC TCAAGATAAA AAGACCATTA CTTTAAAAAC ATCAATTTCT
TTGACCTCAA GCGCTATGGC TGATACAAAT GATTCCGGTC AAGATGCTGA ACAGCTGAAA
ATATTTACAG TAAATTCGAA TATAAAAGAC CAGTTTGAAC AGAACCTTGT AATTCTGCCA
ACATTTGATA TTAACTTTTA TCCGTCGATA TTACTCAAAG ACAAAATTTC TCCACAGCAG
ACAGGGGTTT CTGTTGGAAG TGGAAATCAA TCAGACACAA TAGCTATTAC ATTTGATGAG
CCAGTTTTTG CCTTGCCAGG TATAAATAGT ACAGTTCTGG CTGCAGGAAT TGAGCTAAAG
GTTAGTGGTA CTACTTTAAT TCCAGATGTT GATTACACAG CTTACGTCCA AAACGGTATA
GTTTATGTGA AAGTCAAAAA GTCGGGGATT GTGGATAGCA AGGTAAGCTT GGAGATAAAA
AGACCAGATT TAATAGTAGA TAGCAATGGA AATCCTTCTA TTGTCTTGAA AGCTCAAACT
GTTGAGCATG TGACAGAAAG GACTTCCCCT GATGTCACAG CAGAGTTTTC TTCGACTGAT
ACAAGAAAAG TCAAGCTCAC ATTTTCTGAA CCTATGGATG CTTCTACACT AATTGCTCAA
AACTTCTCAT GCGTTGCAGG CGGTAATATC ATAAGTTTTG TAAAATCTTC TGATAACAGA
GTTATTGAGA TCACATTCAC AAACCCACTC CCGGCAGGAA GCATTGTGAA TATATCACCG
AATGTCAAGG ACTTGGCTGG AAATTCGGTG TCAGTTCAGG CTGTGAGAAA ATAG
 
Protein sequence
MNKSWGKKLF ALLSLLSLLL SFLVNTSFSQ NLSYYQQAAQ VLKEKGIMTG DTKGNLNLDK 
PLKRSEISKM IIMLLGKKPL ADFYANQKKS SFKDVKTNYW GLGYIEAAKA IGLISGYTDG
TFKPEQYLKV EELTAIVVRA LGVKESELKG KWPLNYIQKA YSMNIFYGIE SEIDIGKLVT
RGQTAAILYN AFLNESLKAA KPVGLEIIDL QTLKVTFDKE LSSIVKSDFS FDGGLSVLDA
KFADSSKKAV EIKTSLQQEG KEYTLFYKGQ TTTLKFVAKT MPFSFAEDIK IESLKKVDLK
FTKPISKSQQ DNLPIKIYVN GKEVTDVKKF ISSDYKTVSI IFPNKLNQTD KLMVEISNLL
SETGQSLTLT KELTVIDATQ PKVVDFKVVN SKRFKIIFSE PMNIDSTSTY KVCDLSSVGA
NIRIDSNYAY AKLTPKHQEN AIDVELLYPL ADGNHTVEIT EAKDFAGYKA PDFKATFTTV
LEKNPPKLVS LDLVSNNQIR LVFDEEIRSL DGLIPTGEYE VYQAQDSTNH AIGAKITLLS
DEKTIDIQLN PQLKLDSRAL VSFEVRFRYV EDLLGNKVSD WVSVTSKAQD DTTKPAVKSV
EVLDGNIIKV TFTKNVNATD KVQSFSLLSG DGTQIVEAYA KSVKPLKEED NSTFAVEFST
LAAINGGRYT LKILGICDTS VRENVMDTVS FAIDAKDTLA PTITAAIAKY DSSSDVDKID
IFFSEPMDVE KLKNLSSYFV GASSATIPLS SVKGAKIDYI SPNGDRITLL IPGADDSTPG
RWSQSGAVVD KLAAPTLTDK AGNFIANATI AMPLSVSANF RGISAQDIEV VAVDKNTIEI
RSLNGYIFAS FDPAAIMFRN AYSTSSLNGN PDNDKVVSLG IVSYTISQDK KTITLKTSIS
LTSSAMADTN DSGQDAEQLK IFTVNSNIKD QFEQNLVILP TFDINFYPSI LLKDKISPQQ
TGVSVGSGNQ SDTIAITFDE PVFALPGINS TVLAAGIELK VSGTTLIPDV DYTAYVQNGI
VYVKVKKSGI VDSKVSLEIK RPDLIVDSNG NPSIVLKAQT VEHVTERTSP DVTAEFSSTD
TRKVKLTFSE PMDASTLIAQ NFSCVAGGNI ISFVKSSDNR VIEITFTNPL PAGSIVNISP
NVKDLAGNSV SVQAVRK
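
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard code and differs only in which codons may act as start codons; since this gene starts with ATG, that difference does not matter here. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical.

```python
from itertools import product

# Codon order T, C, A, G at each position; amino acids in the matching order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence shown above.
print(translate("ATGAATAAAA GTTGGGGTAA GAAACTTTTT"))  # MNKSWGKKLF
```

Passing the full gene sequence to `translate` should give a string comparable to the protein sequence above once its display spaces are removed with `clean`, and `gc_percent` on the same input can be compared with the GC content field.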