Gene Athe_2295 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2295 
Symbol 
ID7407714 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2426577 
End bp2429801 
Gene Length3225 bp 
Protein Length1074 aa 
Translation table11 
GC content38% 
IMG OID643716659 
ProductS-layer domain protein 
Protein accessionYP_002574138 
Protein GI222530256 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAAAA AGTTTTTAGC AGTCATTTTG TTACTTTGCT TTGTGGTTGT AAACTTTGGT 
TTGTCTTCTG CTTTTGCTGC ATACAAAGAC ATTCCATCAA ATGCAAGTTA CAAGCAGGCA
GTAGAAAAGT TAAATAAGCT TGGGATTCTT GTTTACAAGG ACTATTTCAA GCCAAATGCT
GCTGTCTTAC GCGGTGAGTT TGCAGCTGCG ATTGTAAAGA TTTCAAACGT AGAGGATGAG
GTGAATCTGC TAAAAGGATA TTCTCAGTAT CCAGATATAA AGCCAAACAC CACACTTTGT
GGATATGTCA ACTGGGCGGT AAAGAAAAAA TACATGACAC CAATGGCAGA CAATAAGTTC
CATCCAAATG ACCCGCTTAC CTTTGCCCAG GCAACAACTG CTATTGTGAG GATGCTTGGG
TATTCCGACT CAGATCTTTC TGGCATCTGG CCGCAGAATT ATATCGACAA AGCATCAGAG
CTTGGGCTTA TAAAAGGAAT AAACCTTTCT GCGTCGCAAA AGGTTCCGCG CTGGGCTGCG
GCACTGATGC TATCAAGGCT TCTTGATACT TATGTCAAAA GCGGTGGAAA TCAAGCTCAA
TCTGGCCAGT CAGCTTTAAG TGGAGTATCA GCTTCATCCC AAAGCAACGG CACAAAATTT
TCTGAGTATG TTGGGCTTTA CAAATCGTAT GTTGTGCTTG ATACCGGCAA AACTTCTTCA
AAGCTTCTTC CAAATGAGGT TTTGACAGAC AGCGGAGTGC TTGTGAATGC AACAAAGACG
CAACTTGAAG TTGGGAAAAA GTACATGCTT CAGGTTGATA CCAATAAAAT CACGAAAGTG
TTTGGCACTG AAGCTGATTC TTTCCAGATT GTCAGCACAA AGGTAAGCAG CAGGACTGTA
TATTACAAGG AAAGTGGAAA GACAAAATCA ATAACTTTGC CATCTTCGGC AACATACTAT
TACAACGGCT CAAAGCAGAG CTATGATGCA ATAGAAAATG TACTTAAACC GAACCAGAAA
ATAAGCTTTA TCTATTCTGA AGATAGGAGC AAAGTGGACT ATATTGTAAT TAAAGACATA
TATGCACAAG AGGTTTATGG AAACTACGAT GAGGTGCTGA TTTTGGCAAC TCCTAAAACA
TCATCGTCGT TAGAGGCAAA CCAGGTTCAG ACAGACAAGG GAATATACTT TGTTGCATCT
TCAATAAAAC CTGAAAACCT TGAAATTGGA GCAAAGTATG GAGTGTATAT AAAAGATGAT
ACAATCACTG CAGCTTTGCA AAAAGTGTGG GTATCAGAAA AGTTTACAAT TACAAATATA
GATGATTACA CACTTGATGC TGCTCAAAAC GGCAAAACAC AAAGGATTCA GCTGACAAGC
AAACCTTTAT ATTACTATCA AGGAACAAAA CAGAGCTATG AAAATTTACC AAACATTTTA
AAAGAAGACC AGATACTCTA TGTATCAAAA GACCCTGACA CAGGCAAGGT TATGGCATAT
GTTATTCAAG ACCCATACGG CACCCAGTAT GGAAACTATA TTGAGGCAAT AATCCTGCAG
GATGCACTTT TAAACCCTGC TTTAGAAAAC AATCAGGTTT TAACAGACAA AGGTATATTT
TATTTACCTA ATATAAACAC AAAACTGGAG ATTGGCTCAA AGTATGGTGT TTATGTTAAG
GATGACAAGA TAACATTAGT TGTAAAAAAA TTAAATACTG TGAATTTGTA TGAGATTACA
GATGTTGTTA GTGATACAAA TGTAAAGTTA AAATCATCAA AAGGCCAGGA GAACATAATC
CTGCCACAAA AACCTGTTTA CTACTATAAC GGCAATAAAA TTAGCTACAG TGATTTGAAA
AACGCGTTAA AATCAGGTCA AAAAATCTAT TTTGGATACT CAAAGGATGG GAAAACTTGC
GAGTATATTG TTCTTCAGGA CCCATATTCA TCTGAGTACG GTGCATATAC TGAGGTTATC
GTTCTGGCAG ATGCGGTTGT ATCTGATAAG TTATCTACAA ACGAGGTTTT AACAGACAAA
GGGATATACG CGGTAAAGTC CACAGCAGGC AAGCTTACAG TTGGTGCAAA ATATGGGGTA
TATATCAAAG ATGATACAAT CACAAAGGTT GTAAAGAAGC TCAACAGCGT CGATACAGCA
GAGATTACAG AGGTTATAAG CGATACAAAT GTTGTTCTCA AAAAAGGCAG CACAAGCAGC
TCTACTTTCC TGCCACAAAA ACCTGTTTAT TATTACAATG GAAGTAAGGT TGACTACAAT
AGTTTGAAAA ACATAATAAA ATCAGGGCAA AAGATTTATT TTGGATACAA CGCAGCAGGA
AATTCCTATG AGTATGCAAT AATTCAAGAC CCATACTACG ACAGCTATGG AAAATATGTA
GAGACAGTGA TTTTGGGGAC ATATTCAACT ACAAAAGGGC TTGATGTGAA TGAAATTTTG
ACAGACCAGG GAATTTTAAC ATTGCCAGAA AACCAAAATG TGAATTTAGA ACTTGGTGCA
AAATATGGGT TTTACATCGA CCAAGACAAT CAGATAACCC TTGTGTACAA AAAACTAAAT
TCAACCGAAG GTGTGACAGT ACTTTCTGCC ATTTCAAACA AAGTGACAGT TGACAAGGGT
GGTAGTCAAC TTGATATGAT ATTGCCTCAA AATATAACAT ATTACTATAA TGGTTCAAAG
ATTGATTTTT CAACAGCACT TAGCAGACTT CAGATGTCAA CATCGCTTGT GTTTGGACTG
TCAAGTAAGA AAAAAGGATA TGACTACTGT GTAATATTTG ACCCTGTATA CAGCAAGCCA
TACATTGCAA ATGAGCAGAC ATACTTAACG CTAAAAGCAG GTGATCTGGA TATAAGTGGT
AGTAGCAAAG TTATAAAAGA TGGGGATGTT GTGGACTACA GCTATATTCA GAAAAACGAT
GTTGTATATG CTGTGACAGA TATCTGGGGC GGCAACAAGT TCATACTTGT TGTAGATAGC
AAAGTTGAGG GTTACGTCAA GAGCTACCAA CCAACAAGGT TTACACCAAA GTCTATTGTT
GTGAACGTAT ATGACCAGGT ATCAGGAAAG CTTGTAGACA AGACATATGA GGTGAGCGAA
GACTTTGACC CATCTGTGCT TTTGGCAGAT ACATTTAAAG TTGGTCAGAG AGTGTACCTC
ATCTTAGGAT ACGATGGCAA GGTTGTGAGC ATTGTAAATC CGTAA
 
Protein sequence
MRKKFLAVIL LLCFVVVNFG LSSAFAAYKD IPSNASYKQA VEKLNKLGIL VYKDYFKPNA 
AVLRGEFAAA IVKISNVEDE VNLLKGYSQY PDIKPNTTLC GYVNWAVKKK YMTPMADNKF
HPNDPLTFAQ ATTAIVRMLG YSDSDLSGIW PQNYIDKASE LGLIKGINLS ASQKVPRWAA
ALMLSRLLDT YVKSGGNQAQ SGQSALSGVS ASSQSNGTKF SEYVGLYKSY VVLDTGKTSS
KLLPNEVLTD SGVLVNATKT QLEVGKKYML QVDTNKITKV FGTEADSFQI VSTKVSSRTV
YYKESGKTKS ITLPSSATYY YNGSKQSYDA IENVLKPNQK ISFIYSEDRS KVDYIVIKDI
YAQEVYGNYD EVLILATPKT SSSLEANQVQ TDKGIYFVAS SIKPENLEIG AKYGVYIKDD
TITAALQKVW VSEKFTITNI DDYTLDAAQN GKTQRIQLTS KPLYYYQGTK QSYENLPNIL
KEDQILYVSK DPDTGKVMAY VIQDPYGTQY GNYIEAIILQ DALLNPALEN NQVLTDKGIF
YLPNINTKLE IGSKYGVYVK DDKITLVVKK LNTVNLYEIT DVVSDTNVKL KSSKGQENII
LPQKPVYYYN GNKISYSDLK NALKSGQKIY FGYSKDGKTC EYIVLQDPYS SEYGAYTEVI
VLADAVVSDK LSTNEVLTDK GIYAVKSTAG KLTVGAKYGV YIKDDTITKV VKKLNSVDTA
EITEVISDTN VVLKKGSTSS STFLPQKPVY YYNGSKVDYN SLKNIIKSGQ KIYFGYNAAG
NSYEYAIIQD PYYDSYGKYV ETVILGTYST TKGLDVNEIL TDQGILTLPE NQNVNLELGA
KYGFYIDQDN QITLVYKKLN STEGVTVLSA ISNKVTVDKG GSQLDMILPQ NITYYYNGSK
IDFSTALSRL QMSTSLVFGL SSKKKGYDYC VIFDPVYSKP YIANEQTYLT LKAGDLDISG
SSKVIKDGDV VDYSYIQKND VVYAVTDIWG GNKFILVVDS KVEGYVKSYQ PTRFTPKSIV
VNVYDQVSGK LVDKTYEVSE DFDPSVLLAD TFKVGQRVYL ILGYDGKVVS IVNP