Gene Athe_2342 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2342 
Symbol 
ID7407761 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2484219 
End bp2487251 
Gene Length3033 bp 
Protein Length1010 aa 
Translation table11 
GC content38% 
IMG OID643716706 
ProductS-layer domain protein 
Protein accessionYP_002574185 
Protein GI222530303 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAAAAA ATAAGATGAA AATAATAACC TTATATAGAT TAATAAGCTG GGTAGTTTTG 
ATTTTGTTTT TTTTGATGTT AATTTCCGCA AATATCCAAG CTGCTCAGGA GTATGTAGCA
AGCAGCAGCT ACAGCAAGAC GGCAAAAAAT ACTCAAATTG TGAACTTGCC TCAAAATGCC
TTGTCGAGAG AAGCGTTTTT GTTTTTGTCA GCTTTTGGCT TTTTCAATAC AGTTGAAAAA
GCAAAGATAA ATCCTTCTGA TATTCTGTCA AAGGAAGAGG CGCTGTCTGT AATTTTAAAC
AGCGTTGGCA AACAGCAGGA TGCTTTTGTG AGGGCTGAAA AGCTGGAACT AAAAAGACCA
TCAGGTCAGA AGCTTGTAAA GCCATACAAC TATCTTTATC TTGGATATAT TCAGCTTGCA
TATGATATGA AAATTTTGTC CAAAAAAGAG TATCAGGATG CGATATCTCA AGTGCAGCCA
AGTGAAAAAG AACATGAGAA GATGACCCAG CAGCTGATAA AGAAAAATGA TGATACCATT
GCAAAAGCTG TGTATGAAGG AAGACCATAT TCGTATGACG ATTTGATATT TGTAAGGTCT
GCTCCTGCTA CCCGTCAGGA AGTTTGCTTA TGGGTGGTAA AGGCATTCAA AATACCTTTT
ATTTATGAGA ATTTGGCTAA AACTTACCCT GATTATGACA GGATTGACAG TAAGTTTTTA
AGTTCAATTA ATACTCTTTT GAAAAATGGT GCTCTTGTTG GAAGATCTGA TGGATATCTT
CATCCAGACG ACTACATAAC ATATGAAGAG CTGGCATTTA TCCTCGGGAG TCTCAAACCA
AATATCCTGA GCGCAAATGG GTTAAAAGAG GTAAAGCTTG AGGTAAAAGA TATCCAGAAG
TTCAGCAGTG GCAAAACCAT TTTGGTGTGT GAGGATGAAA GTGGCAATAG CATAAATATT
ACAGTCAACC CTGGCAAACA GGATTTTGGA GTAATAGCAA ATGGTAATTT TTTGAGCTCT
TCCTATCTTC AGAAAGGGGA CTATGTAGCC TTTTATGTAA ATGGCAAAAA CGAGGTTGTG
CTGGCCAGCA TTTTGCAAAG GCCTGGTCAG GAAAATATAA AGGGTGTAAT CTCCAGAATT
GATGCTAAAA AGATGATATT TTCGGTAAAG CTTCCAGATG GCAAAGTTTA CAATTTGAGT
TTAAGTCCAA AAGCTACAAT TTATGATTCA AACTCTGGAA AGAATTTAAA CTTTTCAGAA
TTGAATGTAG GAAATCTTGT ACAGGTTAAA GTCCAAAAAG ACAGAGCAGA TGCCATAACA
TTGCTTTCGC TTGACAGTCC TGAAATTGAA AGAGTACAGG GGATAATCTC AAGAATAACA
AAGGACAGAA TTGTACTCAG TCAAAATGGA CAGCTAACAG AGTATCTTCT TTCACCAGAT
ACAGTCTACA TTGATAAAGG TGATTTTTCA AGAGTTCTTA TAAAGAATGA TTTTTATGAG
GGTATGAAAG TTTTGGCAGG TACGGCTTGC GGGTATGTAC AGTATCTCAG CACCACATAT
GATGAGAAAT CTGAAGATAT AGTTTCCGGT ATTTTGCACG AGGTAGATTC AAACTTGGGG
TATCTTGAGA TTTACAACCA ACAAGGTGAA AAAAAGAGTT ACAGATTTTC AAAAAAGCTT
GGGCTAAAAG TGAAAAAGGA TGGTCAAAAT GCTTCTTTAG ACAGCCTTTT ACCAGGCGAT
GTTGTTTTCC TCTATTTTAG CGGCGATTTT GTGCGCACAG TCACAGCAAG TTCAAACCTG
CAGCAGAAGG TTGCAAAGAT TGAAAATGTT GTAAGAAGCC TTGCAAGCGG TCTTCCACAA
AAAATAATTG TCAATATCGA TGGGAGAATA TATGGACCAT ATGAAATAAA TGACAACGTT
GATATTGTAA AAAATGGCAT TTCTGCAGCT TTAAAGGATA TTATGCCAGG TCAGTATGTG
AAGCTCACAG GGAGCTTCTT TGGAAACTCA GCGTACATCC GCAGAATAGA GATATCAGGG
AATGAATATG TAAAAAATAT CTACATCGCA AAAGGAACGG TCAATGGAAA TGCTTTATAT
CTTTCTGACA TCCAGATTTT AAGAAACAAC GCATTTGAAC CGCTGTATAC ATGGCTTTCT
TTCCAGATTC CATCTGATTT GAAATTCATT TTAGATGGCA GCTCTCTTGC ACCACTTTCT
AAGTTGCAAA ACTTACCTGT TGTAGTTGTA ACAAAGGAAA GATTTTCACA GGAGGTTTTG
GACACCATTG TGGCAATATC AAGAGGTTCT TTTACCAAGA TACAGGGCGA GGTGAGCATG
ACATCTTCAA ATTCGGTGGT GATATCTGGA AATAGGTATT CAATAGGAAA CAAAACGTAC
ACCGTTGCAA ACGGGCTTTT GATCCCTGCA AGTTTCAAAG TTGGGGATGA AATAATTGGT
ATTGCAAGCC AAGGAAGCCT TGTTTTAGCA AAGCAACAGG AAGGTATTTC AAAACCCATA
TTTGTTCGAG GGCAGGTACA AGACATTTCA GAGCTTGAAT ACATCACTGT GAAAGACTAT
GTCTATTTAG ATGGTCAGAG AGGATGGCAA TATGTTCCGA GCAAGCTAAC TCTTTTTTAT
GATACACAGA CAGTTATGTG TGATGTATAT GGACTTGGCA GTCCGAAAGA GATATTGAAT
CTGAAAAACA AGAGTGTGTA TATCATTCAT AATGGCAAAT ACGCAAATGT TATAATTGAT
ACAAGCTTTG GTGGATACAT TGTCACAGGC GTGGTTGGCA AGGACATGAA GATTCTAAAT
GCCCAGTACA ACGATATGAT GACTCAGACA TGGAACAGAA TTGATAAAAG TTTTGTCCTT
GATACCACCC AAGCAGTTTT GATAGACGCA GGAGGGAACT TAACAGCTCA GATGCCACAA
TTTGGCGACA GGGTTTTGTT ACTTGTTCCT CAGAGCAGTT TTGACATCTC AAAATCGGTA
TTAACGCCAT CGATTGTGCT TGTAAATTAC TGA
 
Protein sequence
MIKNKMKIIT LYRLISWVVL ILFFLMLISA NIQAAQEYVA SSSYSKTAKN TQIVNLPQNA 
LSREAFLFLS AFGFFNTVEK AKINPSDILS KEEALSVILN SVGKQQDAFV RAEKLELKRP
SGQKLVKPYN YLYLGYIQLA YDMKILSKKE YQDAISQVQP SEKEHEKMTQ QLIKKNDDTI
AKAVYEGRPY SYDDLIFVRS APATRQEVCL WVVKAFKIPF IYENLAKTYP DYDRIDSKFL
SSINTLLKNG ALVGRSDGYL HPDDYITYEE LAFILGSLKP NILSANGLKE VKLEVKDIQK
FSSGKTILVC EDESGNSINI TVNPGKQDFG VIANGNFLSS SYLQKGDYVA FYVNGKNEVV
LASILQRPGQ ENIKGVISRI DAKKMIFSVK LPDGKVYNLS LSPKATIYDS NSGKNLNFSE
LNVGNLVQVK VQKDRADAIT LLSLDSPEIE RVQGIISRIT KDRIVLSQNG QLTEYLLSPD
TVYIDKGDFS RVLIKNDFYE GMKVLAGTAC GYVQYLSTTY DEKSEDIVSG ILHEVDSNLG
YLEIYNQQGE KKSYRFSKKL GLKVKKDGQN ASLDSLLPGD VVFLYFSGDF VRTVTASSNL
QQKVAKIENV VRSLASGLPQ KIIVNIDGRI YGPYEINDNV DIVKNGISAA LKDIMPGQYV
KLTGSFFGNS AYIRRIEISG NEYVKNIYIA KGTVNGNALY LSDIQILRNN AFEPLYTWLS
FQIPSDLKFI LDGSSLAPLS KLQNLPVVVV TKERFSQEVL DTIVAISRGS FTKIQGEVSM
TSSNSVVISG NRYSIGNKTY TVANGLLIPA SFKVGDEIIG IASQGSLVLA KQQEGISKPI
FVRGQVQDIS ELEYITVKDY VYLDGQRGWQ YVPSKLTLFY DTQTVMCDVY GLGSPKEILN
LKNKSVYIIH NGKYANVIID TSFGGYIVTG VVGKDMKILN AQYNDMMTQT WNRIDKSFVL
DTTQAVLIDA GGNLTAQMPQ FGDRVLLLVP QSSFDISKSV LTPSIVLVNY