Gene Athe_1519 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1519 
Symbol 
ID7409024 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1604915 
End bp1607278 
Gene Length2364 bp 
Protein Length787 aa 
Translation table11 
GC content33% 
IMG OID643715888 
Productpeptidase S16, lon-like protein 
Protein accessionYP_002573390 
Protein GI222529508 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1067] Predicted ATP-dependent protease 
TIGRFAM ID[TIGR00764] lon-related putative ATP-dependent protease 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACTTA CACCAGATAA ATTAAAAAAG AATGTAGATT TATCCAATTT TGAATTTAAA 
ACCACAAATG AGATAGAACC TTTAACAACC ATAATTGGGC AAGAAAGAGC CAAAAGAGCA
TTTGAGTTTG GACTGAGCGT TACCACAAAA GGATACAACA TTTACATGTG TGGTCCCACA
GGTACAGGTA AGACAAGCTT TGCCGAAAAT TATTTAAAGG AGATAGCAAA AAACAAACCT
GCTCCGAACG ACTGGGTTTA TGTTTATAAT TTTGCCAATC ATGATTCACC TATTGCAATA
TCTCTGCCAA AAGGAATGGG CAAAGTTTTC AGGAAGGATA TGACAGATTT CTTAGAATTT
GTAATTAATG ATTTGAAGAA GGTTTTTAAC AGTGAGGAAT ACGAAAATGA CAAAAATAAT
ATTTACAATG AATATCAGGA AAAAAGGACA CAGCTTTTAG ATAAGTTAGC TGAAGAAGCA
AGAGAATATG ATTTTGAGAT AAAATATACT CCAAGCGGTG TGTATTTTAT TCCTATTGTA
AATGGTAGGG CGATTTCAGA AGAGGAATAT CCTGAGTTAG AAAAGACTAT CAGAGATGAA
ATTGAAAAAA AGGTCAAGAA GCTTCAACTT GAAACTCAAG AGGTTTTAAA GAAGATAAAG
GTTCTTGAAA AAGAGTTGAA AGAGAGAATA AAAGAACTAC AAAAGAGGAT AGCTGTCTTT
ACAATAAGTC ATTATGTATA TGAAATTAGA AGCAAGTATA AAGATAACAT AAAAATCTTG
GATTATATTG ATAGTGTAAC AGATGACATT ATTGAAAATC TTGATGATTT TTTGGACAAA
GAAGAAGAAG ACACACAAAT TCCTTTGCAA TTTGTTCCAT ATAAAAAATT CTCAAGACTT
GATAAATATA AGGTCAATGT AATTGTTGAC AATTCAGAAT TAGATGGCGC GCCTGTTGTT
TATGAAGTAA ATCCCACTTA TTACAATCTC ATAGGAAAAA TTGAATATGA TAACGAGATG
GGCAACATTC TTGTTACAGA CTACACCAGA ATCAAGGCGG GAGCAATTCA CAAAGCAAAC
GGAGGATATC TCATCCTTCA GGCAAAAGAT TTGCTAAGCT ATCCACAGGC ATGGGAAGCC
TTGAAAAGAG TACTAAAAAC AGGTCAGATA TACATTGAAA ATCTAAAGGA TATTTATGGA
CTTTTTATAA CTCCTTCTTT AAAACCAGAA CCTATACCAG TTGATTTAAA GGTTATTTTG
ATTGGTAGTG AGTATATTTA TAACATCTTA TACACATACG ATGAAGATTT TAAAAAACTT
TTCAAGATTA AAGCTGATTT TGACAGCGAA ATGGATTACA ATCAAGATAA TCTCTATAAG
ATGATTCAGT TTATTAGTTC ATTTTGTAAA AAAGAGAATG CATTGCCATT TTCCAAGGAT
GCCGTTGAGA AGGTAATTGA ATATTCCTGC AGACTTGTTG AGAACCAGGA AAAGCTTTCA
ACACGGTTTA ATGAAATTGT TGAGATATTG GCAGAAGCTA ACACCTGGGC TCAACTGGAA
AGAAGTAATG TGGTGAGGAA GGAACATGTA AGCAAAGCAA TTTGTGAAAA AGAATACAGA
AGCGCAAAAT ATGAAGAAAA AATAAACCAG ATGATTGAGG AAGGTACTAT TTTAGTTGAT
GTAGATGGTT ATAAGGTAGG TCAAATAAAT GCTCTTGCAA TTCTGGATGT TGGTGACTAT
GTTTTTGGCA AACCTTCTCT AATAACTGTT ACCACAAGTA GCGGAAGAAG TGGTATAATA
AATATTGAAC GTGAAGTACA AATGTCTGGT AAAACTCATA GCAAAGGTAT TTTGATTATC
TCAGGGTATA TTTCCCAGCT GTTTGCCCAA GATATGCCAC TTACGTTAAA TGCAACTATA
TGTTTTGAAC AACTATATTC TGGCATTGAA GGTGATTCTG CATCAGCAGC TGAACTTTGT
GCCCTGCTTT CAGCCTTGAG TGATATGCCC ATCTATCAGG GAATAGCAAT AACAGGTTCT
GTAAACCAAA AAGGAGTGAT TCAGCCTGTT GGCGGTGTGA CAAAAAAGAT AGAAGGTTTT
TATTATGTTT GCAAGAAAAA AGGTTTAAAT GGCAAGCAGG GTGTTATAAT CCCACATCAA
AATATAAAAA ATTTAGTATT GTGTGACGAG GTGGTAGAAG AAGTTAGAAA AGAAAATTTT
CACATCTGGG CAGTAAAAAC AATCGATGAA GCTATTGAGA TTTTGACAGG CAAAAAGTTT
GATGAAGTGG TACTTTTAGC AAGACAAAAG TTAAAAAGAT ACTTAGACAA TTTGGTAAGT
TTTAGTGATA AAAAAGATGA GTAA
 
Protein sequence
MKLTPDKLKK NVDLSNFEFK TTNEIEPLTT IIGQERAKRA FEFGLSVTTK GYNIYMCGPT 
GTGKTSFAEN YLKEIAKNKP APNDWVYVYN FANHDSPIAI SLPKGMGKVF RKDMTDFLEF
VINDLKKVFN SEEYENDKNN IYNEYQEKRT QLLDKLAEEA REYDFEIKYT PSGVYFIPIV
NGRAISEEEY PELEKTIRDE IEKKVKKLQL ETQEVLKKIK VLEKELKERI KELQKRIAVF
TISHYVYEIR SKYKDNIKIL DYIDSVTDDI IENLDDFLDK EEEDTQIPLQ FVPYKKFSRL
DKYKVNVIVD NSELDGAPVV YEVNPTYYNL IGKIEYDNEM GNILVTDYTR IKAGAIHKAN
GGYLILQAKD LLSYPQAWEA LKRVLKTGQI YIENLKDIYG LFITPSLKPE PIPVDLKVIL
IGSEYIYNIL YTYDEDFKKL FKIKADFDSE MDYNQDNLYK MIQFISSFCK KENALPFSKD
AVEKVIEYSC RLVENQEKLS TRFNEIVEIL AEANTWAQLE RSNVVRKEHV SKAICEKEYR
SAKYEEKINQ MIEEGTILVD VDGYKVGQIN ALAILDVGDY VFGKPSLITV TTSSGRSGII
NIEREVQMSG KTHSKGILII SGYISQLFAQ DMPLTLNATI CFEQLYSGIE GDSASAAELC
ALLSALSDMP IYQGIAITGS VNQKGVIQPV GGVTKKIEGF YYVCKKKGLN GKQGVIIPHQ
NIKNLVLCDE VVEEVRKENF HIWAVKTIDE AIEILTGKKF DEVVLLARQK LKRYLDNLVS
FSDKKDE