Gene Athe_0934 details

Gene Information

Locus tag: Athe_0934
Symbol:
ID: 7407835
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1035147
End bp: 1037474
Gene Length: 2328 bp
Protein Length: 775 aa
Translation table: 11
GC content: 38%
IMG OID: 643715303
Product: ATP-dependent protease La
Protein accession: YP_002572812
Protein GI: 222528930
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0466] ATP-dependent Lon protease, bacterial type
TIGRFAM ID: [TIGR00763] ATP-dependent protease La
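
The length fields in this record can be cross-checked from the coordinates. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the gene is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1035147, 1037474)
print(length, protein_length_aa(length))
```

With the values listed above, both results agree with the Gene Length and Protein Length fields.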


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000076135
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAGCAGCA CAGTATCTAC AAGAACAATA CCAGTGATTC CTCTGCGTGG GCTTGTGGTT 
TTTCCATACA TGATGCTTCA CTTTGATGTT GGAAGACAAA TTTCTCTTAA GGCATTAGAA
CAAGCTATGG AAAATGACCA GCTTGTTTTG CTTCTTTCTC AAAAAGACCC AAAACAAGAA
GAACCAACAC CGGATGATAT GTATCAGTTT GGCACAGTTG CAAAGGTAAA GCAGATGTTA
AAACTGCCAA GCGAGACTTC AAGGATACTT GTTGAAGGTC TCTATAGAGC ACGAGTTATA
AGATATTTGT CAACAAACCC ATATTTTTTA GTTGAGGTTG AAGAATATAA AGAAAATGAA
ATTAAATTAG AAGATGATCC TGAATTAGAA GCACTCATAA GAAATGTGGT TGGAGCATTT
GAAGAGTTTG CAAGACTCAC AAACAAAATT CCACCTGATG CTATTTTGTC TGTCACTACA
ATTCAAAGCC CTGACCAGCT TGCAGATGTT ATAGCTGCAA ATGTTGTTGT CAAGCTTGAA
GATAAACAGC TTTTACTTGA AAAGGTTGAC TTGAAAGAAA GACTTGTAAA ACTATATGAA
ATGATACTAA AGGAAAAAGA AATAATTGAG ATTGAAAGAA AAATTGCTAT CAAAGTGAAA
AAACAGATTG ATAAAACCCA AAAAGAGTAT TATCTGAGAG AACAGCTAAA GGCAATCCAG
AGCGAACTTG GGGAAAAAGA CAGCCTTTTT TCTGAGGCAG AGGAATATAG AGAGCAGGTC
AAAAAACTGG GACTGAGCCA GGAAAGCCTT CAAAAGGTGT TCAAAGAAAT AGATAGGCTT
GAGAAATTGC CTCCAAACTC GCCCGAGGTT GGGGTTATAA GAACGTACCT TGACTGGATT
GTTGACCTTC CATGGAATGT GAGAAGCGAT GAAAAGATTG ATATAAATCT GGTCAAAAAA
GTGCTTGATG AAGACCACTA TGGGCTTACA AAAGTAAAAG AAAGGATACT TGAGTATATT
GCTGTAAGGA AACTAAAAAA TGACATGAAA GGGCCTATTT TGTGTTTAGT AGGACCACCT
GGTGTTGGGA AGACATCAAT TGCAAAATCA ATAGCACGTG CACTTAACAG AAATTATGTA
AGAATTTCGC TTGGTGGTCT TCGGGATGAA GCAGAGATAC GAGGACACAG GAAAACCTAT
GTTGGTGCAA TGCCAGGAAG AATTATTTAT GCTCTTCGTC AGGCAAAAAC AAAAAACCCT
CTTATACTTT TGGATGAGAT TGACAAGATG TCGAATGATT TTAGGGGTGA CCCTGCATCT
GCGCTTTTAG AGGTCTTGGA CAGTGAACAG AACTTTGCAT TCCGCGACCA TTATATTGAA
ATTCCTTTTG ATTTGTCTGA AGTAATGTTC ATAGCAACAG CAAACACACT TGAGACAATT
CCAAGACCTT TACTTGACAG GCTTGAAGTG ATTGAGATTA CAGGGTATAC TGAAGAAGAG
AAGCTTGAGA TTGCTAGAAG GTACCTTTTG CCCAAGCAAT TAGAACAGAA TGGGCTTAAA
AAATCTCAAC TGAGATGTGA AGAGAGTGCT ATAAAGGATA TTATAGCATT TTATACACGT
GAATCAGGTG TGAGAAATTT AGAAAGAGAA ATTGCGAGAT TGTGTAGGCG TGTTGCTAAG
GAAATTTTAG AAAAAAACAA AAAGATGGTA AAAATCACAT CAAAGAATCT TGAGAAGTAC
TTAGGTACAC CTAAGTACAG AAGAGATGAG TTAATAGAAG AAAATAGGAT TGGTATTGTG
ACGGGTCTTG CATGGACTCC ATTTGGTGGT GAAACTCTTT TTGTCGAAGC ACTTGTTATG
CCAGGGTCTG GCAAGTTAGA ACTTACAGGT CAGCTTGGCG ATGTTATGAA AGAGTCAGCA
AAGGCTGCAG TGAGTATTAT AAGGTCAAGG GCGAAAGAAC TTGGAATTGA CCAAAACTTT
TACAAAGAGT GTGATATTCA CATTCATGTT CCAGAAGGTG CTATTCCAAA AGATGGACCG
TCTGCTGGAG TGACAATGGC AACTGCAATG GTTTCGGCAC TTTCACAGAG AAGAGTAAGA
TACGATGTTG CCATGACAGG CGAGATTACT TTAAGCGGTA GGGTACTTCC AATTGGCGGG
GTTAAAGAGA AGGTTTTAGC AGCAAAGAGA ATGGGTATTA AAAATGTAAT ACTACCTATT
GGAAATAAAA AGGATGTAGA TGAGCTTGAA GACTATGTAA AAAAAGATAT GAACTTTATA
TTTGTAAAAA CAATTGATGA GGTGTTTGAT GTTGCAATAG TAAAGTAA
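
The GC content field can be recomputed from a sequence printed in 10-base blocks like the one above. This is a minimal sketch, assuming GC content means the fraction of G and C bases among all bases after whitespace is removed. The helper names are hypothetical.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for 10-base block formatting."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Fraction of G and C bases in the sequence (assumed definition of GC content)."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Example on the first row of the gene sequence; paste the full
# sequence here to compare against the GC content field.
first_row = "TTGAGCAGCA CAGTATCTAC AAGAACAATA CCAGTGATTC CTCTGCGTGG GCTTGTGGTT"
print(f"{gc_fraction(first_row):.0%}")
```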
 
Protein sequence
MSSTVSTRTI PVIPLRGLVV FPYMMLHFDV GRQISLKALE QAMENDQLVL LLSQKDPKQE 
EPTPDDMYQF GTVAKVKQML KLPSETSRIL VEGLYRARVI RYLSTNPYFL VEVEEYKENE
IKLEDDPELE ALIRNVVGAF EEFARLTNKI PPDAILSVTT IQSPDQLADV IAANVVVKLE
DKQLLLEKVD LKERLVKLYE MILKEKEIIE IERKIAIKVK KQIDKTQKEY YLREQLKAIQ
SELGEKDSLF SEAEEYREQV KKLGLSQESL QKVFKEIDRL EKLPPNSPEV GVIRTYLDWI
VDLPWNVRSD EKIDINLVKK VLDEDHYGLT KVKERILEYI AVRKLKNDMK GPILCLVGPP
GVGKTSIAKS IARALNRNYV RISLGGLRDE AEIRGHRKTY VGAMPGRIIY ALRQAKTKNP
LILLDEIDKM SNDFRGDPAS ALLEVLDSEQ NFAFRDHYIE IPFDLSEVMF IATANTLETI
PRPLLDRLEV IEITGYTEEE KLEIARRYLL PKQLEQNGLK KSQLRCEESA IKDIIAFYTR
ESGVRNLERE IARLCRRVAK EILEKNKKMV KITSKNLEKY LGTPKYRRDE LIEENRIGIV
TGLAWTPFGG ETLFVEALVM PGSGKLELTG QLGDVMKESA KAAVSIIRSR AKELGIDQNF
YKECDIHIHV PEGAIPKDGP SAGVTMATAM VSALSQRRVR YDVAMTGEIT LSGRVLPIGG
VKEKVLAAKR MGIKNVILPI GNKKDVDELE DYVKKDMNFI FVKTIDEVFD VAIVK
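
The gene sequence begins with TTG, yet the protein begins with M. Under translation table 11 (bacterial, archaeal and plant plastid code), TTG is one of the permitted alternative start codons, and an initiating codon is read as methionine. The sketch below illustrates this on the first ten codons of the gene. It is a hand-rolled illustration, not the tool used to produce this record, and the function name is hypothetical.

```python
# Codon table in NCBI ordering (bases T, C, A, G); table 11 uses the
# same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons allowed by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(blocked: str) -> str:
    """Translate a CDS; the first codon is read as M, translation stops at a stop codon."""
    seq = "".join(blocked.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]  # initiator codon is translated as methionine
    for codon in codons[1:]:
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGAGCAGCA CAGTATCTAC AAGAACAATA"))  # MSSTVSTRTI
```

The output matches the first block of the protein sequence. Running the same function on the full gene sequence is a quick way to check that the two sequence fields agree.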