Gene Athe_2340 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2340 
Symbol 
ID7407759 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2481236 
End bp2482654 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content38% 
IMG OID643716704 
Productcarboxyl-terminal protease 
Protein accessionYP_002574183 
Protein GI222530301 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000278022 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAGTA ATAAGAAAAT CTTGATAAAG ATTTTAGCTG TAATGGTTGC GCTTTCAATA 
TTTGTTGCAG TTCCTGTGTA TTCCCAGTTT TTTATAATCT CAAATGATTT TCCTACAGAC
AAGCAGATGG ATTATATCAA AAAGGTGCTG CAGGTTGCAA AGGTGTATCA TATAGGCAAG
TACAGTTATG ATGAGCTTAT TGATATGATG TTTACAGGGC TTTTCAAGAG CCTTGACAAA
TATTCAGAGT ACATGAAACC ACAGCAGGCT CAGGACTTTA CCCAGAGCGT AAATGGCGAG
TTTTCGGGAA TAGGTATCCA GATAGAAAAA CAGGAGGACT ACATAGTTAT TGTAGGAGTT
TTTGATGGAA CACCTGCAAA AGAAGCGGGT CTGAAAGTTG GTGATAAAAT TATAGCAGCA
GATGGAAAGT CTCTGGTTGG GAAAACAACA GATGATGCTG TTAAGCTCAT TCGCGGGCAG
GAAGGCACAA CTGTTGTGAT TGACATCTTA AGAGATGGTA AGACTTACAG ATTTTCTATC
GTAAGAAAAA AAATAAAGAT ACCTGTTGTT GAGTATAAGG TACTTGATAA TAATATAGGA
TATATAAAAC TTACACAGTT TACACAGGGC TGTTCCAATG ATATCAAAAA AGCTCTTGAT
GAGTTTGATA AAAAAGGTAT CAAAAATATT ATTTTTGATA TTCGAAACAA CCCCGGCGGA
CTTTTGGATG AGGTTGTAAA GATATGTGAA TATTTTGTGC CAGAAGGACC AATTGTAACA
ATTGAATATA ATACTTATAA AGATGAGTAT AAATCAAAAA ACAAAGAAAC AAAGTATAGG
CTTGCAGTTT TGACTAACGA GTCGAGTGCT TCTGCTTCGG AGATTTTTGC CCAAGCTATA
AAAGATAGAA AAGTTGGGGT TGTTATTGGT ACAAAGACAT ATGGCAAAGG AACTGTTCAG
ACTCTAATTG GCCTTCCTGA GACAGGTACC AAGAAAGGAT ATGTTGCCAA AGTTACAGTT
GCAAAGTACA AGTCACCGTC TGGCTATTAT GTTGAAGGAA AAGGTGTTGT GCCAGACATA
GAGGTTCAGG ACGACTCACT CTCCCAGTTT GGACCTGATA AGATTTTGAG CCTGAGCGCA
ACCAAGAAGT TCAAAAAAGG TGATATGGAC TTGGAGGTTT TGGCAGCTCA GCAAAGGCTT
TTCTACCTTG GATATTTAAG CAACTGGACA GCCAAGATGG ATGATAGCAC AGTGGCTGCG
GTTAAAAAGT TCCAGAAAGA CAATAAGCTT TATCCTTCTG GAGTGCTTGA TATAACAACG
CAGAAAAAGC TAAATGAGAA GTTTTTAGAG TTTGTAAAAT CCAAATATGT AGACAAACAG
CTACAGCGAG CAATCCAGTA TTTCAAAACT GGGAAGTAA
 
Protein sequence
MRSNKKILIK ILAVMVALSI FVAVPVYSQF FIISNDFPTD KQMDYIKKVL QVAKVYHIGK 
YSYDELIDMM FTGLFKSLDK YSEYMKPQQA QDFTQSVNGE FSGIGIQIEK QEDYIVIVGV
FDGTPAKEAG LKVGDKIIAA DGKSLVGKTT DDAVKLIRGQ EGTTVVIDIL RDGKTYRFSI
VRKKIKIPVV EYKVLDNNIG YIKLTQFTQG CSNDIKKALD EFDKKGIKNI IFDIRNNPGG
LLDEVVKICE YFVPEGPIVT IEYNTYKDEY KSKNKETKYR LAVLTNESSA SASEIFAQAI
KDRKVGVVIG TKTYGKGTVQ TLIGLPETGT KKGYVAKVTV AKYKSPSGYY VEGKGVVPDI
EVQDDSLSQF GPDKILSLSA TKKFKKGDMD LEVLAAQQRL FYLGYLSNWT AKMDDSTVAA
VKKFQKDNKL YPSGVLDITT QKKLNEKFLE FVKSKYVDKQ LQRAIQYFKT GK