Gene Athe_1799 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1799 
Symbol 
ID7408586 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1871746 
End bp1872885 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content33% 
IMG OID643716176 
ProductPeptidase M23 
Protein accessionYP_002573665 
Protein GI222529783 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.935113 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAAAATA AACTAAAAAT TTTCTCTATT TTGCTTGTAT TTATTATTTT TTTTGAAACT 
GCTGTATCAA ATACCTTAAA AGAGTATCAA AGCAAGCTAA GGTCTATTGA AAAGAGCAAG
CAAAAGACAC AGCAGAAGAT AGTAGAAGTT AAAAAACAGC AAAAGCAGGT CTTATTACAA
ATAGATAATC TTGACAAAAA GATTGATAAT GTAGAAGAAA AAATAAGAAA TTTAAAAGCA
AACATTGCAT CTGTTGAAAA CAAGATTTTG CAGACAGAGG CTGAACTTAA TGAAGAAGAA
AAAAGAAAAG AAATGTATTA TGAAAAGTTT AAAGATAGAA TAAGATGTAT TTATGAGCTA
AATACTGTCT CTGTATCGTA TATTGAAATG CTACTTGACT CTCAAAACCT TTCAGATTTT
TTTACAAGAA TGTATCTTTT TAACGATATA ATTGAATATG ATAAGCAAAT ACTAAATGAG
TACACAAAGA GCATTGAGAC TATCAAAAAT AAGAAAGAAG AACTTGTACT GCTAAAGGAA
GACTTAAATA GAGAAAAGAG GGAGCTTGAA AACTACCAAG CTTCTCTTTT GGCGGAGCAA
AATGAAAAGA AAAAACTTTT GTTGGAACTT GAAAAAGAGC AAGATAAATT GGAAAAGATG
CTTGATGAAC TTGAAGAGAT TTCAAATGAG CTTTCCAAGA AAATAAAGGA GATTTTGGCA
AGACAGAAGA CAAAACGCAT ATATAAAGGT GGCAAACTCT TGTGGCCGCT TGAGGGGTAT
TATGGGATAA CATCATATTT TGGTATGAGA TTTCATCCAA TTTTGAAGAA AAACAAAATG
CACACAGGGA TAGACATTGC AGCACCATAT GGAGCAAGTG TTTTAGCAGC AGCAGACGGT
GATGTGATTT TAGCCGGATG GGTGTCTGGT TATGGTAAGA CTATCATTAT AGATAATGGT
AGTGGTATTT CAACTTTATA TGCTCATCTT TCTTCAATCA ACGTAGCTGT GGGTCAAAAA
GTGAAAAGAG GCGAAAGTAT TGGCAATGTT GGTGCAACTG GGTATGCTAC CGGTCCGCAC
CTTCATTTTG AAGTTAGAAT AAACGGCGAT GTTACTGACC CGCTTAATTT CCTAAGATAA
 
Protein sequence
MKNKLKIFSI LLVFIIFFET AVSNTLKEYQ SKLRSIEKSK QKTQQKIVEV KKQQKQVLLQ 
IDNLDKKIDN VEEKIRNLKA NIASVENKIL QTEAELNEEE KRKEMYYEKF KDRIRCIYEL
NTVSVSYIEM LLDSQNLSDF FTRMYLFNDI IEYDKQILNE YTKSIETIKN KKEELVLLKE
DLNREKRELE NYQASLLAEQ NEKKKLLLEL EKEQDKLEKM LDELEEISNE LSKKIKEILA
RQKTKRIYKG GKLLWPLEGY YGITSYFGMR FHPILKKNKM HTGIDIAAPY GASVLAAADG
DVILAGWVSG YGKTIIIDNG SGISTLYAHL SSINVAVGQK VKRGESIGNV GATGYATGPH
LHFEVRINGD VTDPLNFLR