Gene Athe_2441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2441 
Symbol 
ID7408065 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2583589 
End bp2585796 
Gene Length2208 bp 
Protein Length735 aa 
Translation table11 
GC content42% 
IMG OID643716804 
ProductPeptidase M23 
Protein accessionYP_002574282 
Protein GI222530400 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00201295 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTGGCT ACAATCAGGA AAAAGCAAAA CAGGTAGCAA AGGCATATGT CAAACACAAA 
GCAAAGTCGG TTTTAGTGAA GTTCCTAATG AGCAAACCTG GATTAATAAT AGCAGGAATA
CTTCTGCTGA TTTTACTGAT TCTTGGCTCC ATACAGGCAT TTATCGAAGC AGGAGAAATC
TCCAATAGAC TCGACGATGA GCACAATCAA AAACTGCAAA AAGACGTTGT TGACATCGCA
AGGAAAAAGT CAGGCAACTT ACTGGATTAC TATGGCACAG ACGAAAATCT GGCTTTGAAT
GCAGCGTGGG TCATGAGTTA TTACAAATAC CTGCAGTTCA TGAACAAAGC AGACATAACA
GAAGCCAAGG ACTTTGACGA AGTAGCGAAA GAGCTGGCAA AACAGGGCAT TAAAAGCGCT
GTCAATCAGA TTCTGCCACA GTGGTACATC TACCGCAAAG TCAAGGGTGA ATTATCAGAC
CTGGCTGAGC GTATGAAACC AAGATTCCTG TACATCAAAA CAGAACGCAA AACTGTTACT
ACAAAACTTT TGCACTACTC AATGCCAAGA ACATATTCCT ACACAGTTTC TAAAGAAGTC
TACAACCCTG AAACAGGAAA ATATGAAACC CAGACAGAAA ACAAAAGCCA GACTGTTACC
GTGAAGTGGA CTGAAAAAGT TGAAATTACC GAAACACAAC CTATCTGGCT GATAGTTGCT
GCTGACACAA TCAAACAGCG CTATACATTC AGCTACGATA TCAACGAATA CAAGCGCACA
TACCAGAACA ATGTGCCCCA GAAAACACAG AAGACAAAAA GCCAGGACTT AAAGTTCGAT
TTTGACCCAA CAAAATTCAT TGAGAACAAG AACTGCTTAC ATGCATCCGA CCTGGGTATC
AAATCTACAT TTGAACTCAA AACCAATTTT GATAAAACAG ACAAAGACAT GAAACCTTCC
ACCAGCAGTA AAACATTGGA AACATCCTCA GAAACAGAAG AACCCGCATT TTCTCCAGAA
AAAGGAGCAA CAAACATAAG TCACACTGAT GGAGAAAAGA AACTTGAAAA AGAAGAAGTA
CTGCAGGTAA TTGAAACTGT CCCCGAACTC AATGGGCAGA ACTCAATTGG TAAGGAATAT
GAAAGGCTTG AGGAGATGAT AAAAGAAGAC AACCCTGGTG AGAATATTGA ACTTGCCAAA
TCAATGATTA TCAACACCGC AATGAGTTTC ATAAAAGGCA CAAAAGATTT GAACTGGGTA
TTCAGTGACA TAAACGATTA CATGAACGTT GGTGGATATG TAAACTGTTC ATACATTCCC
GCACAATTCC TGCCAATGTT CCAGGAAGCA GAAAAACTGA TGGGTATCCC ATACTGGTTT
TTAGGGGCAG TATCGTTCAG GGAAAGTTCA TTCAATCCCA GCGCAAAAGC AGAGAACTAC
GATGGCGTAG CAATAGGTCT CATGCAGGTA CAACAAAAAC ACTGGAATAG CAGGGTAGAA
GCATTCAAAA ATGCATTCCC GGATGTAACC ATCACAGGCG ACATAACCAA TCCAAGAGAC
CAGATTCTAA TAGGTACATG GACACTGTAC AATAGTTTCA AAGAAATGGG AATAGACCCA
AAAACAGTGG ACTGGCAGGG TGATGGATGG AAAGAACAAG TAATCCCTGC TCTTGCTGGT
TACTGGATGG GTGTTAACGG TGCGAAACAG TGGGACGCAC CTCGAAACTT CGCAAAAACA
AGGAGTGAAT ACGCACCGCC GCTCATCGAC CAGGCACAGT TATATAAGTC TGTAAGTGAA
ATGCTAACGA ACCCGAACGT GGCAAAACCG ATCGCAGGAG AAATTTATAT CACGAGTCCG
TTTGGTATGC GATATCACCC AATTTCCCAC GAATGGAAAA TGCACACCGG TATAGACATT
GCAACAACTT ACGGACAACC CGTTTTTGCT GTTCAGAATG CAGTAGTCAA GTTTGCAGGT
TGGATGAACG GATATGGTAA GACTATTATC TTGCAATCAG GGGAGTATGA GTTTTACTAT
GCACATCTGG CCGAGATAAA TGTTCAGGTA GGGCAGGTGG TAAAGAAAGG CGATGAAATA
GGCAGCGCTG ATTCAACAGG ATATTCTTCT GGTAACCACC TGCATTTTGA AATTAGAATC
AATGGTACTC CAGTGGACCC ACTCACAGTC CTTGGCAACC TTCAGTAG
 
Protein sequence
MFGYNQEKAK QVAKAYVKHK AKSVLVKFLM SKPGLIIAGI LLLILLILGS IQAFIEAGEI 
SNRLDDEHNQ KLQKDVVDIA RKKSGNLLDY YGTDENLALN AAWVMSYYKY LQFMNKADIT
EAKDFDEVAK ELAKQGIKSA VNQILPQWYI YRKVKGELSD LAERMKPRFL YIKTERKTVT
TKLLHYSMPR TYSYTVSKEV YNPETGKYET QTENKSQTVT VKWTEKVEIT ETQPIWLIVA
ADTIKQRYTF SYDINEYKRT YQNNVPQKTQ KTKSQDLKFD FDPTKFIENK NCLHASDLGI
KSTFELKTNF DKTDKDMKPS TSSKTLETSS ETEEPAFSPE KGATNISHTD GEKKLEKEEV
LQVIETVPEL NGQNSIGKEY ERLEEMIKED NPGENIELAK SMIINTAMSF IKGTKDLNWV
FSDINDYMNV GGYVNCSYIP AQFLPMFQEA EKLMGIPYWF LGAVSFRESS FNPSAKAENY
DGVAIGLMQV QQKHWNSRVE AFKNAFPDVT ITGDITNPRD QILIGTWTLY NSFKEMGIDP
KTVDWQGDGW KEQVIPALAG YWMGVNGAKQ WDAPRNFAKT RSEYAPPLID QAQLYKSVSE
MLTNPNVAKP IAGEIYITSP FGMRYHPISH EWKMHTGIDI ATTYGQPVFA VQNAVVKFAG
WMNGYGKTII LQSGEYEFYY AHLAEINVQV GQVVKKGDEI GSADSTGYSS GNHLHFEIRI
NGTPVDPLTV LGNLQ