Gene Athe_0190 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0190 
Symbol 
ID7407181 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp238338 
End bp240086 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content37% 
IMG OID643714591 
ProductPeptidase M23 
Protein accessionYP_002572114 
Protein GI222528232 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCAAAGA ACCATACAAA ATCTAAAACA TCTGCCACAG CCACCGTTTT GGACCAGAAA 
AAAGCAAAGT TTGTCTCAAA AGATACAAGG GACAATGAAT TCAAACCTGA TTTAAAAGAG
CTTTCAATAA AGAGTGAAAA ACCTATTGTG CTGGCAGATG CAGAAAAAGA AAAGAGAAAA
TCAAAAGATA AAAAAACAAT TGAGAGGGAG AATAATAGTT ATTTACATAG ATTTTGGAGA
ACTATATCTT CCAAAGTAAA ACCAGCGATA ATTAAGAAAA TCCGCGTTTT CAAAAATAAA
CCAGAAAGCG TTGTTTTTTC TCACAAGCAT TTTAAAAAAG ATCCTCTAAA ACGCTTATTT
GAGGAGATAT TAAGGATAAT TATTGGTCTG AGGTTCATTG GCAATATAAG AAATTCAAAG
GAAGGTATGC TAAAGCTAAA GATATCTGCA ACGTTAGCAT TTATGATTTT TATGATTTTT
GCCATACATA AAATTCCAAC AACATACCAG AAGGCTTATG CAGTTTTACT CAATGGAAGC
GTTGTAGGAT ACGTAAAGGA CAAAACAGAG GCTCAAAAAC TTTTCAGTAA TTTGAAAGAA
GAAATTTCAA GCAAGCACAG AACAGATGAA TTTGTTTTTC AGAACAAACC TCAGCTGAAG
GAAATTCCGC CTGGTCAGTA CAAAAGTACA AGTTTAGATA GTCTTAAAAA CACCATAATA
GAAAAAGGGG TTGTGCTTTT AAAAAGGTAT GCGCTTTTTG TGAACACAAA AGCTTACATG
GTATTTGAAA ATCCTCAGAT ACCACATAAA GTTTTGCTTA AACTAAAAAG CGTTTATTAC
AAAAATGAAG CTCAATCTGC AAGATTTTTA GAAAAGGTTG AGATAAAGCC TGTGTATGTA
AAACCAAGTG TGTCAGTTGC AACTGAAGAT CAGGCGCTGA CTAAAATAAT GTTTGGAAAA
GATGAGGTAA TAGAGTATAC TATCAAAGAA GGAGATACGC TGTGGGATTT AGCCAGAAAA
TATGATATCT CTGTGGATGA TATATTTGCC TCGAACCCGG GCCTTACTGA AAAGATAATG
CCTGGGCAGA AGATAAAACT TTCCAAGATG ACACCGCTTA TTAACGTTGT GCTTGAAAAA
GAAGTGGAGT TTGAGGATGT ATTGCCCAAG CAAGTAAAAG TTATAAAATC TGAAAACTAT
TATACAACAC AGACTATTGT AAAGCAGGAA GGGAAAGATG GAAGGGCAAA GATAAAAGCT
AAGATTGTAT ACATGAACGG TCTTGAGTAT GACAGAAAGA TACTTTTCCA GCAAATTTTG
CAAAGACCAG TTGACAGGGT TGTAGTTGTT GGAACAAAAA AACCTCCGAG ATATTTTGCA
ACAGGAAGAT TTTCGTATCC CGTATGGGGG CTACTTACAT CGCGATTTGG ATACAGAGGG
AGAGAATTTC ATGAAGGGAT AGACCTTGCT GTTCCGTGGG GTTCAAACGT GTATGCAGCT
GACGGTGGCG TTGTTGAGTT TGCCGGGTGG TCAGGTGGTT ATGGCAAGCT CATTATTATA
AATCACCAGA ACGGCTACAA AACTTACTAT GGTCATCTTA GCAGGTTTTT GGTAAGCCCA
GGACAGAAGG TTGCAAAGGG GCAGCTTATT GCCAAAAGCG GGTCAACCGG TAGAAGTACA
GGTCCTCATC TTCATTTTGA GGTGAGGAAA AATGGCGTAC CACAAAATCC GCTTGTGTAT
CTGCACTAA
 
Protein sequence
MAKNHTKSKT SATATVLDQK KAKFVSKDTR DNEFKPDLKE LSIKSEKPIV LADAEKEKRK 
SKDKKTIERE NNSYLHRFWR TISSKVKPAI IKKIRVFKNK PESVVFSHKH FKKDPLKRLF
EEILRIIIGL RFIGNIRNSK EGMLKLKISA TLAFMIFMIF AIHKIPTTYQ KAYAVLLNGS
VVGYVKDKTE AQKLFSNLKE EISSKHRTDE FVFQNKPQLK EIPPGQYKST SLDSLKNTII
EKGVVLLKRY ALFVNTKAYM VFENPQIPHK VLLKLKSVYY KNEAQSARFL EKVEIKPVYV
KPSVSVATED QALTKIMFGK DEVIEYTIKE GDTLWDLARK YDISVDDIFA SNPGLTEKIM
PGQKIKLSKM TPLINVVLEK EVEFEDVLPK QVKVIKSENY YTTQTIVKQE GKDGRAKIKA
KIVYMNGLEY DRKILFQQIL QRPVDRVVVV GTKKPPRYFA TGRFSYPVWG LLTSRFGYRG
REFHEGIDLA VPWGSNVYAA DGGVVEFAGW SGGYGKLIII NHQNGYKTYY GHLSRFLVSP
GQKVAKGQLI AKSGSTGRST GPHLHFEVRK NGVPQNPLVY LH