Gene Athe_1795 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1795 
Symbol 
ID7408582 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1867865 
End bp1868848 
Gene Length984 bp 
Protein Length327 aa 
Translation table11 
GC content35% 
IMG OID643716172 
ProductPeptidase M23 
Protein accessionYP_002573661 
Protein GI222529779 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATATCA ATTTCTATAT ATGTATAAAC AAAGCAATGG ATGGCAAAAA TACTTTTATA 
GAAGGAGGTG CGAAGGTGAA CAAGAAAAAC TGGAAAGAAA AACTTTTAGA CTTTTTTGAC
ACTAAAGGTT TTTATATTAT CGTAGCTGTA TGTCTGCTGG TAATAGGATT TTCAGTTTAT
ACCATTGCCA CCACAGACTT TACAAAATAT GAAGTTGAAG AAAGTCAAGG TAGTAACCAA
TCTTTAGAAG GACAAACCAA ACCTGCTGAA ATACCAGTGC CACAAATTAC CACAGAAGAG
GTAACCAAAC AGGATAATTT GAAAAAGAAT GCCTCTAAAT CTTCCACTAT AAGCTCACAA
GAGAAAAATG TAGCCCAAAA TAGTGGTAAA TTAAATGTCA ATGAGCACTC TACGAAATCC
AGCAGTAATT CAAAAAGTGG TAGTTTAAAA AATAAAAGAT CTTCTGCTTT ACACCAAAAG
CACGAGAGCC AAAATCAAAG TGCAAACAAA AATATTCAGA TTGGAACTGA TACCGGCCAG
GATGATGTTG AGGTTATAAA CCCTGTTGAC TTCAAGCCTA TCTTTCCTAC CATTGGGAAG
GTAATAAGAG AGTTTTCTGA CCAGTCGCTT GTATACTCAA AAACTCTTGA TGAGTGGACA
GAACACCCTG GAATTGACAT AGAAGCTCAG GAAGGTAGCG CTGTAAAAGC TTGTTTTGAT
GGTACAGTTA TTGATTTAGG AGAAGACCCT CTTTACGGGA AATATGTTGT AATAGACCAT
GGAGATGGAT ATATCTCAAA GTACTACAAT CTCAAAGACT TAAAGGATAT TCAAATAGGA
GACATTGTAA GGCAGGGAGA GAAAATAGGA GAGGTTGGAA CAAGTTCTAA CATAGAGTAT
ATGGATCCGC CGCATCTTCA TTTTGAGATA ATTTACAATG GAGAAAATCA AAATCCTCTG
AAATTTTTAC CCAAAACAAA TTAA
 
Protein sequence
MDINFYICIN KAMDGKNTFI EGGAKVNKKN WKEKLLDFFD TKGFYIIVAV CLLVIGFSVY 
TIATTDFTKY EVEESQGSNQ SLEGQTKPAE IPVPQITTEE VTKQDNLKKN ASKSSTISSQ
EKNVAQNSGK LNVNEHSTKS SSNSKSGSLK NKRSSALHQK HESQNQSANK NIQIGTDTGQ
DDVEVINPVD FKPIFPTIGK VIREFSDQSL VYSKTLDEWT EHPGIDIEAQ EGSAVKACFD
GTVIDLGEDP LYGKYVVIDH GDGYISKYYN LKDLKDIQIG DIVRQGEKIG EVGTSSNIEY
MDPPHLHFEI IYNGENQNPL KFLPKTN