Gene Athe_1809 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1809 
Symbol 
ID7408596 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1883993 
End bp1885042 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content37% 
IMG OID643716186 
Productmembrane-associated zinc metalloprotease 
Protein accessionYP_002573675 
Protein GI222529793 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0750] Predicted membrane-associated Zn-dependent proteases 1 
TIGRFAM ID[TIGR00054] RIP metalloprotease RseP 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.489906 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCTTA TACTGGCTTT AATTGTGCTG ACAATAGTGA TACTTGTTCA TGAATTTGGA 
CATTTTATTA TATGTAAACT TTCAGGTGTA CTTGTTGAAG AATTTGCCAT TGGTTTTGGT
CCCAAACTCT TTAGCATCAA AGGCAAAGAG ACAGAATATT CTGTGAGGAC ATTTTTGATA
GGAGGGTATG TAAAGCCTCT TGGTGAAGAC CAGGATGTTG ACCACCCACG AGCACTCAAT
AACGCAAAGG TTCACAAAAG GATTTTGATG GTTTTGATGG GACCTGTTAT GAACTTTGTT
CTTGCCATAA TAATCATGAT AGGGATTGGT TATTTTATTG GTTTTGGAAC AAACACAATT
GGTAGGGTGG AGCCAAACAT GCCTGCATAT GAGGCTGGGA TTAGAAGTGG TGACAGGATT
GTTGCGCTTG ACAAAAATAG AGTTTATGTC TGGGACCAGG TAAACTTTTA TTTGGCTGTT
CACAATATGC TCTACAAAGA CAGAGAAGTA AAGATAAAAG TTTTAAGAGA TGGCAAACAA
TATACTTTCA GGGTAAAGCC TAAGTATGAC CCGAATACAA AAACAAAGCG AATAGGCGTT
TTGTCAAAGA TATCACGAAA AAATTTATTT GATAGCATCT ATTATGGAAT ATTTGGGACA
TATGCTGAGA TAAAAGAGAC TATATACAGT GTTGTGCTGA TGATAACAGG AAAAGTTTCT
GGTTCTGAGA TTATGGGACC TGTTGGTATG GTGAAAACAA TTGGAGAGGC TGCGAATGCC
GGATTTAAAC AGAGCGTGCT CAGAGGTCTT TTGAATATTC TGTGGCTTAT GCAGCTGATT
TCAGTGAACT TGGGTGTTAT AAATCTCATC CCATTTCCTG CTTTAGATGG CAGCAGGCTT
GTGTTTTATT TGTACGAAGC TGTGGCAAGA AAACCTTTTA ATAGAGAAAA AGAAGCACTG
ATTCACACAA TTGGTTTTGT ACTTCTTCTG TTTTTGCTGG TGATAGTTAC CTTCAATGAC
ATAAAAAACA TAATAATGCC TGGAAGGTGA
 
Protein sequence
MNLILALIVL TIVILVHEFG HFIICKLSGV LVEEFAIGFG PKLFSIKGKE TEYSVRTFLI 
GGYVKPLGED QDVDHPRALN NAKVHKRILM VLMGPVMNFV LAIIIMIGIG YFIGFGTNTI
GRVEPNMPAY EAGIRSGDRI VALDKNRVYV WDQVNFYLAV HNMLYKDREV KIKVLRDGKQ
YTFRVKPKYD PNTKTKRIGV LSKISRKNLF DSIYYGIFGT YAEIKETIYS VVLMITGKVS
GSEIMGPVGM VKTIGEAANA GFKQSVLRGL LNILWLMQLI SVNLGVINLI PFPALDGSRL
VFYLYEAVAR KPFNREKEAL IHTIGFVLLL FLLVIVTFND IKNIIMPGR