Gene Msed_0189 details

Gene Information

Locus tag: Msed_0189
Symbol:
ID: 5103933
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 152722
End bp: 153777
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 46%
IMG OID: 640506094
Product: peptidase M24
Protein accession: YP_001190290
Protein GI: 146302974
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0275902
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.05907
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATTATA AGAACAGGGT ACAAAGGGTA AAGGAGCGTC TAAAGGGAAA GGCGGACTAC 
CTTGTTCTGG GCCCTGGAAG TAACATGTTT TACCTCACGG GTTTCACGGA GGAACCAATG
GAGAGACCGA TCCTCCTGAT ACTCGGAGAA CAGGATTACA TGATAGCCCC AAAGATGTAT
GAACAACAGT TATCAGGTCT CAGCCTAGAA GTAAGAACTT ACGTGGACGG AGAAGATCCC
TATTCTCTTT TACAGATCAA GAAAGGTTCT TCTCTTGCTA TCGACGACCA ACTTTGGTCA
ATGTTTCTAG TTAGTATACT TAATAGGTTC TCCCCATCGG ACCTAATCCT GGTTTCACCA
CTCATAGCTC CAATAAGATC AGTTAAGGAT GAGGAGGAGA TAGGGATAAT GAAGGAAGGG
TTGAAAATTG CAGAGCAATC CTTCATGGAA TTTATTTCGA GGGTTAAGGA GGGGGAGACG
GAATGTCGCT TGTCGCAGAT ATTGGAGGGG ATTTTCAGGG AGAATGGAGT AACGCCATCC
TTCTCTACAA TCCTCACCTC AGGTCCAAAC ACGGCAATGC CACACCTGAG ATGCACTGAG
AGGAAAGTGC GTAAAGGAGA ACCTGTGATT GTGGATTTTG GTATCAAATA CCATGGGTAT
TCCACAGATA CCACAAGGGT CGTTACTATT GGGAAGCCAT CACAGGAGGT GACAAAAATT
TGGGAAATAG TTCACGAGGC TGTGGTAAAG GCTGAGGAGT CCACCTATGG ATTGTCGGGG
ATGAAGATAG ACCAAAGGGC TAGAGGCGTC ATAGAAGGTA GGGGCTACGG TAAATACTTC
ATTCATAGAA CCGGACATGG AATTGGAATA GACGTTCACG AGTTCCCCTA CATCTCTCCC
GACAACGGCG ATGTGATACC TAGGAACTCC GTTTTTACCA TAGAACCTGG GATCTACATA
CCCGAGAAAT TTGGGATTAG AATCGAGGAC ATGGTCATTA TGCGAGACAG GGCGGAGGTT
CTCTCTTCCT TGCCTAAGGA GATCTATCAA GTCTAG
 
Protein sequence
MNYKNRVQRV KERLKGKADY LVLGPGSNMF YLTGFTEEPM ERPILLILGE QDYMIAPKMY 
EQQLSGLSLE VRTYVDGEDP YSLLQIKKGS SLAIDDQLWS MFLVSILNRF SPSDLILVSP
LIAPIRSVKD EEEIGIMKEG LKIAEQSFME FISRVKEGET ECRLSQILEG IFRENGVTPS
FSTILTSGPN TAMPHLRCTE RKVRKGEPVI VDFGIKYHGY STDTTRVVTI GKPSQEVTKI
WEIVHEAVVK AEESTYGLSG MKIDQRARGV IEGRGYGKYF IHRTGHGIGI DVHEFPYISP
DNGDVIPRNS VFTIEPGIYI PEKFGIRIED MVIMRDRAEV LSSLPKEIYQ V
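
The numeric fields in this record can be cross-checked against the sequences. The following is a minimal Python sketch, not part of the original record: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares (table 11 differs only in its permitted start codons). Only the first line of the gene sequence is translated here for brevity.

```python
# Build the codon table in NCBI order (T, C, A, G); amino acid assignments
# are the same for the standard code and for translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq):
    """Remove the spaces/newlines used for 10-base grouping and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(ch in "GC" for ch in s) / len(s)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as listed in this record.
first_line = "ATGAATTATA AGAACAGGGT ACAAAGGGTA AAGGAGCGTC TAAAGGGAAA GGCGGACTAC"
print(translate(first_line))  # MNYKNRVQRVKERLKGKADY

# Coordinates are inclusive, so length = end - start + 1.
start_bp, end_bp = 152722, 153777
gene_length = end_bp - start_bp + 1
# One codon is the stop codon, so it does not contribute a residue.
expected_protein_length = gene_length // 3 - 1
print(gene_length, expected_protein_length)  # 1056 351
```

Passing the full gene sequence to `translate` and `gc_percent` should allow comparison with the listed protein sequence and the reported GC content.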