Gene Msed_0440 details

Gene Information

Locus tag: Msed_0440
Symbol:
ID: 5105436
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 392237
End bp: 393847
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 50%
IMG OID: 640506346
Product: peptidase S9 prolyl oligopeptidase
Protein accession: YP_001190541
Protein GI: 146303225
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
TIGRFAM ID:
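
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
gene_len = gene_length_bp(392237, 393847)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)
```

Under these assumptions the computed values match the reported Gene Length (1611 bp) and Protein Length (536 aa).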


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCTAT CTTACCTTAA CTACACGCCC GTTCAAGACT TCGATGTTAA GGATGGGCGA 
GTCGCATACG TGATCCTGAA GGACTCTCCT AAGGTTGAAA TTCTCGGAGT GGGAGAGGTG
CAGATAGAGG AGCCAGAGAC CGTTCACTGG GTGGGATCTA GGCTGGCTGT GGTCGCCGAT
CAGGGTGGGG CAGAGGTTAG GTCCATTTAC CTGGTGGATG AAGCTCCACA TCCCTTACTT
TCCGACGGAT TTGACAACAT GGAACCTGTC TTCCTCAAGG AGGACAAATT CTACTTCCTT
TCCAATAGGG ATAGGGAAAC GATTCGCCTC TACCTATATG ACGGAGGACA GATCACGAAG
GTAAGCAAGG GTAACCTTCC AGTATCTGAC GTTTGTGTCT CCCCTGGGGG AAGATGGGTA
GCCTACTCCT CAGGAATCTA CGATAATGAC CTTTACCTTC TTGATAGCAA GACAGGGGAA
GAGGTAGTCG TGTCATATCC TAACTCAGAG CAGTATCCAA GTTCGTCTCA ATGCTTCACA
GGGGATTCGC TTCTCTTTCT TAGCAATCAC AACGGCTTCC TTGATGTCGG GAAACTCTCG
TTAAGGGATC ACACAGTTTC TTGGTTAGTC ACGAGCAAGG AGGACAAGTT CGAGGCTCTG
ATGTGGAGGG ACAGGTTGGT GTACACTGTG GACGTGCGAG GTCATATTCT GCTCATGGTG
GACGGAAAGC CCCTAACTGA CCAGGGGGTA GTGACTGACG TGAAGGTCGA TAGAGACCTC
TTCTTCCTTC ACTCGTCATA CGACAGAGCA TACGATCTTT ACAGGCATTC CACCGTGACT
GAAAGGTTAA CGGACTCAAT GAGGGAGGTT AAGGGAGAAT TCGTGAAACC CACTCTCGTA
AAGTACGTCT CCCTAGGAGA GGAGATTGAT GGGCTCCTTT ACCAAAGGGG AGGGGAGAAA
CGTGGAGTAG TCTACATCCA CGGAGGTCCA GATTACGAGT GCCTAAGTAA CTACTCAGCT
GAGATTCAGA TGTTGGTGGA CCAAGGATTC AAGGTCATAT GCCCGAATTA TAGGGGCTCG
ACCGGTAGGG GAAGGAGGTT CAATCACCTC AATGACAGGG ACCTTGGTGG AGGCGACCTA
GTGGATGTGG TGGAGTCAGC TAGCCTCTTG AAGGTTCCCA AGGTTGCGGT GACAGGGGCG
AGTTATGGAG GATACCTAAC CATGATGGCA GTAACCAAGT ACCCTGAGAA ATGGTGTGCT
GCAGCTGCTG TGGTTCCATT CGTTAACTGG TTCACGGAAA AGAAGATGGA AAGAGAGGTA
CTCAGGCAGT ACGACGAGGT AAAGATAGGT AATGACGAGG AACTACTCAG GGATAGGTCC
CCCGTGTATT TCCTGGACAG GGTTAGGGCA CCACTCCTCC TTCTAGCTGG GGAAAACGAT
CCCAGATGTC CTGCTGAGGA GACGCTACAG GTAGTGGAGA AGATGAAGGA GATGGGAAGA
ACCGTGGAGT ACAAGATCTA CGAGAACGAG GGTCATGGAT TCGTTAAGAG GGAAAACCTG
GTGGATTCCA TAATAAGGGT AGTGGAATTT CTAGATAAAA ATTGTAAATA G
 
Protein sequence
MNLSYLNYTP VQDFDVKDGR VAYVILKDSP KVEILGVGEV QIEEPETVHW VGSRLAVVAD 
QGGAEVRSIY LVDEAPHPLL SDGFDNMEPV FLKEDKFYFL SNRDRETIRL YLYDGGQITK
VSKGNLPVSD VCVSPGGRWV AYSSGIYDND LYLLDSKTGE EVVVSYPNSE QYPSSSQCFT
GDSLLFLSNH NGFLDVGKLS LRDHTVSWLV TSKEDKFEAL MWRDRLVYTV DVRGHILLMV
DGKPLTDQGV VTDVKVDRDL FFLHSSYDRA YDLYRHSTVT ERLTDSMREV KGEFVKPTLV
KYVSLGEEID GLLYQRGGEK RGVVYIHGGP DYECLSNYSA EIQMLVDQGF KVICPNYRGS
TGRGRRFNHL NDRDLGGGDL VDVVESASLL KVPKVAVTGA SYGGYLTMMA VTKYPEKWCA
AAAVVPFVNW FTEKKMEREV LRQYDEVKIG NDEELLRDRS PVYFLDRVRA PLLLLAGEND
PRCPAEETLQ VVEKMKEMGR TVEYKIYENE GHGFVKRENL VDSIIRVVEF LDKNCK
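
The protein sequence is the conceptual translation of the gene sequence. The sketch below illustrates how that translation and the GC content field could be reproduced. It assumes the codon assignments of NCBI translation table 11, which for elongation codons are the same as the standard genetic code; the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the spaces that split the sequences into blocks of ten are treated as formatting only.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace used to block the sequence and upper-case it."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGAATCTAT CTTACCTTAA CTACACGCCC"))
```

Applied to the first three blocks of the gene sequence, `translate` yields `MNLSYLNYTP`, the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` gives the value that the GC content field rounds to a percentage.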