Gene Msed_0395 details


Gene Information

Locus tag: Msed_0395
Symbol:
ID: 5103638
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 343473
End bp: 345224
Gene Length: 1752 bp
Protein Length: 583 aa
Translation table: 11
GC content: 51%
IMG OID: 640506301
Product: peptidase S9 prolyl oligopeptidase
Protein accession: YP_001190496
Protein GI: 146303180
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
TIGRFAM ID:
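
The start and end coordinates, the gene length, and the protein length in this record agree with one another, and a short arithmetic check makes that explicit. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(343473, 345224)
print(length_bp)                  # 1752
print(protein_length(length_bp))  # 583
```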


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATCCCA AGGAAGCGTA TTCCTTGAAA GTCATATCTG AAGTCAAACT TGAGTCCCAG 
GGACTCATAC ACGGTGAGAC TTGGATAAAG GACAATGCCT ACTTCACGAC CATCTTCCTG
AACAATAGAC CAATCCTAGA GGGCAAGGTC TCCCTCCCTA GGTTCCTGGG AGAAGATCTA
TATTACGTTA GGAATGATGG TTCAGCAACA CTGCTGGTTC AGTCACCCTA TGGGGAACCC
AGAAAGCTCG CGGAGCTGGG TAAGATACTC AAGTTCGAAA AGCACGAGAA AGGCCTACTC
ATCCTGGGAG AAGATCTTCT GGATAAGCAA GCTCCCTCAG CACCCTTTAT CACGGAAAAG
AGGAAGTACA GGTTTGACGG AAGGGGGCTC CTCAGGACGA GGACCTCCCT CTACCTAGTG
AAGGGCAACG ACGTGGTCAA GGTCCTGGGA GGAGATTTTG ACGTCACTGA CTTCTCCACT
AACGGTAAGA GGGTAGTTGT ATCTACTACC CAACCAAATG ACGACCTAGG CTTGAATGCC
CTTTACGAGC TTGATCTTGA GACCGGAGAG ACCAGGAGGA TAACCAAGGA GGACGGTATG
ATAGTCGCAG TTGCCATGAA CTCTGACGGG GACGTTGCGT ACCTAGGACA TGATAAGGGG
AAGTCTCCGT GGGCAGTGAG GGAGGTGATC TTCCCTGAAA GAGGGGAGAG ATACCTGTGC
GGAAACACGT GCGGTTCCAC GGTCCTCACA GACGTCTTTG ATGGGGCTAA GGAAAGGCTA
GTTTTCCTGA AGAACCAGGT CATCACCTTG GGCCAGATGG GAGGAGAGGT AAACCTGTAC
CGGATAAGCG ACAGGAAGGT TGACAAGTTG ACTGAAGGGA AACAGGTGGT GAGGTTATTC
GACTACGACG GGAACTCCCT GGTTTACTCC TTCATGACAC CTGAAAAGCC CTCCCTTCTG
TTCCGTGGAG AGGTATACGA TCCGGACCCA AATGTGAAAG GGCTCATGCC CGTGAGGGTT
AGCTCCAAGA TTGAGGGGTG GGGCATCATC ACGGGAGATA AGCCCACAAT CCTCTTCATT
CACGGTGGGC CACATATGGC CTACGGTTAC GGTTACTTCA TCGAGTTTCA ATTCTTCGCC
TCAAACGGTT TCAACGTGAT TTACGCTAAC CCAACAGGAA GCCAGGGTTA TGGAGAGGAG
TTCGCCAAGG GATGCGTTGG GGACTGGGGA GGAAGGGACA TGGCAGAACT ACTGGAGTTT
GTGGAGGACG CTAGGAGGCA GTTTAACCTG ACTAAGAGGA TGGGAGTCAC GGGAGGGTCC
TATGGAGGTT TCATGACAAA CTGGATCATT ACTCACTCTG AGATCTTTTC AGCTGCAGTG
AGTGAGAGGG GTATCTCGAA CCTAGTTAGC ATGTGCGGTA CGAGCGACAT AGGCTTCTGG
TTCAATGCCG TGGAGTCAGG GGTCGATGAT CCTTGGAATC CAGAAAACAT GGAGAAGTTA
ATGAGAATGT CCCCAATATA CTACGTTGGG AAAGTAAGTA CTTCCACCAT GTTCATTCAT
GGGGAAGAGG ATTACAGGTG CCCCATAGAA CAGGCGGAGC AGTTTCACGT GGCCCTTAGA
TCTAGGGGAG TCGAGAGCAA GCTGGTGAGA TATCAGGGAG ACGGGCATGA ACACGCAAGG
AGAGGGAGAC CAGACAACAT GATGCACAGG TTAACAATAA AGTTACAGTG GTTCAAGGAC
CACCTCACGT AA
 
Protein sequence
MDPKEAYSLK VISEVKLESQ GLIHGETWIK DNAYFTTIFL NNRPILEGKV SLPRFLGEDL 
YYVRNDGSAT LLVQSPYGEP RKLAELGKIL KFEKHEKGLL ILGEDLLDKQ APSAPFITEK
RKYRFDGRGL LRTRTSLYLV KGNDVVKVLG GDFDVTDFST NGKRVVVSTT QPNDDLGLNA
LYELDLETGE TRRITKEDGM IVAVAMNSDG DVAYLGHDKG KSPWAVREVI FPERGERYLC
GNTCGSTVLT DVFDGAKERL VFLKNQVITL GQMGGEVNLY RISDRKVDKL TEGKQVVRLF
DYDGNSLVYS FMTPEKPSLL FRGEVYDPDP NVKGLMPVRV SSKIEGWGII TGDKPTILFI
HGGPHMAYGY GYFIEFQFFA SNGFNVIYAN PTGSQGYGEE FAKGCVGDWG GRDMAELLEF
VEDARRQFNL TKRMGVTGGS YGGFMTNWII THSEIFSAAV SERGISNLVS MCGTSDIGFW
FNAVESGVDD PWNPENMEKL MRMSPIYYVG KVSTSTMFIH GEEDYRCPIE QAEQFHVALR
SRGVESKLVR YQGDGHEHAR RGRPDNMMHR LTIKLQWFKD HLT