Gene Msed_1300 details


Gene Information

Locus tag: Msed_1300
Symbol:
ID: 5104551
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1278069
End bp: 1279781
Gene Length: 1713 bp
Protein Length: 570 aa
Translation table: 11
GC content: 50%
IMG OID: 640507189
Product: peptidase S9 prolyl oligopeptidase
Protein accession: YP_001191382
Protein GI: 146304066
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1505] Serine proteases of the peptidase family S9A
TIGRFAM ID:
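
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes a single terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1278069
end_bp = 1279781

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length)     # 1713
print(remainder)       # 0, i.e. the CDS is a whole number of codons
print(protein_length)  # 570
```

Under these assumptions the result agrees with the listed Gene Length (1713 bp) and Protein Length (570 aa).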


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATCCCT TTGAATACAT TGAAAACCTA GAAGATCCTA GAACTAAGGC ATTCATAGAG 
GAGGAAACGA GGAACTCCTC TTTCTTTCAG GAGAGGGCAA AACTTCACTA TCAGCCCATT
CTCGAGAGAC TCACCGAGGA AAGGCCCATC ACGTTGGTGG GCACGGAAAA GGGAGTGGCA
ATTTTAGTTA GGTCCAAGAG TGGAGTCCAC GCTGAGGTCA ACGGGAACAT CATCAGGAGT
GAGAGGGAAC AAGACATCTT CAATTCCCTG GAGAGGGTAT GGAACTCAGA CCTGGTGAGA
ATAGGGGTAG GGATAGGAGG ATCTGATCAG GGTTACTCGA TCCTAGTGAA TGAGCAGGGT
AAGGTAGTGA GAAGGGTTGA GGGGCTCGTT AACCAGTTTT TCTTCTTAAG GGGCAAGCTG
TGTTACGTTA GGGAGTATAG GACAGAGAGC TCACCTGATG GAGTTCCTCC TGCAGTGGAA
AGGTTGTTCT GTGGGGAGGA GATGCTCCCC TTTTACCCTG GAAGGGGTGA GTGGATCTCA
GTTAAGGCTG AGGGAGATAA CCTTCTCCTG GTTAGGGGAA TAGGTTGGAG CAAGAAGGTA
CTCTATCGAG ACTTTGAAAA GGTGGATGAA GGCGATATCA CCTCCTACGA CATGAAGGGA
GGAAGGATAT ATTACGTGAA GGGAAACTCT CTCATGCGTG ATGGTGTGGA GTTATTCAAG
ATTTCAAGAC CCACACTGGA CATGAAGGTT ATGGACGATG GGATTCTGAC CCTCGAGATC
AGGAATTACA AGACGTCTCT AGTGAAGTAC TCAGAGGAGG GGAGGGAGAC CTGGAACTAC
ACGACGGACC ACATCCTCAC CTTCGATACA GTTGGCGATC AGATCTACGT CCTGGAGACA
TCATTTGACA CGTCATACAC CATCTCCAGG ATAAAGGATC AGAGAGTCGA GGTGCTGAGA
AGGGGGAGGG AGGAGAGGCT CACGGTCAAG GAGATTTACG TCCAGGGAGA CGTCCTCCTG
CACGGGTTCC TCCTAAGTAA GGGAGGTAAT AGGGGAGTTG TGGTTTACGG TTACGGTGGG
TTCGCGATCC CGCTCCTTCC CAGTTACAAT CCTCTATTCC TCGAACTTAT GGACTCTGGT
TACTCCGTCC TAGTCACAAA CCTCAGGGGA GGCTTTGAGA ACGGGGAGGA GTGGCACAAG
GCGGGGATGC TCAGGAACAA GATGAACGTA TTCAAGGATT TCTCGGAGTT CCTACAGACC
GTGAAAATGA TGGGAGGAAG GACAATAGCC ATGGGTGGAA GTAACGGTGG ACTGCTGGTG
GGAGCTACCC TTAACCTCTA CACGTCCCTG GTGGACTGTG GAGTCATAGG TTACCCTGTC
CTTGATATGT TGAAATTTCA CAAGTACCTC GCTGGTATGT ATTGGGTACC CGAGTACGGT
GACCCTGAAA AGGACTCCGA GTTCCTCCTT TCCTACAGTC CCTATCACAA CCTGAAGAAA
GGGCTACCTC CAACCCTAGT GTACACAGGG CTTAATGACG ATAGGGTCCA TCCCATGCAC
GCCTTGAAAT ACGTTGCTAA GTCTAGGGAG ATGGGAAACA AGGTTTACCT CTTCGTAAAT
AGGAGAGCTG GACATAACTT GAGCAGACCG GAGGCAAGTG CCGAGGAGAT GTCCACCGTG
GTGGCGTTCG TGGAACAGTG TCACTCACTC TGA
 
Protein sequence
MDPFEYIENL EDPRTKAFIE EETRNSSFFQ ERAKLHYQPI LERLTEERPI TLVGTEKGVA 
ILVRSKSGVH AEVNGNIIRS EREQDIFNSL ERVWNSDLVR IGVGIGGSDQ GYSILVNEQG
KVVRRVEGLV NQFFFLRGKL CYVREYRTES SPDGVPPAVE RLFCGEEMLP FYPGRGEWIS
VKAEGDNLLL VRGIGWSKKV LYRDFEKVDE GDITSYDMKG GRIYYVKGNS LMRDGVELFK
ISRPTLDMKV MDDGILTLEI RNYKTSLVKY SEEGRETWNY TTDHILTFDT VGDQIYVLET
SFDTSYTISR IKDQRVEVLR RGREERLTVK EIYVQGDVLL HGFLLSKGGN RGVVVYGYGG
FAIPLLPSYN PLFLELMDSG YSVLVTNLRG GFENGEEWHK AGMLRNKMNV FKDFSEFLQT
VKMMGGRTIA MGGSNGGLLV GATLNLYTSL VDCGVIGYPV LDMLKFHKYL AGMYWVPEYG
DPEKDSEFLL SYSPYHNLKK GLPPTLVYTG LNDDRVHPMH ALKYVAKSRE MGNKVYLFVN
RRAGHNLSRP EASAEEMSTV VAFVEQCHSL
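
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch, not the tool used to produce the record: `CODON_TABLE` and `translate` are hypothetical names, and the codon table is the standard one, which has the same codon-to-amino-acid assignments as translation table 11 (the two differ in which start codons are permitted). Whitespace is stripped so the 10-base blocks can be pasted as shown.

```python
# Standard codon table built in the conventional TCAG order.
# Assumption: table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGGATCCCT TTGAATACAT TGAAAACCTA GAAGATCCTA GAACTAAGGC ATTCATAGAG"
print(translate(first_line))  # MDPFEYIENLEDPRTKAFIE

# Last line of the gene sequence above, ending in the TGA stop codon.
last_line = "GTGGCGTTCG TGGAACAGTG TCACTCACTC TGA"
print(translate(last_line))   # VAFVEQCHSL
```

The two outputs match the first 20 and last 10 residues of the protein sequence listed above.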