Gene Information |
Locus tag | Msed_0313 |
Symbol | |
ID | 5104949 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 269672 |
End bp | 271474 |
Gene Length | 1803 bp |
Protein Length | 600 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640506219 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_001190414 |
Protein GI | 146303098 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4934] Predicted protease |
TIGRFAM ID | |
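The coordinate and length fields above are mutually consistent, and the check can be sketched as follows. This is a minimal illustration, not part of the database record: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon (the gene sequence in this record ends in TAA). The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 269672
END_BP = 271474


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, "bp")
print(protein_length(length_bp), "aa")
```

Under those assumptions the result agrees with the Gene Length (1803 bp) and Protein Length (600 aa) rows.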
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAACA TAAGTTCTAT TCTAATAGTA TTGATTTTGC TTGTATTTTC CTCTCTAGGC TCGGGTCTAG TTTTGCAGGC CCAGGAATCC ATGTACTACG TGCAGACGTC ATCGCCCCAG TACTCAATTT TACCTGGATC ACAGTTCGTG GAACCGCTTA ACACTGGTCT TACGATCCCG ATAGCCATAT TGCTTAACTT CACGAACTAT TCATCGCTGA ACAGTGAGCT TCTATACGTG ATTTCAAGTG GTCATTATTT AACTCCCTTG AAGTTCAGGG AATACTACTA TCCCAGTCAG TCTTACGTGA ACTCCCTTGA GAACTACTTG GAAGGGTACG GGTTTGTTCC CACAGGGAAC TATGGACTCA TACTCACCTT TAACGCCACA GTAGGCGAGA TTGAGCAGGC GTTCCACACA TATATAAACG TGTACTATTA CCCCTTCAAG GATATCTATT GGTTCGGTAA GGTCGGAGAG AAGAGAGTGG GACCATTCTA TTACTTCACG AATAACGTTA CGCCATCACT TCCCTATGGC GTGGGTAAGT ACGTGCTTGG GATTGTGGGT ATAGATAATC TTGATCCACA CGCGTACCAG GGATTGAGGC AGGCTTGGAA TGTACCCATG ACCGAGGGGA AGGGATCCAG CAGTAGCGGG CTAATTTCAA GCGCAATATT CACTCCCAAT ACGATCCAGC AGTACTTCAA CTTCTCGTCG CTCTACGCGC AGGGTAAGAC GGGTTCAGGT TCCACGATCG CCATAGAGGG AGTGCCAGAG TGTTACGTGA ATACCACGGA CATATACTCG TTCTGGAGAC TTTTCAACAT TCCTAGGACT GGGTCATTCA ACGTTATCAC CCTTGGAAAT GATACCTCGG GAGGTCAATC GGGTGAAAAC GAGCTGGACG CTGAGTATTC AGGGGCCTTT GCTCCAGGGG CTAATATCGC AATAGTGTTC AGCGATGGAT ATGTGGGCGG GAAAGCCCTT GTGGGAAACC TCCTAAACTA CTATTATGAG TACTATTATA TGGCAAACTA TCTTAATCCT GACGTGGTCT CTATCTCAGT ATCGTTACCA GAGAGCTATC TTGCAGCGTA TTATCCGGCA ATGTTGTACA TGATACACAA CATCATGGTT CAGCTATCGG TGCAGGGAAC CTCGGTCTTG GCTGCATCAG GGGACTGGGG ATTTGAGTCC AATCATCCTC CGCCTAACTT CCACATTGGC GTTTACAACA CCATATGGTA CCCGGAAAGC GATCCACTGG TTACCTCGGT AGGCGGGATA TTCCTCAATG CGACCTCAAC GGGACAGATC TACTCCTTCT CTGGGTGGGA TTACAGTACA GGAGGCAACA GCGTGGTGTT CCCGGTTCAG ATATACGAGC TAACCTCCTT GATTCCGTTC ACGCCTATCA CAGTGAGAAC TTACCCTGAT ATCGCCTTCG TTTCAGCGGG CGGTTACAAT ATACCCGAAT TCGGTTTTGG GCTACCCCTG ATCTTCGACG GACAGCTGTT CGTGTGGTAT GGGACTAGCG GTGCAGCTCC CATGACGGCT GCGATGGTTT CGCTTGTGGG ACAGAGGTTA GGCCCACTGA ACTACGCGCT CTATCACATT TCGTACTCTG GAGAGGTAGT AACGCCCCAT GGAATTATTA AGGGACTGTC AGCCTGGATA CCCGTTACCT CTGGCAACAA CCCAATGCCG GCCCACTATG GATGGAATTA CGTTACTGGA CCTGGCACTT ATGACGCGTA CGGTATGGTC ATGGACTTGG GAATGTATGC CGAGTATATC TAA
|
Protein sequence | MKNISSILIV LILLVFSSLG SGLVLQAQES MYYVQTSSPQ YSILPGSQFV EPLNTGLTIP IAILLNFTNY SSLNSELLYV ISSGHYLTPL KFREYYYPSQ SYVNSLENYL EGYGFVPTGN YGLILTFNAT VGEIEQAFHT YINVYYYPFK DIYWFGKVGE KRVGPFYYFT NNVTPSLPYG VGKYVLGIVG IDNLDPHAYQ GLRQAWNVPM TEGKGSSSSG LISSAIFTPN TIQQYFNFSS LYAQGKTGSG STIAIEGVPE CYVNTTDIYS FWRLFNIPRT GSFNVITLGN DTSGGQSGEN ELDAEYSGAF APGANIAIVF SDGYVGGKAL VGNLLNYYYE YYYMANYLNP DVVSISVSLP ESYLAAYYPA MLYMIHNIMV QLSVQGTSVL AASGDWGFES NHPPPNFHIG VYNTIWYPES DPLVTSVGGI FLNATSTGQI YSFSGWDYST GGNSVVFPVQ IYELTSLIPF TPITVRTYPD IAFVSAGGYN IPEFGFGLPL IFDGQLFVWY GTSGAAPMTA AMVSLVGQRL GPLNYALYHI SYSGEVVTPH GIIKGLSAWI PVTSGNNPMP AHYGWNYVTG PGTYDAYGMV MDLGMYAEYI
|
| |
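The sequences above are displayed in space-separated blocks of 10 characters, so the spaces have to be removed before any downstream use. The sketch below shows one way to do that and to compute a GC fraction. The helper names are hypothetical, and the example input is only the first three blocks of the gene sequence, so its GC fraction is not expected to match the 48% reported for the full gene.

```python
def clean_sequence(display_text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(display_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First three 10-base blocks of the gene sequence shown in this record.
first_blocks = "ATGAAAAACA TAAGTTCTAT TCTAATAGTA"
dna = clean_sequence(first_blocks)

print(len(dna))                  # number of bases after removing spaces
print(round(gc_fraction(dna), 2))
```

Pasting the full Gene sequence field into `clean_sequence` in place of `first_blocks` would allow the same length and GC checks to be run on the whole gene.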