Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1231 |
Symbol | |
ID | 5103845 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1207751 |
End bp | 1210525 |
Gene Length | 2775 bp |
Protein Length | 924 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640507123 |
Product | peptidase A5, thermopsin |
Protein accession | YP_001191316 |
Protein GI | 146304000 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAATTA CGGATTATGG CGTAGGGCCT AACGGTTTTT ACTCTTATGA CACTACCCAA TTCTTAGGGA CAGTGTACAT AAACAGTTTA CAAGCGTTGA CTTTTACTAG TGGAAGTACT GCGGTAACTT TTCAGCTCAA TGTCATGCTT AACTATGAGG CAAGAGGATC GTCTTACGCT CTTTGGGTTC AGGACGTCGC ATGTTTTAAT ACTGTAAACC ATGAAATATA TTTTATTGAT AACATTTGGA ATTCCACTGT ACCATTTGGT AATGTGACCG GTCTTCAAGG AAATGGTCAG TATGCTGTAT CCTCATCAAA TGGAAATTAT CCCTCGCAAG TTTTTTATGG CGACGTGTCC CACGATCCAG GATCAATTGT TACACAATCA CTTCCAACGG CTTTTGACCT TCTAGTAAAT GTAAGTACCA ATAACCTAGG TCAGCCTGTT ATCTATTTCT GGTACAATGA TGGTTATGGA TGGATAAACT ATGATACCGT TACAGTGACG AATGTTCAAG GTGCTACCAA CGTTTATTTT GAGGTAAATG GATATGCCCT TACAGGGAAT GGCAATAATT ACGATGCCGA GCTTGTTCTC GCCGGACCAG GTGGAGGTTC AAACACTTAT ATTAACCAAG CAAACGTTGA TTTACTTTTA TTTTACTGGA ACGGACATAA TTTCCAGGAG GTTGCAAATA CATACAACTA TGGTTCAGAT ACTGCCGAGA CATCGTCCAA TGTTAAGGTA CTTTATTATT ATTATCCATA TTCTGGCTTA CCCACGGGAT TTATAACCAC CGGGCCTGGG GCTCTTGGTC CATTGTGGAA CAGTACTGAC GTCACTTGCC TAACTGTAAA CACCGGGTTA AACCAGGGAT ACGTTCAGAT CTACAATGAT AGTTATTCCT ACCAGTACGC CGAGCAAGTA AATAACCTCT GTTTGTTCCA AGGGGGATCC GCCACATTCA CTCTTACGCC CTACGTGAAC TATTCAGTCC TAGTTTACAA CACTAACGAT CAGCTGGTTG GAGAGGCCAA CGTAATGGCT CAGCCTGGCT CCATGACCAC CAGCGTGACG CAATTTAATG TGGTGATACC TCCAAGCGTA GATCTCACAC TTGGGCAGAC AACCAATATT CCAGTTGAAA TTCAAGCTTA TGGACACGTA ACAGTCCAGG TTAAGGGCAA CTCTCTCACA AGGACTTACA ACGTGTACGT AAATGGACAA ACCACGTTGA ACGTTCCCGT AGAAGCAACG TCACCTGGCT CCTATCAACT TGAGGTTAAC GCAACTCTCT TCCCAGGGTT CTCCGTGGTT AAGTATGTTA CAGTGAACAT TGCACAACCT CAACTTTACT CCGTGACAGG ATCCTTCTCA GTGAATGGGC AAACGCCTCC CACTACACCC CTCGTAACCT TTACCTTCCC CAACGGCTCA AGGGTCACCT TGCCACTAAA TAACCTGAAC ATTAAGGTGC CCCCCGGTAC CACCTATTCC GCCTCAAATA TAACTGAGTC TGGCTATAGG TGGATCGCAG TTAATGGTCA AGGAGAGATA CTGGGCACAA CTCAGCTCTC CATAACCTAC TACGAGCAGG TTCAAGTGAA CTTCCAGTAT CAGGGCCCTG CAGTTCCTAC CGTGAGTTAC TACTACCTGG GGAACCTGGA GACCGCGAAG TTGCCCGTCA CGTTATGGGT GGACTACGGC ACTAGTTACA CATTCCCACA AATCCTCTCG AGCAGTTCAG GGGAAAGGGT GATCGCATAT AACTACCAGG GAACAGCAAC CTCTCCCAGC ACCTTCACAG TCACCTATTA TCAACAGTAC TACGTTGACG TCTCATCTCC TATCCCAGTC TACGCTCTCG TGAACGGCCA CAATGAGAGC CTAAGCTCGG GATGGTACAA CCAATCTACG TCAATAGACG TCGAGAACGT TCCTTACTAC CTAGGCCCAG GTGAGAGGGA AGTCATTACT AGTGTGACGC CTTCATCCTC TCTCTCCGTG AATTCTCCCC TAAACATTAC CGTCACCCCA GAGACTCAGT ACTATGTTCA GGTGTCGTCT CCTATCCCCG TTTATGCCGA AGTTGACTAT GAAAACTTCA CCCTCCAGTC AGGTTGGTAC TATCAAGGAA CCTTGATACA GGTTGAGAAC ATCACATATT ACCCCTCATC TAACGAACGT TACGTGATAA CTTCGATTAC TCCATCAACT TTCACGGTGA ACCAGCCGGA AACCATTACG ATCTCCACCG TGGTTCAGTA TAGGCTCACC TTACTTTCAT CAATTCCTAC ATACGCCCTT GTTAACGGAA ATAACGAGAC CTTGACCTCA GGTTGGTATA ACGCGGGAAC AACCATAAAC GTGGAGAATA TCACATATTA CGTTACCCCT ACAACTAGGG AACTCGTCAC CCAGATCTCT CCTTCCACCC TGACCATGAA CGGGCCTTCA GCTATTTCAG TACAGACAGT GAAACAGTAT CTAGTACAGA TAAACTCTCA ATACCCTGTA ACAATAAATG GGGTTCAGAC AAACAGTGAG TGGGTTAATG CAGGGTCAAG CATCACCTTG AATGCCAATT TACCTTTCTA CTTGACGGGA AGTTTCAGCG GAACTGCTCC AGTGGCTCTT GGAGGGAGTA TAGCGGTTAA CCAACCCGTC CAGGAGACGT TACAGACGTC CATCAGTCTA GTATTTGTGG GGATAGTTGC CGTTATTGCC ATAGTGGCTG TGGGTGTGGT GATAGTACTG ATAAAGAGAA GGTGA
|
Protein sequence | MGITDYGVGP NGFYSYDTTQ FLGTVYINSL QALTFTSGST AVTFQLNVML NYEARGSSYA LWVQDVACFN TVNHEIYFID NIWNSTVPFG NVTGLQGNGQ YAVSSSNGNY PSQVFYGDVS HDPGSIVTQS LPTAFDLLVN VSTNNLGQPV IYFWYNDGYG WINYDTVTVT NVQGATNVYF EVNGYALTGN GNNYDAELVL AGPGGGSNTY INQANVDLLL FYWNGHNFQE VANTYNYGSD TAETSSNVKV LYYYYPYSGL PTGFITTGPG ALGPLWNSTD VTCLTVNTGL NQGYVQIYND SYSYQYAEQV NNLCLFQGGS ATFTLTPYVN YSVLVYNTND QLVGEANVMA QPGSMTTSVT QFNVVIPPSV DLTLGQTTNI PVEIQAYGHV TVQVKGNSLT RTYNVYVNGQ TTLNVPVEAT SPGSYQLEVN ATLFPGFSVV KYVTVNIAQP QLYSVTGSFS VNGQTPPTTP LVTFTFPNGS RVTLPLNNLN IKVPPGTTYS ASNITESGYR WIAVNGQGEI LGTTQLSITY YEQVQVNFQY QGPAVPTVSY YYLGNLETAK LPVTLWVDYG TSYTFPQILS SSSGERVIAY NYQGTATSPS TFTVTYYQQY YVDVSSPIPV YALVNGHNES LSSGWYNQST SIDVENVPYY LGPGEREVIT SVTPSSSLSV NSPLNITVTP ETQYYVQVSS PIPVYAEVDY ENFTLQSGWY YQGTLIQVEN ITYYPSSNER YVITSITPST FTVNQPETIT ISTVVQYRLT LLSSIPTYAL VNGNNETLTS GWYNAGTTIN VENITYYVTP TTRELVTQIS PSTLTMNGPS AISVQTVKQY LVQINSQYPV TINGVQTNSE WVNAGSSITL NANLPFYLTG SFSGTAPVAL GGSIAVNQPV QETLQTSISL VFVGIVAVIA IVAVGVVIVL IKRR
|
| |