Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0696 |
Symbol | |
ID | 5105302 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 634898 |
End bp | 636190 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640506600 |
Product | peptidase A5, thermopsin |
Protein accession | YP_001190795 |
Protein GI | 146303479 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000215244 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTCATGT ATTACGTTTC CGTGAGTATT TTACTGCTTC TAGCTCTCTC CTTGATCTCA CCCCTTGAAC TAGTGACCAC AGCACAGACA GGAATCTCTT TTCCCGTGGG AATCAGCTTC TTCTCGCTCT TCTCCACCTA TTACACGCCA TACGTAATGG GAGTCATGAA CCTTTCATCC CTTCAAATTG GGAGGTCCTA CATATCCGGT CAGCCCTTTG AGTACGGAAA TGCATCCCTC CAGCTCAACG CTATGTTGAA CGGTACTTAT TGGGCCCAGG ACGTTATGCT TTTCCACGAA ATCAATAACA GGACCTTCCA GGTTTACATG GTGATTAACT TCTGGAACCT CACAGGTCCC TTCGTCTCCC TGGTTCAAAA CACAACCACC TTCGACGGCC TTGGGGTTTA CTGTTATCAG GGACCAACCT TCAACATCAC TTTACCAGTC TCGCTGTCCC TATTCATGAA TTCCTCCCAG CATCTTCAGT TCGGGTACTC AATTAACGGG GTAAAGAGGG TGTACCTGAC CTTGCCCTTC CACGGCCTGT TCAAGTTAGG TGGCCTCTCC GTGAACGGAC TTCCCAACGA TCTAGAGATG GTATGGGGTG GCCCAGATGG AGGAAGCGTT GTGGACATGA TAGCCCAGGG ATCCGAGGAG CTCTACTTCC TTCAGGGTAA CAACTTAACC ATAGTTCCCT CTGCGCTCTC TGTGGGGCTA GACACCGCCG AGTCGGCCTA CGGGGTGGCC TCGTCCACAA ATCTGGAAAA CATCAAGAAA CCTTTCGCTG ACATTAACCG GGGAGTTAAT ACTCCATCTG TTCTTTGGCC CGTTCCACCA AATATAAACG TTACACAGGT GAACAGTACC GTTCACGTGA AGCTATACTA TGGAAATTAC ACCTTCTCGG GGCAGGAAGT TGAGATAAAG GTGTTGAAAG GTCTGAACTT AGTCACCCTA AGTCGTGGTG TAACCAACTC CTCTGGAGAG GTTACCTTCA CCAATGTTAC CCAATCATTT TATGAAGTGT ATTTTCCTGG AAATTACTCC CTCTCGCAGT CCTATGCCCT GTCATCCCCT CAATTGAACC ACTTGATTAA CGTGACAACG TCCACCTTCG ACTCCCTGGT TCATTTCCTT GAGACTTACA ACTTTAAGAA GGCTCTAAGT TCTGACTTCA ATCACATCAA GTATCACGGG GAAACCTCCG TGAACTACCT CCTGCTAGAG GTGATAGGGG GACTGACAGC GGGAATTCTG ATATCAGCGT TCCTGGTTAG GAAGTACACT TAA
|
Protein sequence | MFMYYVSVSI LLLLALSLIS PLELVTTAQT GISFPVGISF FSLFSTYYTP YVMGVMNLSS LQIGRSYISG QPFEYGNASL QLNAMLNGTY WAQDVMLFHE INNRTFQVYM VINFWNLTGP FVSLVQNTTT FDGLGVYCYQ GPTFNITLPV SLSLFMNSSQ HLQFGYSING VKRVYLTLPF HGLFKLGGLS VNGLPNDLEM VWGGPDGGSV VDMIAQGSEE LYFLQGNNLT IVPSALSVGL DTAESAYGVA SSTNLENIKK PFADINRGVN TPSVLWPVPP NINVTQVNST VHVKLYYGNY TFSGQEVEIK VLKGLNLVTL SRGVTNSSGE VTFTNVTQSF YEVYFPGNYS LSQSYALSSP QLNHLINVTT STFDSLVHFL ETYNFKKALS SDFNHIKYHG ETSVNYLLLE VIGGLTAGIL ISAFLVRKYT
|
| |