Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0514 |
Symbol | |
ID | 5103674 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 470094 |
End bp | 471260 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640506418 |
Product | peptidase U32 |
Protein accession | YP_001190613 |
Protein GI | 146303297 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000000252039 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000774187 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGTATAACA AGTTTAAATA CAATAAAAAT GATCTAAATT TCATGAGGCT CGTTGTAGGA ACAAATTTCG ACGATGAACT CATAGGGAAA ATAAAGGAAT ACCCAGTTAG CCACATCTTT GGGAGTCACA CAAAAACCTT GACGGGACAC GGGAGAGCTT CCTTCATCCT CCCACAGGTT GACGATGAGA GGTTCAAGGC CCATCTCGAC GTCGTGCATG AGGCTGGAAT AAAGTTCCTT TATACCATGA ATACTGCTAC GCTGAACGGT GGGGAATACT CTGAGAAGTT CGTGAAGAGG TTATCAGAGG AAATTGAAAG ACTCGTGGGT TTCGGAGTAG ATGGCTTCGT CGTGGCTCTA CCCTTTCTAG TCAGGTTAAT AAAGAGGGAG CATCCGGAGT TGGAGGTGTC TATCTCGTCC TACGCTAGAG TCTACAATAT CAGGGAGGTT GAGAACTTCA TGGAACTTGG GGCGGACACG GTGATACTTC ACGAGGACGA TAACAGGAAC TTCAGGTTGT TGAGATCTCT ACAGAAGTTA CAGAGGAGGG TTGATTTCGA GCTTATTACC AACAATTCTT GCCTTTGGGG TTGCGTCTAT AGGAGAACGC ATGATATAGT CTCGTCACAG AGCTCAGTTG AGGGGGGAAT AGAGGCGTGG TTTGAGTATC CCATTCTCTT CTGTGCTACA GACGTTAGGA ACGACTTGGC TAACATCATT AGGATGAGAT GGATAAGGCC AGAGGACCTG GTAGTATATG AAGGCCTGGG ATTTGATAGG TTCAAAATTG CGGGAAGGAA CAAGAGGACA GAGTGGTTAG TTAGGGCGGT AAAAGCTTAC GCCAACAGGA AGTACGACGG CAACTTGCTG GACATAGTCA GCTACCCTCA GGGAAGGGCT GTCCCGAAGG TAATGGAGAA GGTGGGAGGT CCTAAGGATT ATGACGTGTT AAAGGAGGTT TACGTGGATA ACACAAAGTT TCCGCCCAAT TGGCTGAGCT TTTTCAGGTA TAACCAATGC GAGGAGAGAT CTTGCTCAGA GTGCGGTTAC TGCACTGCAG TGGCAAGGGA AGTTATGAGG GTTGAGGGGA AAGAGATCTC TGAACTTGAC TTAGGGAAGA TTCAAGCGCC CATAGATCTA ATTCCGAGGT TTGGTGGAAA TGGTTAG
|
Protein sequence | MYNKFKYNKN DLNFMRLVVG TNFDDELIGK IKEYPVSHIF GSHTKTLTGH GRASFILPQV DDERFKAHLD VVHEAGIKFL YTMNTATLNG GEYSEKFVKR LSEEIERLVG FGVDGFVVAL PFLVRLIKRE HPELEVSISS YARVYNIREV ENFMELGADT VILHEDDNRN FRLLRSLQKL QRRVDFELIT NNSCLWGCVY RRTHDIVSSQ SSVEGGIEAW FEYPILFCAT DVRNDLANII RMRWIRPEDL VVYEGLGFDR FKIAGRNKRT EWLVRAVKAY ANRKYDGNLL DIVSYPQGRA VPKVMEKVGG PKDYDVLKEV YVDNTKFPPN WLSFFRYNQC EERSCSECGY CTAVAREVMR VEGKEISELD LGKIQAPIDL IPRFGGNG
|
| |