Gene Msed_0514 details

Gene Information

Locus tag: Msed_0514
Symbol:
ID: 5103674
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 470094
End bp: 471260
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 47%
IMG OID: 640506418
Product: peptidase U32
Protein accession: YP_001190613
Protein GI: 146303297
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0826] Collagenase and related proteases
TIGRFAM ID:
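
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 470094
end_bp = 471260
gene_length_bp = 1167
protein_length_aa = 388

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes one stop codon that is not translated.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("gene length is a multiple of 3:", gene_length_bp % 3 == 0)
print("codons minus stop match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values (1167 bp, 389 codons, 388 residues).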


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000000252039
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000774187
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
TTGTATAACA AGTTTAAATA CAATAAAAAT GATCTAAATT TCATGAGGCT CGTTGTAGGA 
ACAAATTTCG ACGATGAACT CATAGGGAAA ATAAAGGAAT ACCCAGTTAG CCACATCTTT
GGGAGTCACA CAAAAACCTT GACGGGACAC GGGAGAGCTT CCTTCATCCT CCCACAGGTT
GACGATGAGA GGTTCAAGGC CCATCTCGAC GTCGTGCATG AGGCTGGAAT AAAGTTCCTT
TATACCATGA ATACTGCTAC GCTGAACGGT GGGGAATACT CTGAGAAGTT CGTGAAGAGG
TTATCAGAGG AAATTGAAAG ACTCGTGGGT TTCGGAGTAG ATGGCTTCGT CGTGGCTCTA
CCCTTTCTAG TCAGGTTAAT AAAGAGGGAG CATCCGGAGT TGGAGGTGTC TATCTCGTCC
TACGCTAGAG TCTACAATAT CAGGGAGGTT GAGAACTTCA TGGAACTTGG GGCGGACACG
GTGATACTTC ACGAGGACGA TAACAGGAAC TTCAGGTTGT TGAGATCTCT ACAGAAGTTA
CAGAGGAGGG TTGATTTCGA GCTTATTACC AACAATTCTT GCCTTTGGGG TTGCGTCTAT
AGGAGAACGC ATGATATAGT CTCGTCACAG AGCTCAGTTG AGGGGGGAAT AGAGGCGTGG
TTTGAGTATC CCATTCTCTT CTGTGCTACA GACGTTAGGA ACGACTTGGC TAACATCATT
AGGATGAGAT GGATAAGGCC AGAGGACCTG GTAGTATATG AAGGCCTGGG ATTTGATAGG
TTCAAAATTG CGGGAAGGAA CAAGAGGACA GAGTGGTTAG TTAGGGCGGT AAAAGCTTAC
GCCAACAGGA AGTACGACGG CAACTTGCTG GACATAGTCA GCTACCCTCA GGGAAGGGCT
GTCCCGAAGG TAATGGAGAA GGTGGGAGGT CCTAAGGATT ATGACGTGTT AAAGGAGGTT
TACGTGGATA ACACAAAGTT TCCGCCCAAT TGGCTGAGCT TTTTCAGGTA TAACCAATGC
GAGGAGAGAT CTTGCTCAGA GTGCGGTTAC TGCACTGCAG TGGCAAGGGA AGTTATGAGG
GTTGAGGGGA AAGAGATCTC TGAACTTGAC TTAGGGAAGA TTCAAGCGCC CATAGATCTA
ATTCCGAGGT TTGGTGGAAA TGGTTAG
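
A GC content figure such as the one listed for this gene is the percentage of G and C bases in the sequence. The sketch below shows one way to compute it from the space-separated 10-base blocks used here; the function name `gc_content` is hypothetical, and the example runs only on the first block of the gene sequence, not the whole gene.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)

# First 10-base block of the gene sequence: one G and one C out of ten bases.
first_block_gc = gc_content("TTGTATAACA")
print(first_block_gc)
```

Passing the full gene sequence to the same function gives the whole-gene percentage, which can then be compared with the GC content field.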
 
Protein sequence
MYNKFKYNKN DLNFMRLVVG TNFDDELIGK IKEYPVSHIF GSHTKTLTGH GRASFILPQV 
DDERFKAHLD VVHEAGIKFL YTMNTATLNG GEYSEKFVKR LSEEIERLVG FGVDGFVVAL
PFLVRLIKRE HPELEVSISS YARVYNIREV ENFMELGADT VILHEDDNRN FRLLRSLQKL
QRRVDFELIT NNSCLWGCVY RRTHDIVSSQ SSVEGGIEAW FEYPILFCAT DVRNDLANII
RMRWIRPEDL VVYEGLGFDR FKIAGRNKRT EWLVRAVKAY ANRKYDGNLL DIVSYPQGRA
VPKVMEKVGG PKDYDVLKEV YVDNTKFPPN WLSFFRYNQC EERSCSECGY CTAVAREVMR
VEGKEISELD LGKIQAPIDL IPRFGGNG
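
The gene sequence begins with TTG rather than ATG, while the protein sequence begins with M. This is consistent with translation table 11, where TTG is an allowed initiation codon and an initiation codon is translated as methionine. The sketch below illustrates this on the first 30 bases only; the codon lookup is deliberately partial (just the codons in that prefix), and the function name `translate_prefix` is hypothetical.

```python
# Partial codon lookup: only the codons present in the first 30 bases of the gene.
CODON_TABLE = {
    "TTG": "L", "TAT": "Y", "AAC": "N", "AAG": "K",
    "TTT": "F", "AAA": "K", "TAC": "Y", "AAT": "N",
}

# Initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}

def translate_prefix(dna: str) -> str:
    """Translate a short in-frame DNA prefix, reading the first codon as a start."""
    bases = "".join(dna.split()).upper()
    usable = len(bases) - len(bases) % 3
    codons = [bases[i:i + 3] for i in range(0, usable, 3)]
    residues = []
    for index, codon in enumerate(codons):
        if index == 0 and codon in START_CODONS:
            residues.append("M")  # an initiation codon is read as methionine
        else:
            residues.append(CODON_TABLE[codon])
    return "".join(residues)

# First three 10-base blocks of the gene sequence.
prefix_protein = translate_prefix("TTGTATAACA AGTTTAAATA CAATAAAAAT")
print(prefix_protein)  # MYNKFKYNKN
```

The result matches the first ten residues of the protein sequence above. A full translation would need the complete codon table and stop-codon handling, which this sketch omits.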