Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1035 |
Symbol | |
ID | 5104335 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 958934 |
End bp | 960289 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640506931 |
Product | phosphoesterase |
Protein accession | YP_001191124 |
Protein GI | 146303808 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3511] Phospholipase C |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.297271 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGTTGG TCTCGGTAGC TTTGTTCCTA GTACTCCTTC CATCACTTCA CGTGGTATCT AACGTGACGA CCCAGACGGG CACGGCGACC CCTATCAAGC ACGTGATCAT CGTGATAGAT GAAAATCATT CCTTCGATAA CCTCTTCGGG GTTTACCCCT TTGGGGTTCC TCCCATAGTG AATAACGTGA CCTGTTCAGT GATGAGGCCA GTCAACCTAG TAGATGGTCC AGGGAAATTA ATGCAGATAG ACGTTCCCTG GATACCTGGA ATACCCCTGT ACACTCATCC CTTCTACATT AACTCCTCAA CTCCACCAGA CCCGATCGAG GGGTATACCA CATACCACGA AGATTACTGG TACGCCACAC AGGATGGGTT CCCCCTCTTC TCGGGACCGC AATCCATGGG GTACTTCTCC TACGAACAGG TTGGGGTCCT TTGGGATTAC GCGGAGGAAT ACGTCCTCTT TGACAACTAC TACTCCCCTG TCCTTGACGT GACTGAACCC AATAGGATTG CCTACCTAGT GGGTTTCCCT CCATCATTTC ACAACGACGA GGCATCAGGA ATTTACTCCT TCAATGAGAC CATCATGTAC CAGTTAACTG CGCACAATAT CTCGTGGGGG TATTTCGTAT ACGACTTGGA GGGAATACCG TGGCCGATCT CCACCCTCAA GGGTGTGTCA GGAAATTTTT ACAACCTTAG TGTGTTCTAT CAAGACCTTC AGGACGGGAA TCTGCCCAGC GTGTCCTGGG TCATGTTCCT TGGCGGGGAG ACGGGAAAAT ACGACATGCA TCCTCCCGAC AACGTGACCG TTGGTGCAAT TGCCTTCTCA CAGGTGGTGA ATGCGGTCAT GAGGAGTAGG TACTGGAACT CGACGGCGAT CTTCTTTACC TTTGATGAGG GTGGCGGTTA TTATGATCAG GTAACTCCTC CCTTCGTGAA CGGGACGTCA CTGGGCCAGA GGATTCCGTT ACTCGTGATA TCGCCTTACG CAAAGGAGGC CTATGTGGAC AACTACACGG TGTCAGGTTA CACATTGTTA GCCTTCGTGG ATTATAACTG GAAACTTCCT TGGCTGACTC CCTGGGTTCA GAACAGTGAC CTTCAGGGTC TTCTTAACGC GTTCGACTTC TCCGCGATTA GGTCACCCAT CATTCTAACC CCAAGTAACT GGACCTACCC TGTTCCCCTA CAGTACCCGG TGAAGTACGG GTACGTGGCT ACCGTGAACC ACCAAGTGGA TCCCTCGCTA TACTCCTTCT CCTTTCCCGT GTACATCTTT GTGATTTTAT TCGTGTTAGT CGCAGTGGTC ATGGTGAAGA GGAGACGGTC TCGTAAGGTG TCTTAA
|
Protein sequence | MRLVSVALFL VLLPSLHVVS NVTTQTGTAT PIKHVIIVID ENHSFDNLFG VYPFGVPPIV NNVTCSVMRP VNLVDGPGKL MQIDVPWIPG IPLYTHPFYI NSSTPPDPIE GYTTYHEDYW YATQDGFPLF SGPQSMGYFS YEQVGVLWDY AEEYVLFDNY YSPVLDVTEP NRIAYLVGFP PSFHNDEASG IYSFNETIMY QLTAHNISWG YFVYDLEGIP WPISTLKGVS GNFYNLSVFY QDLQDGNLPS VSWVMFLGGE TGKYDMHPPD NVTVGAIAFS QVVNAVMRSR YWNSTAIFFT FDEGGGYYDQ VTPPFVNGTS LGQRIPLLVI SPYAKEAYVD NYTVSGYTLL AFVDYNWKLP WLTPWVQNSD LQGLLNAFDF SAIRSPIILT PSNWTYPVPL QYPVKYGYVA TVNHQVDPSL YSFSFPVYIF VILFVLVAVV MVKRRRSRKV S
|
| |