Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1577 |
Symbol | |
ID | 5104022 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1526471 |
End bp | 1527694 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640507463 |
Product | metal dependent phosphohydrolase |
Protein accession | YP_001191656 |
Protein GI | 146304340 |
COG category | [R] General function prediction only |
COG ID | [COG1078] HD superfamily phosphohydrolases |
TIGRFAM ID | [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00516026 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAAATTAA TAAGAGATCC AGTTCATGGT TACATAGAGG TACCAGATAG GCTACTTCCC ATCGTATCAA ATCCCCTTTT TCAGCGTCTA AGATACGTGA AGCAGACTGC CTTAGCTTAC ATGGTGTACC CCGGGATGAA CCATACGCGT TTTGAGCATA GCCTCGGGGT CATGCACCTT TCCCTTGAGT TCCTAAAGTA CATCCGGGAA AATTCCTCCC TCGAGTTAGA CCAAGAGGTC ATGGAACTGA TAGCTGTAAC TGCGATGCTC CATGACGTGG GTCACCTTCC CTTCTCCCAC ACCTTCGAGA ACGCCCTTCA GGTCGCGAGG CAGGTTTATG GCGTAGATGT GTTTGAGAAA GATAAGAAAA CCCACGTAAT TCTAGGCACT CAACTGATTG AGAAGGGTCT CGCATCTGAC CTGGAGAAGA ATTTCAAGAC ATTTGAGGAC CCCGTCAAGT TCGTCACTAG AGTGCTGATG GAGACACCGC GTAACAGGGA AGAGAAATTA GCTCATCTAA TAATTTCCAA TTTCATCGAC GCAGATAGGA GCGATTACCT TCTCAGGGAT TCATATTATG CAGGCGTGGA ATATGGGCAG TTCGACATCG AGAGAATGAA GAGATTTCTA TATTTCGAGA ATGACATGTT GGTTGTTCTA GGCAAAGTCT TGCCCATAGT GGAGCAATTT CTCCTTGCAA GGATGTACAT GTTTCAGAAC GTGTATTTCC ACAGTGTTGT TGGAATGTAT AACGCTATTC TTTCACAGGC TATAGCCCAG TTACTTAGGG ATGGGATCAT AGAGATGCCA GTGGAAGTGG AGGATTTTTT AGTATTCGAT GATAATTTTG TGATTTCCAA GCTACAACAT GTCAGGCGTG AGTTACGCGA TGCCATAATG TATAGGCAAG GTTTCAGGAG AATCAAGATA GAACCCAGTC CTAACTGCAT TGAGAAATTG GAGTCGATAA AACAGGAGAT ACGTGAGGAT ATGAGATCGT CGGGTGGTCT CCTCATTTAT CACGAATTCA ACGATGTTCC ATATAGGGAG GAGAAGGATG AGGCTGTTTA TATACTCACC CCTCATGGAG TGGAAAGGTT GAAAAGGGTA TCTCCCTTGA TTGGTTCACT GAGCGAAATA AGGAAAGTGG TAGTGGGATA TCACGTGAGC AGAGACGATC TGGGAAGAAA ATATGAGAAG ATATTGAGCG AATGTAAGGA CTAG
|
Protein sequence | MKLIRDPVHG YIEVPDRLLP IVSNPLFQRL RYVKQTALAY MVYPGMNHTR FEHSLGVMHL SLEFLKYIRE NSSLELDQEV MELIAVTAML HDVGHLPFSH TFENALQVAR QVYGVDVFEK DKKTHVILGT QLIEKGLASD LEKNFKTFED PVKFVTRVLM ETPRNREEKL AHLIISNFID ADRSDYLLRD SYYAGVEYGQ FDIERMKRFL YFENDMLVVL GKVLPIVEQF LLARMYMFQN VYFHSVVGMY NAILSQAIAQ LLRDGIIEMP VEVEDFLVFD DNFVISKLQH VRRELRDAIM YRQGFRRIKI EPSPNCIEKL ESIKQEIRED MRSSGGLLIY HEFNDVPYRE EKDEAVYILT PHGVERLKRV SPLIGSLSEI RKVVVGYHVS RDDLGRKYEK ILSECKD
|
| |