Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0220 |
Symbol | |
ID | 5104086 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 180729 |
End bp | 182330 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640506125 |
Product | type I phosphodiesterase/nucleotide pyrophosphatase |
Protein accession | YP_001190321 |
Protein GI | 146303005 |
COG category | [S] Function unknown |
COG ID | [COG3379] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000131562 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0129038 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAAAGTTT TATTGGTTGT TGTGGACGGG CTGGCCTATC ATTTAATGGA GAGATTCATA GATCAACTTC CCACTATTCA GGAGATGGCG GAGGAAGGGG TGTATGGTCC CCTTGAGAGC ACTTTCCCGT CCATAACCCC TGTGGCTTTA GCCTCTCTTT TCACTGGGGT ATCACCAAAG GTTCACGGTG TTGTAGCCCC CAGAATATTC GTGAAGGGAA GAAAGATTCA GTCTAGCATA TCTGCCTTTT CCAGTAGCTC ACTCATGGTA GACCCTATCT GGGCTACGCT GGGAAGGAAA GGTTACAAGG TTGTGATAAC CTCTGCTCCT CAGGCCTTAC CAGATAAATG GAAACTAGAT AATGTGATTC TGTTCGATCC ATACAAGGCC AAGGTTAAGA GATGCTCCGA GGGAACCCTC CTCAAGGAGG GCGAAAACGA GTTTATTGGA AAGAAGTGGA GCGTAAAGGT AGAGGACCAG ATTTACCTTG TGGGAGTGGA AGGACAAGAA TTCAAAGTGG AGATGGGGTC ATGGATTGGA CCTCTGGAAG TAAACGGAAA GTGCGGAGAG GAAGAGTTGA AGGCTTCTAT CTTTCTTCAT GCAACTCCCC GTGGAGTATA TGTAACTCCG CCAGCCTTTC TGAACTTTAA GTGGGGAAAC AACAGGGACC TTCTCCTTGA GGTTTGGGAG AATGTAGTCA AGAAAGTTGG AATGTTGCTG GACGGTGATT ATAAGGGATT AAACAAGGGC CTAATCACGT TCGATGAGTA TTTAAAGACA GCTGAACTCT CCTTTAACTT TTTCGTGGAA TACTCGCTCT ACCTCCTTAG GAGAAGCGAT TGGGATTTTG GAGTCACCTA TCTTCCGATA GTGGATAATC TTCAGCACCT TTTGTATGGC GTGGATGATG GGAAGGCACT AGAGCACATT TTTCAGGCTT ACAAAATGGC TGATAAGTTT CTAATGTTGC ATAGGAGTTT AGCAGAAAAT ATATTCCTTT GTTCCGACCA CGGAATAACA AAAATAAAGA AGAGAGTCTA TGTCAATAAA ATATTGGAGA GGTTAAACGT GTTGAAAATG GACGACGGTA AAATAAACTG GGGGAAAACT AAGGCCTACT ATGGCGGGGG AGGCTTAATC AGGATAAACC TTAAAGATAG GGAGGAAGCC GGCGTGGTCT ATCCGAAGGA GTATCAGAAG CTGGTAAGGT ATATCGTGAA AAACCTAGAG GATCTTAAGG ACGATGATGG TGAATCCATT TTCACGGGTA TTTACATGAG GGACACTCCT GCCTCAGACA GACAGGGTGA CATTGAGCTT AGCATTAGGG ACTATTATTC GCTCAGTTCC AACGTTGATC ATGAGAACGA GATAGATACC GTGAAACCCT ATTCCACTTC CACGGGAGAT CACGGGTTTT ACAGGAAGGA GGACCTTTAT GGAGTGATCC TAGGAATAGG ACCCAAAATA GCCAGGGGTA AGAAGATAAA GGCCAAGATC GTCGACATAG CTCCCACAAT CCTGAAGATT ATGGACGTTC AAGGTCCTAA GATGGAGGGA AGGGTCCTAG TGGAGGCCCT GAGTAATGGA GGTCAGGAGT AA
|
Protein sequence | MKVLLVVVDG LAYHLMERFI DQLPTIQEMA EEGVYGPLES TFPSITPVAL ASLFTGVSPK VHGVVAPRIF VKGRKIQSSI SAFSSSSLMV DPIWATLGRK GYKVVITSAP QALPDKWKLD NVILFDPYKA KVKRCSEGTL LKEGENEFIG KKWSVKVEDQ IYLVGVEGQE FKVEMGSWIG PLEVNGKCGE EELKASIFLH ATPRGVYVTP PAFLNFKWGN NRDLLLEVWE NVVKKVGMLL DGDYKGLNKG LITFDEYLKT AELSFNFFVE YSLYLLRRSD WDFGVTYLPI VDNLQHLLYG VDDGKALEHI FQAYKMADKF LMLHRSLAEN IFLCSDHGIT KIKKRVYVNK ILERLNVLKM DDGKINWGKT KAYYGGGGLI RINLKDREEA GVVYPKEYQK LVRYIVKNLE DLKDDDGESI FTGIYMRDTP ASDRQGDIEL SIRDYYSLSS NVDHENEIDT VKPYSTSTGD HGFYRKEDLY GVILGIGPKI ARGKKIKAKI VDIAPTILKI MDVQGPKMEG RVLVEALSNG GQE
|
| |