Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Memar_0551 |
Symbol | |
ID | 4846965 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanoculleus marisnigri JR1 |
Kingdom | Archaea |
Replicon accession | NC_009051 |
Strand | + |
Start bp | 532337 |
End bp | 533860 |
Gene Length | 1524 bp |
Protein Length | 507 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640115232 |
Product | thymidine phosphorylase |
Protein accession | YP_001046466 |
Protein GI | 126178501 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02645] putative thymidine phosphorylase [TIGR03327] AMP phosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000734459 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTGA CTGTGAAACT GCTCGACATC GAAAACCGGG GGGTTCTCCT CCACTGCACC GATGCACGGA GCATGCGGGT TCGCGACGGC GACCGCGTCC AGATCGTCGA CGAAGCGACC GGAAAGACTG CCCAGGCTCA CGTCGACACC ACCGGTTCGC TCATCGAGCC GGGCGTCATA GGCGTCTACC GGCCGGTGAA CGCCACGCTC GCCGTCGACG AGGGAACTCC CGTCGAGGTC CGCGGCGCCG AGCGGCCGGC ATCCCTTGAG CACATCAAGA AGAAGATGGA CGGCGGCCGG TTCACCAAAG ACGATACCGT GGATATCGTC AGGGACATCG TCGACGACGT CCTCTCGCCC GGCGAGATCA CCGCTTACGT CACCGCTTCC TACATCAACG GCCTCGACAT GGACGAGGTG GAGTACCTGA CGAGGGCCAC GGTCGAGACC GGAGAACGCC TCCACTTCAC CCGGCACCCC ATCGTCGACA AACACTCCAT CGGAGGTGTC CCGGGAAACA AAATCACGCT CCTGATCGCC CCGATCATCG CCGCGAGCGG TCTTTTGATC CCCAAGACCA GCTCCCGCGC CATCACCGGC GCAGGGGGAA CCGCGGATCT CATGGAAGTC CTCGCGCCCG TCTCGTTCCC GGCGCTCGAG GTGCAGCAGA TGACCGAGAA GGTCGGCGGC GCCATCGTCT GGGGCGGTGC CACGAACATC GCCCCCGCCG ACGACAAGAT CATCACTTAT GAATACCCAC TCCGGATCGA CGCCCGTGGC CAGATGATCG CGAGCGTCAT GGCAAAGAAG TTCGCCGTCG GCGCCGACCT CGTGGTCATC GATATTCCGG TCGGCCGGAA CACCAAGATC GCGACTGCCC AGGAGGGACG GAAACTCGCC CGGGAGTTCA TCGATCTCGG GGAACGGCTC GGGATGCGGG TCGAGTGTGC GTTAAGCTAC GGGGAGTCGC TCGTCGGCCA CACCATCGGC CCGAACCTCG AGGTGCGCGA GGCGCTCGCC GTCCTCGAGG GGGCGACCGA GCCGAACTCC CTGATCCAGA AGAGCCTCTC CCTCGCCGGG ATCGCGCTCG AGATGGCCGG GAAGGCCGGG CCGGGGCAGG GTGCCCGGGC GGCCGCCGAT ATCCTCCGGA GCGGCAAGGC GCTCGAGAAG ATGCGGCAGA TCATCGAGAT CCAGGGTGGT GATCCGAACG TAAAGGCCGA GGATATCGTT CCGGGAGAGT GCAGGTTCGA TGTAAATGCG CCGCAGGATG GTTACGTCAT CGAACTGAAC AACAGCGCTC TCGTCACGCT CGCCCGGCTC GCCGGTTCGC CCTATGACCA CGGCGCGGGC CTGCTCGTCC ACGCAAAGAA AGGAACCCGG GTCCGGAAAG GCGATCCCAT CTTCACCATC TACGCCGACC GCGAATGGCG GCTCGAGCGT GCGATCGAGG TCGGCCGGAC GCTGATGCCG GTGCTTGTGG AAGGTATGGT ACTTGAACGT ATCCCTCATG AACGGTGGGT GTAA
|
Protein sequence | MKLTVKLLDI ENRGVLLHCT DARSMRVRDG DRVQIVDEAT GKTAQAHVDT TGSLIEPGVI GVYRPVNATL AVDEGTPVEV RGAERPASLE HIKKKMDGGR FTKDDTVDIV RDIVDDVLSP GEITAYVTAS YINGLDMDEV EYLTRATVET GERLHFTRHP IVDKHSIGGV PGNKITLLIA PIIAASGLLI PKTSSRAITG AGGTADLMEV LAPVSFPALE VQQMTEKVGG AIVWGGATNI APADDKIITY EYPLRIDARG QMIASVMAKK FAVGADLVVI DIPVGRNTKI ATAQEGRKLA REFIDLGERL GMRVECALSY GESLVGHTIG PNLEVREALA VLEGATEPNS LIQKSLSLAG IALEMAGKAG PGQGARAAAD ILRSGKALEK MRQIIEIQGG DPNVKAEDIV PGECRFDVNA PQDGYVIELN NSALVTLARL AGSPYDHGAG LLVHAKKGTR VRKGDPIFTI YADREWRLER AIEVGRTLMP VLVEGMVLER IPHERWV
|
| |