Gene Information |
Locus tag | Mthe_0462 |
Symbol | |
ID | 4462619 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | + |
Start bp | 478003 |
End bp | 479541 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639699464 |
Product | thymidine phosphorylase |
Protein accession | YP_842893 |
Protein GI | 116753775 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02645] putative thymidine phosphorylase [TIGR03327] AMP phosphorylase |
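The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the last codon of the CDS is a stop codon; the function names are hypothetical and only serve this illustration.

```python
# Values copied from the Gene Information table above.
START_BP = 478003
END_BP = 479541
GENE_LENGTH_BP = 1539
PROTEIN_LENGTH_AA = 512


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1539
print(expected_protein_length(GENE_LENGTH_BP))  # 512
```

Under these assumptions, 479541 - 478003 + 1 gives the listed 1539 bp, and 1539 / 3 - 1 gives the listed 512 aa.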
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCGAGG TTGTTCCTTT CGATATAGAG ATCGGGCAGT ACAAGGTAAT GCTCAACATC GCAGACGCCA GAGCGATGGG CCTCAACCCG GGAGACAGGG TTCGGGTGAG AACGAGAGGC GCATCTCTGA CCGCGATTCT TGACGTGACT GGACAGATGA TCGGGCAGGG GCAGGTCGGC ATATTCACAG AGGCGTTCAG GGATCTGAAG GAGGCGAAGA GCGTGGAGAT ATCGCCCGCC CCAAGGCCTG CCTCGATATC ATACATAAAG ATGCTGATGG ACAGGCAGAA GCTGAGCGAG GATCAGATCA GGAGCATAGT GAGGGATATC GTCTATAACA ACCTCAGCGA GATCGAGCTC TCAGCCTACA TCACAGCATC CTACATCCAT AACCTGGATC CACAGGAGAC GGAGTGGCTC ACCAGGGCGA TGATAGAGAC TGGCGAGAGG ATATACTTCG ATAAGCATCC TGTCGTGGAC AAGCACAGCA TAGGTGGGGT TCCAGGGAAC AAGGTCTCGA TGCTCGTGGT TCCGATTGTC GCAGCCTCAG GCCTGCTCAT ACCGAAGACA AGCTCGAGGG CGATAACAGG CGCAGGCGGG ACCGCGGATC TGATGGAGGT TCTTGCGCCC GTTGAGTTCA CAGCCGATGA GATCAAGGAG ATCACCGAAA CCGTGGGAGG GGTCATCGCA TGGGGCGGCG CCACAAACAT AGCGCCGGCG GACGACAGGT TGATAAAAGC GGAGTACGCC CTCGCCATCG ATCCCTACAG CCAGATGCTC GCCTCGATAA TGGCGAAGAA GGGGGCAGTC GGAGCTGACG CTGTCGTGGT CGATATGCCC ACAGGCCCTG GAACGAAGCT GGAGACGCCG GAGAAGGCGA GGGTACTCGC GAAAGATCTC ACAGACCTTG GAGAGCGGCT GGGGATCAGG GTGGAGTGCG CGATGACCTT TGGCGGCTCT CCCGTCGGTC GCACAGTGGG ACCTGCTCTC GAGGTCAGAG AGGCTTTAAA GATGCTGGAG ACAGGCGAGG GGCCGAACAG CCTCAGAGAG AAGAGCCTCG CCCTCGCAGG CATACTCCTG GAGATGGGAG GGGTTGCGGC CAGGGGCGAG GGATACAGAG CCGCGGAGGA GATACTGGTG AGCGGAAAGG CCCACAGGAA GCTCATGGAG ATCGTAGAGG CGCAGGGAGG GGATCCGAAG ATAAGGAGCG AGGACATCCA GATCGGAGAG CATCAGAAGC AGATACTCTC CCCGACGAAC GGATACGTGG TCGCATTCTA CAACAAGAGA ATCATAGAGA TCGCGAGGGC AGCAGGCGCG CCCGGTGACA AGAGGGCGGG AGTGATAATA CACAAGAAGA TGGGAGAGAT CGTGAAGAAG GGTGAGCCGC TGCTCACGAT ATGCTCCAGC ACAGACTGGG AGCTGGAGTG CGCGGTGAAG ATGTGCTCGA TGAGGGACGC CTTGGAGCAG CCGCCGATAG TCGTGGAGGG CATGCTGCTC GAGAGGTATC CGACCGAGAG GTATCCGAGA ACGATATGA
|
Protein sequence | MFEVVPFDIE IGQYKVMLNI ADARAMGLNP GDRVRVRTRG ASLTAILDVT GQMIGQGQVG IFTEAFRDLK EAKSVEISPA PRPASISYIK MLMDRQKLSE DQIRSIVRDI VYNNLSEIEL SAYITASYIH NLDPQETEWL TRAMIETGER IYFDKHPVVD KHSIGGVPGN KVSMLVVPIV AASGLLIPKT SSRAITGAGG TADLMEVLAP VEFTADEIKE ITETVGGVIA WGGATNIAPA DDRLIKAEYA LAIDPYSQML ASIMAKKGAV GADAVVVDMP TGPGTKLETP EKARVLAKDL TDLGERLGIR VECAMTFGGS PVGRTVGPAL EVREALKMLE TGEGPNSLRE KSLALAGILL EMGGVAARGE GYRAAEEILV SGKAHRKLME IVEAQGGDPK IRSEDIQIGE HQKQILSPTN GYVVAFYNKR IIEIARAAGA PGDKRAGVII HKKMGEIVKK GEPLLTICSS TDWELECAVK MCSMRDALEQ PPIVVEGMLL ERYPTERYPR TI
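The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch using only the Python standard library, not the pipeline that generated this record. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical, and only the first 30 and last 9 bases of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Codon assignments shared by the standard code and translation table 11,
# listed in TCAG order for each of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; stop codons become '*'."""
    # The record prints the sequence in space-separated blocks of 10 bases.
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 and last 9 bases of the gene sequence in this record.
GENE_HEAD = "ATGTTCGAGG TTGTTCCTTT CGATATAGAG"
GENE_TAIL = "ACGATATGA"

print(translate(GENE_HEAD))  # MFEVVPFDIE
print(translate(GENE_TAIL))  # TI*
```

The head translates to the first ten residues of the protein sequence, and the tail to its last two residues followed by the stop codon, which is not counted in the 512 aa protein length.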