Gene Mthe_0462 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_0462 
Symbol 
ID4462619 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp478003 
End bp479541 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content58% 
IMG OID639699464 
Productthymidine phosphorylase 
Protein accessionYP_842893 
Protein GI116753775 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02645] putative thymidine phosphorylase
[TIGR03327] AMP phosphorylase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCGAGG TTGTTCCTTT CGATATAGAG ATCGGGCAGT ACAAGGTAAT GCTCAACATC 
GCAGACGCCA GAGCGATGGG CCTCAACCCG GGAGACAGGG TTCGGGTGAG AACGAGAGGC
GCATCTCTGA CCGCGATTCT TGACGTGACT GGACAGATGA TCGGGCAGGG GCAGGTCGGC
ATATTCACAG AGGCGTTCAG GGATCTGAAG GAGGCGAAGA GCGTGGAGAT ATCGCCCGCC
CCAAGGCCTG CCTCGATATC ATACATAAAG ATGCTGATGG ACAGGCAGAA GCTGAGCGAG
GATCAGATCA GGAGCATAGT GAGGGATATC GTCTATAACA ACCTCAGCGA GATCGAGCTC
TCAGCCTACA TCACAGCATC CTACATCCAT AACCTGGATC CACAGGAGAC GGAGTGGCTC
ACCAGGGCGA TGATAGAGAC TGGCGAGAGG ATATACTTCG ATAAGCATCC TGTCGTGGAC
AAGCACAGCA TAGGTGGGGT TCCAGGGAAC AAGGTCTCGA TGCTCGTGGT TCCGATTGTC
GCAGCCTCAG GCCTGCTCAT ACCGAAGACA AGCTCGAGGG CGATAACAGG CGCAGGCGGG
ACCGCGGATC TGATGGAGGT TCTTGCGCCC GTTGAGTTCA CAGCCGATGA GATCAAGGAG
ATCACCGAAA CCGTGGGAGG GGTCATCGCA TGGGGCGGCG CCACAAACAT AGCGCCGGCG
GACGACAGGT TGATAAAAGC GGAGTACGCC CTCGCCATCG ATCCCTACAG CCAGATGCTC
GCCTCGATAA TGGCGAAGAA GGGGGCAGTC GGAGCTGACG CTGTCGTGGT CGATATGCCC
ACAGGCCCTG GAACGAAGCT GGAGACGCCG GAGAAGGCGA GGGTACTCGC GAAAGATCTC
ACAGACCTTG GAGAGCGGCT GGGGATCAGG GTGGAGTGCG CGATGACCTT TGGCGGCTCT
CCCGTCGGTC GCACAGTGGG ACCTGCTCTC GAGGTCAGAG AGGCTTTAAA GATGCTGGAG
ACAGGCGAGG GGCCGAACAG CCTCAGAGAG AAGAGCCTCG CCCTCGCAGG CATACTCCTG
GAGATGGGAG GGGTTGCGGC CAGGGGCGAG GGATACAGAG CCGCGGAGGA GATACTGGTG
AGCGGAAAGG CCCACAGGAA GCTCATGGAG ATCGTAGAGG CGCAGGGAGG GGATCCGAAG
ATAAGGAGCG AGGACATCCA GATCGGAGAG CATCAGAAGC AGATACTCTC CCCGACGAAC
GGATACGTGG TCGCATTCTA CAACAAGAGA ATCATAGAGA TCGCGAGGGC AGCAGGCGCG
CCCGGTGACA AGAGGGCGGG AGTGATAATA CACAAGAAGA TGGGAGAGAT CGTGAAGAAG
GGTGAGCCGC TGCTCACGAT ATGCTCCAGC ACAGACTGGG AGCTGGAGTG CGCGGTGAAG
ATGTGCTCGA TGAGGGACGC CTTGGAGCAG CCGCCGATAG TCGTGGAGGG CATGCTGCTC
GAGAGGTATC CGACCGAGAG GTATCCGAGA ACGATATGA
 
Protein sequence
MFEVVPFDIE IGQYKVMLNI ADARAMGLNP GDRVRVRTRG ASLTAILDVT GQMIGQGQVG 
IFTEAFRDLK EAKSVEISPA PRPASISYIK MLMDRQKLSE DQIRSIVRDI VYNNLSEIEL
SAYITASYIH NLDPQETEWL TRAMIETGER IYFDKHPVVD KHSIGGVPGN KVSMLVVPIV
AASGLLIPKT SSRAITGAGG TADLMEVLAP VEFTADEIKE ITETVGGVIA WGGATNIAPA
DDRLIKAEYA LAIDPYSQML ASIMAKKGAV GADAVVVDMP TGPGTKLETP EKARVLAKDL
TDLGERLGIR VECAMTFGGS PVGRTVGPAL EVREALKMLE TGEGPNSLRE KSLALAGILL
EMGGVAARGE GYRAAEEILV SGKAHRKLME IVEAQGGDPK IRSEDIQIGE HQKQILSPTN
GYVVAFYNKR IIEIARAAGA PGDKRAGVII HKKMGEIVKK GEPLLTICSS TDWELECAVK
MCSMRDALEQ PPIVVEGMLL ERYPTERYPR TI