Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mbur_0255 |
Symbol | |
ID | 3998750 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanococcoides burtonii DSM 6242 |
Kingdom | Archaea |
Replicon accession | NC_007955 |
Strand | + |
Start bp | 230876 |
End bp | 232396 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 637958090 |
Product | thymidine phosphorylase |
Protein accession | YP_565012 |
Protein GI | 91772320 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02645] putative thymidine phosphorylase [TIGR03327] AMP phosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.456015 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACTAA AAGTACAACC CATTGATGTC AAAGTAGGCA AATATAAAGT GATCCTGAAC ACAATTGATG CTAAGGAGCT AGGTGTCCAT GAAGGAGACA GGGTACGTAT AAAGAACCAC GTGACCCTAA CAGCCATCGT AGATTTTACG GAAGATATGA TATCACCTGG CATGATAGGG TTATACCATG AAGTGAAAGA GGCACTGAGT AAGGAATGGA CCGAGACGGT CGAAGTATTC CCTGCTGAAA AACCAAAGTC AACCTATATT ATAAGAAAGA CAATGGATGG ACAGAAGCTC ACAAAGGAAG AGATAGACAT ACTCGTGAAG GACATTGTCG AAGAGAACCT TGCCGAGATA GAAATTGCTG CATTCCTTAC TGCAACATAT ATAAACGACA TGACCGATGA TGAAACCGAA TGGCTTACAC GTGCCATGAT AGATTCTGGG GACAAGCTGG AATTCGATAC CCATCCAATA ATGGACAAGC ATTCCATCGG AGGAGTTCCC GGAAATAAGA TCTCATTGCT CATCGTTCCC ATAGTAGCTG CAAACGGATT ACTTATCCCA AAGACCAGCT CAAGGGCAAT CACTGGTGCA GGAGGGACTG CAGACCTGAT GGAGATACTT GCACCTGTTG AGTTCGATGC TGCTGAGATC AAGAGGATGA CCGAAGAGGT TGGTGGCGTA CTTGTCTGGG GTGGTGCTAC CAATATTGCA CCTGCGGATG ATAAGCTCAT AAAGGTCGAA TACCCGCTTT CAATTGACCC GCATTGCCAG ATGCTTGCAT CCATCATGGC AAAGAAAGGT GCCATTGGTG CAGACCATGT CGTAATGGAC ATACCCACCG GACCCGGTAC AAAGATCAAG AACGTACAGG AAGGAAGAAA GCTCGCAAGG GACCTGATCA ATCTTGGTGA CAGGCTTGGT ATGGATGTGG ATTGTGCTCT GACCTATGGT GCCTCCCCCG TGGGACGCAC TATAGGACCT GCACTTGAAG TGATCGAGGC ATTGAAGGTC CTTGAGAGCT TTGAGGGACC GAACAGCTTG ATAGAGAAGA GTGCATCACT TGCAGGAATG CTTCTTGAGA TGGGGAATGT GGCTGGCAAA GACAAGGGAT ATGACCTTGC CATTGAGACC CTGAAGAACG GAAAAGCACT GACGAAGTTC AAAGAGATCA TCAAGATACA GGGCGGTAAT CCTGATGTGA CCCACAAAGA CATTTCCGTG GGAGAGTTCA CTGAAGATAT CATTGCCCCT AATAACGGAT ATATACTTGA GATGGACAAC AAGCGCCTTG TTCAGATCGC AAGGCTCGCA GGAGCACCTA ATGACAAAGG TGCAGGAATA CTTTTGCATA GGAAACAGGG AGAACCTTTG AAAGAAGGCG ATCCTGTGAT GACCATTTAT GCAGAGAAGA AATCAAAACT TGAAAACGCG GTCAAGAGCG CAAAGGAAAG ACCACCGTTC ATTGTAGAGG GAATGATGCT GGAGCGCATC CAGAGTTTCA AGGAGATATA A
|
Protein sequence | MQLKVQPIDV KVGKYKVILN TIDAKELGVH EGDRVRIKNH VTLTAIVDFT EDMISPGMIG LYHEVKEALS KEWTETVEVF PAEKPKSTYI IRKTMDGQKL TKEEIDILVK DIVEENLAEI EIAAFLTATY INDMTDDETE WLTRAMIDSG DKLEFDTHPI MDKHSIGGVP GNKISLLIVP IVAANGLLIP KTSSRAITGA GGTADLMEIL APVEFDAAEI KRMTEEVGGV LVWGGATNIA PADDKLIKVE YPLSIDPHCQ MLASIMAKKG AIGADHVVMD IPTGPGTKIK NVQEGRKLAR DLINLGDRLG MDVDCALTYG ASPVGRTIGP ALEVIEALKV LESFEGPNSL IEKSASLAGM LLEMGNVAGK DKGYDLAIET LKNGKALTKF KEIIKIQGGN PDVTHKDISV GEFTEDIIAP NNGYILEMDN KRLVQIARLA GAPNDKGAGI LLHRKQGEPL KEGDPVMTIY AEKKSKLENA VKSAKERPPF IVEGMMLERI QSFKEI
|
| |