Gene Information |
Locus tag | Moth_2275 |
Symbol | |
ID | 3831386 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2382738 |
End bp | 2384045 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637830195 |
Product | thymidine phosphorylase |
Protein accession | YP_431105 |
Protein GI | 83591096 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02644] pyrimidine-nucleoside phosphorylase |
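The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal illustration, not part of the record. It assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes one terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Expected residue count for an unspliced CDS.

    Assumes the gene length includes exactly one terminal stop codon,
    which is not translated into a residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Moth_2275).
length = gene_length(2382738, 2384045)
print(length)                  # 1308
print(protein_length(length))  # 435
```

Under these assumptions both results agree with the Gene Length (1308 bp) and Protein Length (435 aa) fields.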
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000000178657 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000000323962 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAGATGC TTGACTTAAT CCGTCGCAAG CGGGAGGGCC AGGCCCTGGC CCCAGCCGAA ATTGAGGCCA TGATACGGGA TTATACGGCC GGGATAATCC CCGACTATCA GATGGCAGCC TTTCTCATGG CCGTCTATTT TCGCGGCCTG GACCGGGAGG AAACGGCGGC CCTCACCAGG GCCATGATAG CCTCGGGGGA ACAGATTGAG TGGAGTTCCA TCCCGGGGGT GAAGGTCGAC AAGCACAGTA CCGGAGGTGT GGCCGACACC ACCACCTTGG TCCTGGCGCC CCTGGTGGCC GCCGCCGGAG TGCCGGTAGT TAAGATGTCC GGCCGCGGCC TGGGACACAC TGGGGGCACT ATTGACAAAC TGGAATCCAT CCCCGGCTTC AGGGTGCAGC TGACGCGGGA AGAGATGATT CGCCAGGTAA AGGAGATCGG CCTGGCCGTC ACTGCTCCCA CGGGGAAGCT GGCCCCGGCT GACGGCAAGC TCTACGCCCT GCGGGACGTC ACAGCGACTG TTGAGAGCAT ACCCCTCATT GCCAGCAGTG TAATGAGCAA AAAGATCGCC GCTGGCGCCG ACGCCATAGT CCTCGATGTC AAGGTTGGCA GCGGCGCCTT TATGCCCGAC CTGGAGTCGG CCCGGGAACT GGCCCGGATC ATGGTGGATC TGGGCCGGGA GATGGGGCGG CGGGTGGTAG CTGTGATTAC CAATATGGAC GAACCCCTGG GGATGATGGT GGGCAACGCC CTGGAAGTCG GGGAGGCCAT CGCCGTTTTA TCCGGCGGCG GGCCGCGGGA GTTGCGGGAG GTTTGCCTCA CCCTGGGCAG CCAGATGCTT CTACTGGCCG GGGCTACCGG TAGTGACGGT GAGGCGCGCC GGCGTCTGGA GGAGCTCCTG GCCGGTGGCG CCGCCCTGGC CAAATTCCGG CGGTTCATTG CCGCCCAGGG CGGCGACCCG GCGGTAGTCG ACCGGCCGGA ACTTCTCCCC CGCGCCACGG ATCAGGTTAC CATTGCCGCC CTAAGCAGCG GCTACATCAG CGCCGTCCAG GCACGCCTGG TGGGCGAGGC GGCTATGCTC CTGGGGGCCG GGCGAATAAC CAAAGAAAGC CCCATCGACC TGGCGGTTGG TATCGAACTA AAAAAACGTC TGGGAGATTA TGTTAACGCC GGCGAGCCCC TGGCTGTATT CCACGTCAAC GACCGGGCCA ACCTGGAGGC AGCCCGGGAG AGATTCCTGG CGGCCTATAT TCTGGCCGCC GCACCGCCCA CCCCGCAACC CCTGGTGTAT GAGATAATCA GGGGATAA
|
Protein sequence | MQMLDLIRRK REGQALAPAE IEAMIRDYTA GIIPDYQMAA FLMAVYFRGL DREETAALTR AMIASGEQIE WSSIPGVKVD KHSTGGVADT TTLVLAPLVA AAGVPVVKMS GRGLGHTGGT IDKLESIPGF RVQLTREEMI RQVKEIGLAV TAPTGKLAPA DGKLYALRDV TATVESIPLI ASSVMSKKIA AGADAIVLDV KVGSGAFMPD LESARELARI MVDLGREMGR RVVAVITNMD EPLGMMVGNA LEVGEAIAVL SGGGPRELRE VCLTLGSQML LLAGATGSDG EARRRLEELL AGGAALAKFR RFIAAQGGDP AVVDRPELLP RATDQVTIAA LSSGYISAVQ ARLVGEAAML LGAGRITKES PIDLAVGIEL KKRLGDYVNA GEPLAVFHVN DRANLEAARE RFLAAYILAA APPTPQPLVY EIIRG
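The sequences above are printed in space-separated blocks of ten characters, so the spaces have to be removed before any computation. The following sketch shows one way to do that and to recompute a GC percentage for comparison with the GC content field. The helper names `strip_blocks`, `gc_percent`, and `looks_like_cds` are hypothetical. The stop codon set is an assumption based on translation table 11 (TAA, TAG, TGA).

```python
def strip_blocks(seq: str) -> str:
    """Remove the whitespace that groups the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = strip_blocks(seq)
    if not s:
        raise ValueError("empty sequence")
    gc = sum(1 for base in s if base in "GC")
    return 100.0 * gc / len(s)


def looks_like_cds(seq: str, stop_codons=("TAA", "TAG", "TGA")) -> bool:
    """Rough sanity check: length is a multiple of 3 and the last codon is a stop.

    The default stop codons are those assumed for translation table 11.
    """
    s = strip_blocks(seq)
    return len(s) >= 3 and len(s) % 3 == 0 and s[-3:] in stop_codons


# Usage: paste the full "Gene sequence" field into gene_seq, then compare
# gc_percent(gene_seq) with the GC content field and len(strip_blocks(gene_seq))
# with the Gene Length field.
gene_seq = "ATGCAGATGC TTGACTTAAT"  # first two blocks only, for illustration
print(len(strip_blocks(gene_seq)))
print(gc_percent(gene_seq))
```

The same `strip_blocks` helper applies to the protein sequence, whose cleaned length can be compared with the Protein Length field.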