Gene Moth_2275 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2275 
Symbol 
ID3831386 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2382738 
End bp2384045 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content64% 
IMG OID637830195 
Productthymidine phosphorylase 
Protein accessionYP_431105 
Protein GI83591096 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000178657 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000323962 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCAGATGC TTGACTTAAT CCGTCGCAAG CGGGAGGGCC AGGCCCTGGC CCCAGCCGAA 
ATTGAGGCCA TGATACGGGA TTATACGGCC GGGATAATCC CCGACTATCA GATGGCAGCC
TTTCTCATGG CCGTCTATTT TCGCGGCCTG GACCGGGAGG AAACGGCGGC CCTCACCAGG
GCCATGATAG CCTCGGGGGA ACAGATTGAG TGGAGTTCCA TCCCGGGGGT GAAGGTCGAC
AAGCACAGTA CCGGAGGTGT GGCCGACACC ACCACCTTGG TCCTGGCGCC CCTGGTGGCC
GCCGCCGGAG TGCCGGTAGT TAAGATGTCC GGCCGCGGCC TGGGACACAC TGGGGGCACT
ATTGACAAAC TGGAATCCAT CCCCGGCTTC AGGGTGCAGC TGACGCGGGA AGAGATGATT
CGCCAGGTAA AGGAGATCGG CCTGGCCGTC ACTGCTCCCA CGGGGAAGCT GGCCCCGGCT
GACGGCAAGC TCTACGCCCT GCGGGACGTC ACAGCGACTG TTGAGAGCAT ACCCCTCATT
GCCAGCAGTG TAATGAGCAA AAAGATCGCC GCTGGCGCCG ACGCCATAGT CCTCGATGTC
AAGGTTGGCA GCGGCGCCTT TATGCCCGAC CTGGAGTCGG CCCGGGAACT GGCCCGGATC
ATGGTGGATC TGGGCCGGGA GATGGGGCGG CGGGTGGTAG CTGTGATTAC CAATATGGAC
GAACCCCTGG GGATGATGGT GGGCAACGCC CTGGAAGTCG GGGAGGCCAT CGCCGTTTTA
TCCGGCGGCG GGCCGCGGGA GTTGCGGGAG GTTTGCCTCA CCCTGGGCAG CCAGATGCTT
CTACTGGCCG GGGCTACCGG TAGTGACGGT GAGGCGCGCC GGCGTCTGGA GGAGCTCCTG
GCCGGTGGCG CCGCCCTGGC CAAATTCCGG CGGTTCATTG CCGCCCAGGG CGGCGACCCG
GCGGTAGTCG ACCGGCCGGA ACTTCTCCCC CGCGCCACGG ATCAGGTTAC CATTGCCGCC
CTAAGCAGCG GCTACATCAG CGCCGTCCAG GCACGCCTGG TGGGCGAGGC GGCTATGCTC
CTGGGGGCCG GGCGAATAAC CAAAGAAAGC CCCATCGACC TGGCGGTTGG TATCGAACTA
AAAAAACGTC TGGGAGATTA TGTTAACGCC GGCGAGCCCC TGGCTGTATT CCACGTCAAC
GACCGGGCCA ACCTGGAGGC AGCCCGGGAG AGATTCCTGG CGGCCTATAT TCTGGCCGCC
GCACCGCCCA CCCCGCAACC CCTGGTGTAT GAGATAATCA GGGGATAA
 
Protein sequence
MQMLDLIRRK REGQALAPAE IEAMIRDYTA GIIPDYQMAA FLMAVYFRGL DREETAALTR 
AMIASGEQIE WSSIPGVKVD KHSTGGVADT TTLVLAPLVA AAGVPVVKMS GRGLGHTGGT
IDKLESIPGF RVQLTREEMI RQVKEIGLAV TAPTGKLAPA DGKLYALRDV TATVESIPLI
ASSVMSKKIA AGADAIVLDV KVGSGAFMPD LESARELARI MVDLGREMGR RVVAVITNMD
EPLGMMVGNA LEVGEAIAVL SGGGPRELRE VCLTLGSQML LLAGATGSDG EARRRLEELL
AGGAALAKFR RFIAAQGGDP AVVDRPELLP RATDQVTIAA LSSGYISAVQ ARLVGEAAML
LGAGRITKES PIDLAVGIEL KKRLGDYVNA GEPLAVFHVN DRANLEAARE RFLAAYILAA
APPTPQPLVY EIIRG