Gene Mjls_1258 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_1258 
SymboldeoA 
ID4876994 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp1335034 
End bp1336353 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content72% 
IMG OID640138566 
Productthymidine phosphorylase 
Protein accessionYP_001069551 
Protein GI126433860 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGGACT TCACTTTCGA TGCTCCGACC GTCATCCGGG TCAAACGCGA CGGCGGCGTC 
CTCCCCGACG AGGCGATCGA CTGGGTGATC GACGCGTACA CCCGCGGGCA GGTGGCCGAC
GAGCAGATGT CGGCCCTGCT GATGGCGATC TTCCTGCGCG GGATGACCGG CCCCGAGATC
GCGCGCTGGA CCGCGGCGAT GGTGGCCTCG GGGCAGCGGT TCGACTTCAC CGATCTGCGA
CGCGGCGGCC GTCCGCTGGC GCTGGTCGAC AAACACTCCA CCGGTGGGGT CGGCGACAAG
ATCACCATCC CGCTGGTGCC TGTCGTGATG GCCTGCGGCG GTGCGGTGCC CCAGGCCGCC
GGACGCGGGC TCGGCCACAC CGGCGGCACC CTCGACAAAC TCGAAGCCAT CCCCGGATTC
ACCGCCGAAC TGACCAAAAG CCAGATCCGC CAACAACTCA GCGAGATCGG TGCGGCGATC
TTCGCCGCGG GTGAGCTGGC CCCGGCCGAC CGCAAGATCT ACGCGCTGCG CGACGTCACC
GCCACCACCG AATCGCTGCC GCTGATCGCC AGCTCGGTGA TGAGCAAGAA GATCGCCGAG
GGCGCCCGCG CACTGGTGCT CGACACGAAG GTCGGTTCGG GCGCCTTCCT GCCCACCGAG
GCCGAGGCCC GCGAACTGGC CCGCACGATG GTCGAGTTGG GTCACGCGCA CGGTCTGGTG
ACGCGCGCCC TGCTGACCGA CATGTCGGTG CCGCTGGGCC GCGCCGTCGG CAACGCGGTC
GAGGTCGTCG AATCCCTGGA GGTGCTCGCC GGCGGCGGGC CCGACGACGT GGTGGAACTG
ACGCTGGCAC TGGCGGCCGA GATGCTCGAC GCCGCCGGGA TCGACGGCAC CGACCCCGCC
GAGACGCTGC GCGACGGTAC CGCCATGGAC TGTTTCCGCG CGCTCGTCGC GGCCCAGGGC
GGCGACACCA CCCGATTGGC CGCCGACGCG TTGCCCATCG GTGTCCACAC CGACACCGTC
ACGGCACCGC GCGGTGGCAC CATGGGTGAC ATCGACGCGA TGGCGGTGGG TCTGGCGGTG
TGGCGGCTCG GAGCGGGCCG CTCGGCGCCC GGTGAGCAGG TGCAGTTCGG CGCCGGCCTC
CGCATCCACC GCCGTCCCGG TGAGCCGGTG AGTGCGGGCG AGCCGCTGTT CACCCTCTAC
ACCGACACCC CCGACCGGCT CGGGCCGGCC CGCGCCGAAC TCGAGGGTGC CTGGACGGTG
GGGGACAGCG CCCCGCCGGC GCGTCCGCTG ATCATCGATC GGATCACCGC GACAGGCTGA
 
Protein sequence
MTDFTFDAPT VIRVKRDGGV LPDEAIDWVI DAYTRGQVAD EQMSALLMAI FLRGMTGPEI 
ARWTAAMVAS GQRFDFTDLR RGGRPLALVD KHSTGGVGDK ITIPLVPVVM ACGGAVPQAA
GRGLGHTGGT LDKLEAIPGF TAELTKSQIR QQLSEIGAAI FAAGELAPAD RKIYALRDVT
ATTESLPLIA SSVMSKKIAE GARALVLDTK VGSGAFLPTE AEARELARTM VELGHAHGLV
TRALLTDMSV PLGRAVGNAV EVVESLEVLA GGGPDDVVEL TLALAAEMLD AAGIDGTDPA
ETLRDGTAMD CFRALVAAQG GDTTRLAADA LPIGVHTDTV TAPRGGTMGD IDAMAVGLAV
WRLGAGRSAP GEQVQFGAGL RIHRRPGEPV SAGEPLFTLY TDTPDRLGPA RAELEGAWTV
GDSAPPARPL IIDRITATG