Gene Mmcs_1232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_1232 
SymboldeoA 
ID4110069 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp1330080 
End bp1331426 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content71% 
IMG OID638030353 
Productthymidine phosphorylase 
Protein accessionYP_638400 
Protein GI108798203 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.32576 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACGGACT TCACTTTCGA CGCGCCCCGG CCGATGTCGC ATCTCGATGC TCCGACCGTC 
ATCCGGGTCA AACGCGACGG CGGCGACCTG CCCGACGAGG CGATCGACTG GGTGATCGAC
GCGTACACCC GCGGGCAGGT GGCCGACGAG CAGATGTCGG CCCTGCTGAT GGCGATCTTC
CTGCGCGGGA TGACCGGCCC CGAGATCGCG CGCTGGACAG CGGCGATGGT GGCATCGGGG
CAGCGGTTCG ACTTCACCGA TCTGCGACGC GGCGGCCGTC CGCTGGCGCT GGTCGACAAA
CACTCCACCG GCGGGGTCGG CGACAAGATC ACCATCCCGC TGGTGCCTGT CGTGATGGCC
TGCGGCGGCG CGGTGCCCCA GGCCGCCGGA CGCGGGCTCG GCCACACCGG CGGCACCCTC
GACAAACTCG AAGCCATCCC CGGATTCACC GCCGAACTGA CCAAAAGCCA GATCCGCCAA
CAACTCAGCG AGATCGGTGC GGCGATCTTC GCCGCGGGTG AGCTGGCCCC GGCCGACCGC
AAGATCTATG CGCTGCGCGA CGTCACCGCC ACCACCGAAT CGCTGCCGCT GATCGCCAGC
TCGGTGATGA GCAAGAAGAT CGCCGAGGGC GCCCGCGCGC TGGTGCTCGA CACGAAGGTC
GGTTCGGGCG CCTTCCTGCC CACCGAGGCC GAGGCCCGCG AACTGGCCCG CACGATGGTC
GAGTTGGGTC ACGCGCACGG TCTGGTGACG CGCGCCCTGC TGACCGACAT GTCGGTGCCG
CTGGGCCGCG CCGTCGGCAA CGCGGTCGAG GTCGTCGAAT CGCTGGAGGT GCTCGCCGGC
GGCGGGCCCG ACGACGTGGT GGAACTGACG CTGGCGTTGG CGGCCGAGAT GCTCGACGCC
GCCGGGATCG ACGGGACCGA CCCCGCCGAG ACGCTGCGCG ACGGTACCGC CATGGACTGT
TTCCGCGCGC TCGTCGCGGC CCAGGGCGGC GACACCTCCC GATTGGCCGC CGACGCGTTG
CCCATCGGTG TCCACACCGA CACCGTCACG GCACCGCGCG GTGGCACCAT GGGTGACATC
GACGCGATGG CGGTGGGTCT GGCGGTGTGG CGGCTCGGAG CGGGCCGCTC GGCGCCCGGT
GAGCAGGTGC AGTTCGGCGC CGGTCTCCGC ATCCACCGCC GTCCCGGTGA GCCGGTGAGT
GCGGGCGAGC CGCTGTTCAC CCTCTACACC GACACCCCCG ACCGGCTCGG GCCGGCCCGA
GCCGAACTCG AGGGTGCCTG GACGGTGGGG GACAGTGCCC CGCCGGCGCG TCCGCTGATC
ATCGACCGGA TCACCGCGAC AGGCTGA
 
Protein sequence
MTDFTFDAPR PMSHLDAPTV IRVKRDGGDL PDEAIDWVID AYTRGQVADE QMSALLMAIF 
LRGMTGPEIA RWTAAMVASG QRFDFTDLRR GGRPLALVDK HSTGGVGDKI TIPLVPVVMA
CGGAVPQAAG RGLGHTGGTL DKLEAIPGFT AELTKSQIRQ QLSEIGAAIF AAGELAPADR
KIYALRDVTA TTESLPLIAS SVMSKKIAEG ARALVLDTKV GSGAFLPTEA EARELARTMV
ELGHAHGLVT RALLTDMSVP LGRAVGNAVE VVESLEVLAG GGPDDVVELT LALAAEMLDA
AGIDGTDPAE TLRDGTAMDC FRALVAAQGG DTSRLAADAL PIGVHTDTVT APRGGTMGDI
DAMAVGLAVW RLGAGRSAPG EQVQFGAGLR IHRRPGEPVS AGEPLFTLYT DTPDRLGPAR
AELEGAWTVG DSAPPARPLI IDRITATG