Gene Mvan_1591 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_1591 
SymboldeoA 
ID4648632 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp1683759 
End bp1685060 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content71% 
IMG OID639805086 
Productthymidine phosphorylase 
Protein accessionYP_952426 
Protein GI120402597 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.038951 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.387396 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCCAGT TCACATTCGA CATGCCGTCG ATCATCCGGA CCAAACGTGA CGGCGGAGTC 
CTCTCCGACG ACGCGATCGA CTGGGTGATC GACGCCTACA CCCACGGCCG CGTCGCCGAG
GCGCAGATGT CGGCGCTGCT GATGGCGATC TTTCTGCGCG GCATGACCAA CGGCGAGATC
GCCCGGTGGA CCGCCGCGAT GGTCGACTCG GGGGAGCGCC TGGACTTCTC GGATCTTCGT
CGTGACGGAA AGCCGCTGGC CCTGGTTGAC AAGCATTCCA CCGGAGGGGT CGGCGACAAG
ATCACCATCC CGCTGGTGTC CGTCGTGATG GCCTGCGGCG GAGCGGTTCC GCAGGCGGCC
GGACGCGGAC TCGGCCACAC CGGCGGCACC CTGGACAAAT TGGAGTCGAT CCCCGGATTC
ACCGCCGAGA TCTCCAAAAC CCAGATCCGC CAACAACTCT GCGAGCTCGG CGCCGCGATC
TTCGCGGCCG GCGAGCTGGC GCCCGCGGAC CGCAAGATCT ACGCGCTGCG CGACGTGACC
GCCACCACGG AATCGCTGCC GCTGATCGCG AGCTCGGTGA TGAGCAAGAA GATCGCCGAG
GGCACCCGCG CGCTGGTGCT CGACACCAAG GTCGGCTCCG GCGCCTTCCT CAAGACCGAA
GCGGAATCCC GGGAATTGGC CCGCACCATG GTCGAGCTGG GCACCGCGCA CGGCGTGCGC
ACCCGGGCCC TGCTGACCGA CATGAACACC CCGCTGGGAC GCACGGTCGG CAACGCCGTC
GAGGTCGCCG AATCGCTCGA GGTGCTCGCC GGCGGCGGCC CCGACGACGT CGTCGAGCTC
ACGCTGGCGC TGGCGCGGGA GATGTGCGAC GCCGCGGGCC TGGACGGCGT CGACCCGGCC
GAGACGTTGC GCGACGGGAC GGCGATGGAC CGGTTCCGGG CTCTGGTCGC CGCGCAGGGC
GGGGACCCGG ACGCGGCCTT GCCGCTGGGT GCGCATTCCG AGACCGTGAG CGCCCCGCGC
GGTGGCACGA TGGGGGACAT CGACGCGATG GCGGTGGGAC TGGCGGTGTG GCGGCTCGGA
GCGGGCCGCT CGGCGCCGGG TGAGCAGGTG CAGTTCGGCG CCGGGATGCG CATCCACCGC
AGGCCGGGTG AGCCCGTCGC GGCCGGCGAG CCGCTGTTCA CCCTCTACAC CGACACCCCG
GAACGGCTTG CCGGCGCGGT GTCCGAACTC GACGGGGCAT GGAGCGTCGG CGACGAGCCG
CCGGCCAGGC GTCCACTGAT CATCGATCGG ATCACCGGGT AG
 
Protein sequence
MTQFTFDMPS IIRTKRDGGV LSDDAIDWVI DAYTHGRVAE AQMSALLMAI FLRGMTNGEI 
ARWTAAMVDS GERLDFSDLR RDGKPLALVD KHSTGGVGDK ITIPLVSVVM ACGGAVPQAA
GRGLGHTGGT LDKLESIPGF TAEISKTQIR QQLCELGAAI FAAGELAPAD RKIYALRDVT
ATTESLPLIA SSVMSKKIAE GTRALVLDTK VGSGAFLKTE AESRELARTM VELGTAHGVR
TRALLTDMNT PLGRTVGNAV EVAESLEVLA GGGPDDVVEL TLALAREMCD AAGLDGVDPA
ETLRDGTAMD RFRALVAAQG GDPDAALPLG AHSETVSAPR GGTMGDIDAM AVGLAVWRLG
AGRSAPGEQV QFGAGMRIHR RPGEPVAAGE PLFTLYTDTP ERLAGAVSEL DGAWSVGDEP
PARRPLIIDR ITG