Gene Information |
Locus tag | Mvan_1591 |
Symbol | deoA |
ID | 4648632 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | + |
Start bp | 1683759 |
End bp | 1685060 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639805086 |
Product | thymidine phosphorylase |
Protein accession | YP_952426 |
Protein GI | 120402597 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02644] pyrimidine-nucleoside phosphorylase |
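
The length fields above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon. Both assumptions are consistent with the values in this record, but the record itself does not state them. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(1683759, 1685060)
print(length)                     # 1302
print(protein_length_aa(length))  # 433
```

Both results agree with the Gene Length (1302 bp) and Protein Length (433 aa) fields.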
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.038951 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.387396 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCCAGT TCACATTCGA CATGCCGTCG ATCATCCGGA CCAAACGTGA CGGCGGAGTC CTCTCCGACG ACGCGATCGA CTGGGTGATC GACGCCTACA CCCACGGCCG CGTCGCCGAG GCGCAGATGT CGGCGCTGCT GATGGCGATC TTTCTGCGCG GCATGACCAA CGGCGAGATC GCCCGGTGGA CCGCCGCGAT GGTCGACTCG GGGGAGCGCC TGGACTTCTC GGATCTTCGT CGTGACGGAA AGCCGCTGGC CCTGGTTGAC AAGCATTCCA CCGGAGGGGT CGGCGACAAG ATCACCATCC CGCTGGTGTC CGTCGTGATG GCCTGCGGCG GAGCGGTTCC GCAGGCGGCC GGACGCGGAC TCGGCCACAC CGGCGGCACC CTGGACAAAT TGGAGTCGAT CCCCGGATTC ACCGCCGAGA TCTCCAAAAC CCAGATCCGC CAACAACTCT GCGAGCTCGG CGCCGCGATC TTCGCGGCCG GCGAGCTGGC GCCCGCGGAC CGCAAGATCT ACGCGCTGCG CGACGTGACC GCCACCACGG AATCGCTGCC GCTGATCGCG AGCTCGGTGA TGAGCAAGAA GATCGCCGAG GGCACCCGCG CGCTGGTGCT CGACACCAAG GTCGGCTCCG GCGCCTTCCT CAAGACCGAA GCGGAATCCC GGGAATTGGC CCGCACCATG GTCGAGCTGG GCACCGCGCA CGGCGTGCGC ACCCGGGCCC TGCTGACCGA CATGAACACC CCGCTGGGAC GCACGGTCGG CAACGCCGTC GAGGTCGCCG AATCGCTCGA GGTGCTCGCC GGCGGCGGCC CCGACGACGT CGTCGAGCTC ACGCTGGCGC TGGCGCGGGA GATGTGCGAC GCCGCGGGCC TGGACGGCGT CGACCCGGCC GAGACGTTGC GCGACGGGAC GGCGATGGAC CGGTTCCGGG CTCTGGTCGC CGCGCAGGGC GGGGACCCGG ACGCGGCCTT GCCGCTGGGT GCGCATTCCG AGACCGTGAG CGCCCCGCGC GGTGGCACGA TGGGGGACAT CGACGCGATG GCGGTGGGAC TGGCGGTGTG GCGGCTCGGA GCGGGCCGCT CGGCGCCGGG TGAGCAGGTG CAGTTCGGCG CCGGGATGCG CATCCACCGC AGGCCGGGTG AGCCCGTCGC GGCCGGCGAG CCGCTGTTCA CCCTCTACAC CGACACCCCG GAACGGCTTG CCGGCGCGGT GTCCGAACTC GACGGGGCAT GGAGCGTCGG CGACGAGCCG CCGGCCAGGC GTCCACTGAT CATCGATCGG ATCACCGGGT AG
|
Protein sequence | MTQFTFDMPS IIRTKRDGGV LSDDAIDWVI DAYTHGRVAE AQMSALLMAI FLRGMTNGEI ARWTAAMVDS GERLDFSDLR RDGKPLALVD KHSTGGVGDK ITIPLVSVVM ACGGAVPQAA GRGLGHTGGT LDKLESIPGF TAEISKTQIR QQLCELGAAI FAAGELAPAD RKIYALRDVT ATTESLPLIA SSVMSKKIAE GTRALVLDTK VGSGAFLKTE AESRELARTM VELGTAHGVR TRALLTDMNT PLGRTVGNAV EVAESLEVLA GGGPDDVVEL TLALAREMCD AAGLDGVDPA ETLRDGTAMD RFRALVAAQG GDPDAALPLG AHSETVSAPR GGTMGDIDAM AVGLAVWRLG AGRSAPGEQV QFGAGMRIHR RPGEPVAAGE PLFTLYTDTP ERLAGAVSEL DGAWSVGDEP PARRPLIIDR ITG
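
The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11, GTG can serve as an initiation codon, and an initiation codon is read as methionine. The sketch below illustrates this on the first 30 bases of the gene sequence. It is a minimal illustration, not the database's own pipeline: the helper name `translate_cds` is hypothetical, and `START_CODONS` lists only a subset of the initiation codons permitted by table 11.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
# Sense-codon assignments in table 11 match the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of table 11 initiation codons, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a recognized first codon as M and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()  # remove the 10-base spacing used in the record
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends translation
        residues.append(aa)
    return "".join(residues)


print(translate_cds("GTGACCCAGT TCACATTCGA CATGCCGTCG"))  # MTQFTFDMPS
```

The output matches the first ten residues of the protein sequence above.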