Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcep18194_C6660 |
Symbol | deoA |
ID | 3733982 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007509 |
Strand | + |
Start bp | 173064 |
End bp | 174380 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637760367 |
Product | thymidine phosphorylase |
Protein accession | YP_366354 |
Protein GI | 78059779 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02643] thymidine phosphorylase [TIGR02644] pyrimidine-nucleoside phosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0180706 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0516478 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTTTAC CGCAGGAATT CATTCGCCAG AAGCGCAACC GGCAGGCACT CGATCGCGAC GGGATCGCCG CGTTCGTGCG CGGCGTGACC GACGGCAGCG TGACCGAGGG CCAGGTGGCC GCGTTCGCGA TGGCCGTGTA TTTCAACGAC CTGAGCACCG ACGAGCGCGT CGCGCTGACG CTCGCGCAGC GCGACTCGGG CGACGTGCTC GACTGGCATG CGCTGGAGCT CGACGGGCCG GTGATCGACA AGCACTCGAC CGGCGGCGTC GGCGATGTCG TGTCGCTGAT GCTCGGGCCG ATGGTGGCCG CGTGCGGCGG CTACGTGCCG ATGATCTCGG GGCGCGGGCT CGGCCACACC GGCGGCACGC TCGACAAGTT GAGCGCGATT CCCGGTTACA ACGTGACGCC CGATACGGAC GCGTTTCGCC GTGCGGTGCG CGACGTCGGC GTCGCGATCA TCGGACAGAC CGCACGGCTG GCGCCGGCCG ACATGCGCAT CTATGCGATT CGCGACGTGA CGGCGACCGT CGAGTCGGTC GCGATGATCA CCGCGTCGAT CCTGTCGAAG AAGCTCGCGG CCGGGCTCGA CGGGCTCGTG ATGGACGTCA AGGTCGGCTC CGGCGCATTC ATGCCGACCG CCGAGCAATC GGCGGAACTG GCCCGCAGCA TCGTCGACGT CGGTAACGGC GCCGGCATGA AGACGACCGC GATCCTGACC GACATGAACC AGTCGCTCGC GCCATGCGCG GGCAATGCAC TCGAAGTGGC GTGCGCGATC GACTACCTGA CGGGCAAGTC GCGTCCGGCC CGCCTGCATG ACGTCACGAT GGCGCTGTCG GCGGAGCTGC TCGTCACCGG CGGGCTGGCG CACGACGTCG CAGACGCGCG CGCGAAGTTG CTGCGCGCGC TCGATTCGGG TGCGGCAGCG GAACGCTTCG CGAGGATGGT CACGGCGCTC GGCGGCCCTG CGGACCTGAT CGACGCACCT GCTCGTCATC TCGCGCGGGC GAAGGTGGTC GTGCCGGTTC CGGCGCGCGC GAGCGGCGTG GTGCAGCGGG TCGATTGCCG CGCGCTCGGG CTGGCGGTCG TCGCGCTCGG CGGTGGCCGC ACGCGTGCGG CGGATGCGAT CGACTACAGT GTCGGCCTGA CGGCGCTCGC GGAGATCGGG CAGCGTGTTG AAGCCGACCA GCCGCTCGGT TACGTGCATG CCCGTGACGC CGCCGCCGCG GCGCATGCGG TGGACACGAT CCAGCGCAGC TACGTGCTGG GTGAGGCGGG TGACGCGCCG CCGACCATTT ATCAGCAGAT CGGCTGA
|
Protein sequence | MFLPQEFIRQ KRNRQALDRD GIAAFVRGVT DGSVTEGQVA AFAMAVYFND LSTDERVALT LAQRDSGDVL DWHALELDGP VIDKHSTGGV GDVVSLMLGP MVAACGGYVP MISGRGLGHT GGTLDKLSAI PGYNVTPDTD AFRRAVRDVG VAIIGQTARL APADMRIYAI RDVTATVESV AMITASILSK KLAAGLDGLV MDVKVGSGAF MPTAEQSAEL ARSIVDVGNG AGMKTTAILT DMNQSLAPCA GNALEVACAI DYLTGKSRPA RLHDVTMALS AELLVTGGLA HDVADARAKL LRALDSGAAA ERFARMVTAL GGPADLIDAP ARHLARAKVV VPVPARASGV VQRVDCRALG LAVVALGGGR TRAADAIDYS VGLTALAEIG QRVEADQPLG YVHARDAAAA AHAVDTIQRS YVLGEAGDAP PTIYQQIG
|
| |