Gene BURPS1710b_A1066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A1066 
SymboldeoA 
ID3692435 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp1331141 
End bp1332463 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content74% 
IMG OID637731320 
Productthymidine phosphorylase 
Protein accessionYP_336224 
Protein GI162210112 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02643] thymidine phosphorylase
[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCTTCC TGCCGCAGGA ATTCATCCGC AAGGTGCGCG ACCGCGCGCC GCTCGACACG 
GCCGACGTCG CGCGTTTCGT CCAAGGCGTG ACGGCGGGCG ACGTGACCGA AGGCCAGATC
GCCGCGTTCG CGATGGCGGT CTATTTCAAC GAGTTGCCGC TGTCCGCGCG CATCGCGCTG
ACGCTCGCGC AGCGCGATTC CGGCGACGTG CTCGACTGGC GCGGCGCGCG CCTGAACGGG
CCGGTGGTCG ACAAGCACTC GACGGGCGGC GTCGGCGATC TGACCTCGCT CGTGATCGGG
CCGATGGTGG CCGCGTGCGG CGGCTACGTG CCGATGATCT CGGGCCGCGG CCTCGGCCAC
ACGGGCGGCA CGCTCGACAA GCTCGAGGCG ATTCCCGGTT ACGATGTCGC GCCGTCCGTC
GACATGCTGC GCCGCGTCGT GCGCGACGCG GGCCTTGCGA TCGTCGGCCA GACCGCGCAG
CTCGCGCCCG CCGACAAGCG GATCTATGCG GTGCGCGACG TGACGGCGAC CGTCGAATCG
ATCTCGCTGA TCACCGCGTC GATCCTGTCG AAGAAGCTCG CGGCGGGCGT CGGCGCGCTC
GCGATGGACG TGAAGGTCGG CTCCGGCGCG TTCATGCCGA GCGCGGAGCA ATCGGCCGAA
CTCGCGCGCA GCATCGTCGA CGTCGGCAAC GGCGCGGGGA TGAGGACGGC CGCGACGCTC
ACCGACATGA ACCAGGCGCT CGCGCCATGC GCGGGCAACG CGATCGAGGT GCGCTGCGCG
ATCGATTTCC TGACGGGCGC GGCGCGCCCC GCGCGGCTCG AAGCGGTCAG CTTCGCGCTC
GCCGCGCAGA TGCTGACGAT GGGCGGGCTT GCCGCGGACG CGCACGATGC GCGCCGCCGG
TTGCGCGCGG TGCTCGAATC GGGCGCGGCC GCGGAGCGGT TCGCGCGGAT GGTCGCGGCG
CTCGGCGGGC CCGCCGATCT GGTCGAGCGG CCCGAGCGGC ATCTGCCGCG CGCGGCTGCC
GCCGCCCCCG TGGCCGCCGC GCGCGCCGGC TGGATCGAGC GGATCGACGC GCGCGCGCTC
GGCCTGGCGG TCGTCGGCCT GGGCGGCGGG CGCGCGAAGA TCGGCGACAC GCTCGATTAC
TCGGTCGGAC TGTCCGCGCT CGCGGAGCTG GGCGAGCGCG TCGAGGCGGG CCAGCCGCTC
GCGACCGTTC ACGCGCGCGA CGCCGATTCG GCCGCGCAGG CGACCGACGC GGTGCGGCGC
GCCTACCGGA TCGGCGCGGA GCCGCCGGCG CAGACGCGCG TCGTTCATGC CGTGATCGAA
TGA
 
Protein sequence
MTFLPQEFIR KVRDRAPLDT ADVARFVQGV TAGDVTEGQI AAFAMAVYFN ELPLSARIAL 
TLAQRDSGDV LDWRGARLNG PVVDKHSTGG VGDLTSLVIG PMVAACGGYV PMISGRGLGH
TGGTLDKLEA IPGYDVAPSV DMLRRVVRDA GLAIVGQTAQ LAPADKRIYA VRDVTATVES
ISLITASILS KKLAAGVGAL AMDVKVGSGA FMPSAEQSAE LARSIVDVGN GAGMRTAATL
TDMNQALAPC AGNAIEVRCA IDFLTGAARP ARLEAVSFAL AAQMLTMGGL AADAHDARRR
LRAVLESGAA AERFARMVAA LGGPADLVER PERHLPRAAA AAPVAAARAG WIERIDARAL
GLAVVGLGGG RAKIGDTLDY SVGLSALAEL GERVEAGQPL ATVHARDADS AAQATDAVRR
AYRIGAEPPA QTRVVHAVIE