Gene BURPS668_A2806 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A2806 
SymboldeoA 
ID4886924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp2674568 
End bp2675890 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content74% 
IMG OID640132742 
Productthymidine phosphorylase 
Protein accessionYP_001063798 
Protein GI126444334 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02643] thymidine phosphorylase
[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.529241 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCTTCC TGCCGCAGGA ATTCATCCGC AAGGTGCGCG ACCGCGCGCC GCTCGACACG 
GCCGACGTCG CGCGTTTCGT CCAAGGCGTG ACGGCGGGCG ACGTGACCGA AGGCCAGATC
GCCGCGTTCG CGATGGCGGT CTATTTCAAC GAGCTGCCGC TGTCCGCGCG CATCGCGCTG
ACGCTCGCGC AGCGCGATTC CGGCGATGTG CTCGACTGGC GCGGCGCGCG CCTGAACGGG
CCGGTGGTCG ACAAGCACTC GACGGGCGGC GTCGGCGATC TGACCTCGCT CGTGATCGGG
CCGATGGTGG CCGCGTGCGG CGGCTACGTG CCGATGATCT CGGGCCGCGG CCTCGGCCAC
ACGGGCGGCA CGCTCGACAA GCTCGAGGCG ATTCCCGGTT ACGATGTCGC GCCGTCCGTC
GACATGCTGC GCCGCGTCGT GCGCGACGCG GGCCTTGCGA TCGTCGGCCA GACCGCGCAG
CTCGCGCCCG CCGACAAGCG GATCTATGCG GTGCGCGACG TGACGGCGAC CGTCGAATCG
ATCTCGCTGA TCACCGCGTC GATCCTGTCG AAGAAGCTCG CGGCGGGCGT CGGCGCGCTC
GCGATGGACG TGAAGGTCGG CTCCGGCGCG TTCATGCCGA GCGCGGAGCA ATCGGCCGAA
CTCGCGCGCA GCATCGTCGA CGTCGGCAAC GGCGCGGGGA TGAGGACGGC CGCGACGCTC
ACCGACATGA ACCAGGCGCT CGCGCCATGC GCGGGCAACG CGATCGAGGT GCGCTGCGCG
ATCGATTTCC TGACGGGCGC GGCGCGCCCC GCACGGCTCG AAGCGGTCAG CTTCGCGCTC
GCCGCGCAGA TGCTGACGAT GGGCGGGCTT GCCGCGGACG CGCACGATGC GCGCCGCCGG
TTGCGCGCGG CGCTCGAATC GGGCGCGGCC GCGGAGCGGT TCGCGCGGAT GGTCGCGGCG
CTCGGCGGGC CCGCCGATCT GGTCGAGCGG CCCGAGCGGC ATCTGCCGCG CGCGGCCGCC
GCCGCCCCCG TGGCCGCCGC GCGCGCCGGC TGGATCGAGC GGATCGACGC GCGCGCGCTC
GGCCTGGCGG TCGTCGGCCT GGGCGGCGGG CGCGCGAAGA TCGGCGACAC GCTCGATTAC
TCGGTCGGAC TGTCCGCGCT CGCGGAGCTG GGCGAGCGCG TCGAGGCGGG CCAGCCGCTC
GCGACCGTTC ACGCGCGCGA CGCCGATTCG GCCGCGCAGG CGGCCGACGC GGTGCGGCGC
GCCTACCGGA TCGGCGCGGA GCCGCCGGCG CAGACGCGCG TCGTTCATGC CGTGATCGAA
TGA
 
Protein sequence
MTFLPQEFIR KVRDRAPLDT ADVARFVQGV TAGDVTEGQI AAFAMAVYFN ELPLSARIAL 
TLAQRDSGDV LDWRGARLNG PVVDKHSTGG VGDLTSLVIG PMVAACGGYV PMISGRGLGH
TGGTLDKLEA IPGYDVAPSV DMLRRVVRDA GLAIVGQTAQ LAPADKRIYA VRDVTATVES
ISLITASILS KKLAAGVGAL AMDVKVGSGA FMPSAEQSAE LARSIVDVGN GAGMRTAATL
TDMNQALAPC AGNAIEVRCA IDFLTGAARP ARLEAVSFAL AAQMLTMGGL AADAHDARRR
LRAALESGAA AERFARMVAA LGGPADLVER PERHLPRAAA AAPVAAARAG WIERIDARAL
GLAVVGLGGG RAKIGDTLDY SVGLSALAEL GERVEAGQPL ATVHARDADS AAQAADAVRR
AYRIGAEPPA QTRVVHAVIE