Gene BURPS1106A_A2662 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2662 
SymboldeoA 
ID4904913 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2601559 
End bp2602881 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content74% 
IMG OID640145765 
Productthymidine phosphorylase 
Protein accessionYP_001076692 
Protein GI126455479 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02643] thymidine phosphorylase
[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCTTCC TGCCGCAGGA ATTCATCCGC AAGGTGCGCG ACCGCGCGCC GCTCGACACG 
GCCGACGTCG CGCGTTTCGT CCAAGGCGTG ACGGCGGGCG ACGTGACCGA AGGCCAGATC
GCCGCGTTCG CGATGGCGGT CTATTTCAAC GAGTTGCCGC TGTCCGCGCG CATCGCGCTG
ACGCTCGCGC AGCGCGATTC CGGCGACGTG CTCGACTGGC GCGGCGCGCG CCTGAACGGG
CCGGTGGTCG ACAAGCACTC GACGGGCGGC GTCGGCGATC TGACCTCGCT CGTGATCGGG
CCGATGGTGG CCGCGTGCGG CGGCTACGTG CCGATGATCT CGGGCCGCGG CCTCGGCCAC
ACGGGCGGCA CGCTCGACAA GCTCGAGGCG ATTCCCGGTT ACGATGTCGC GCCGTCCGTC
GACATGCTGC GCCGCGTCGT GCGCGACGCG GGCCTTGCGA TCGTCGGCCA GACCGCGCAG
CTCGCGCCCG CCGACAAGCG GATCTATGCG GTGCGCGACG TGACGGCGAC CGTCGAATCG
ATCTCGCTGA TCACCGCGTC GATCCTGTCG AAGAAGCTCG CGGCGGGCGT CAGCGCGCTC
GCGATGGACG TGAAGGTCGG CTCCGGCGCG TTCATGCCGA GCGCGGAGCA ATCGGCCGAA
CTCGCGCGCA GCATCGTCGA CGTCGGCAAC GGCGCGGGGA TGAGGACGGC CGCGACGCTC
ACCGACATGA ACCAGGCGCT CGCGCCATGC GCGGGCAACG CGATCGAGGT GCGCTGCGCG
ATCGATTTCC TGACGGGCGC GGCGCGCCCC GCGCGGCTCG AAGCGGTCAG CTTCGCGCTC
GCCGCGCAGA TGCTGACGAT GGGCGGGCTT GCCGCGGACG CGCACGATGC GCGCCGCCGG
TTGCGCGCGG TGCTCGAATC GGGCGCGGCC GCGGAGCGGT TCGCGCGGAT GGTCGCGGCG
CTCGGCGGGC CCGCCGATCT GGTCGAGCGG CCCGAGCGGC ATCTGCCGCG CGCGGCTGCC
GCCGCCCCCG TGGCCGCCGC GCGCGCCGGC TGGATCGAGC GGATCGACGC GCGCGCGCTC
GGCCTGGCGG TCGTCGGCCT GGGCGGCGGG CGCGCGAAGA TCGGCGACAC GCTCGATTAC
TCGGTCGGAC TGTCCGCGCT CGCGGAGCTG GGCGAGCGCG TCGAGGCGGG CCAGCCGCTC
GCGACCGTTC ACGCGCGCGA CGCCGATTCG GCCGCGCAGG CGACCGACGC GGTGCGGCGC
GCCTACCGGA TCGGCGCGGA GCCGCCGGCG CAGACGCGCG TCGTTCATGC CGTGATCGAA
TGA
 
Protein sequence
MTFLPQEFIR KVRDRAPLDT ADVARFVQGV TAGDVTEGQI AAFAMAVYFN ELPLSARIAL 
TLAQRDSGDV LDWRGARLNG PVVDKHSTGG VGDLTSLVIG PMVAACGGYV PMISGRGLGH
TGGTLDKLEA IPGYDVAPSV DMLRRVVRDA GLAIVGQTAQ LAPADKRIYA VRDVTATVES
ISLITASILS KKLAAGVSAL AMDVKVGSGA FMPSAEQSAE LARSIVDVGN GAGMRTAATL
TDMNQALAPC AGNAIEVRCA IDFLTGAARP ARLEAVSFAL AAQMLTMGGL AADAHDARRR
LRAVLESGAA AERFARMVAA LGGPADLVER PERHLPRAAA AAPVAAARAG WIERIDARAL
GLAVVGLGGG RAKIGDTLDY SVGLSALAEL GERVEAGQPL ATVHARDADS AAQATDAVRR
AYRIGAEPPA QTRVVHAVIE