Gene BURPS1710b_3564 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_3564 
SymbolpaaN 
ID3690045 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp3888530 
End bp3890236 
Gene Length1707 bp 
Protein Length568 aa 
Translation table11 
GC content66% 
IMG OID637730019 
Productphenylacetic acid degradation protein paaN 
Protein accessionYP_334929 
Protein GI76809987 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR02288] phenylacetic acid degradation protein paaN 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000332176 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCATC CTCTGTTCAC GAAGCATGAA GACACGTTGA AGCACGCGCT CTCCACGATC 
GAAACGCGCG GCTACTGGAG CCCGTTCGCC GAGATGCCGA GCCCCAAAGT GTACGGGGAA
AGCGCCAATA CAGACGGCGA AGCAGCATTC AAAACACAGT TGGACAAGCC CTTTGAACTC
GACCAACCCG CCTCGGGCGG AACGGTCGGC GCCGAGCGTT CGCCATACGG GTTTGCGCTC
GGCGTCCGCT ACCCGAAGTC GACGCCCGAC GAGCTCATCG CCGCCGCCGC GCAGGCGGAA
TGCGCGTGGC GCAAAGCCGG GCCGACCGCG TGGGCTGGCG TGTGTCTCGA AATTCTCGCC
CGGCTGAATC GCGCGAGCTT CGAGATCGCA TACAGCGTGA TGCACACCAC GGGACAGGCG
TTCATGATGG CGTTCCAGGC GGGCGGCCCG CACGCGCAGG ATCGCGCGCT CGAAGCCGTC
GCCTATGCAT GGCAAGAACT GCAGCGCATT CCCGCCGAAG CGCACTGGGA GAAGCCGCAG
GGCAAGAACC CGCCGCTCGC GATGCGCAAG CGCTACACGA TCGTGCCGCG CGGGACGGGG
CTCGTGCTCG GGTGCTGCAC GTTCCCGACC TGGAACGGCT ATCCCGGTCT GTTCGCCGAT
CTGGCGACCG GCAACACAGT CATCGTCAAG CCGCATCCCG GCGCGATCCT GCCGCTCGCG
ATCACCGTGC GCATCGCGCG CGACGTGCTG CGCGAAGCGG GCTTCGATCC GAACATCGTC
ACGCTGCTCG CGACCGAAGG AAACGACGGC GCACTCGTCC AGGATCTGGC GCGCCGGCCG
GAAATCAAGC TGATCGACTT CACCGGCAGC TCGCAAAACG GCACCTGGCT CGAGCGCAAT
GCGTACCAGG CGCAGGTCTA TACGGAGAAG GCGGGCGTCA ACCAGATCGT GATCGATTCC
GTCGACGACC TGAAAGCCGC CGTCAAGAAC ATCGCGTTCT CGCTTGCGCT CTACTCCGGC
CAGATGTGCA CAGCGCCGCA AAACATCTAT GTGCCGCGTG ACGGCATCCG CACCGCCGAA
GGGCACGTCG GCTTCGACGA CGTCGCGCAG GCGATCGCCG ACGCCGTGCA AAAGCTGACG
GGCGACCCGG CACGCTCGGT CGAACTCATC GGGGCGCTGC AGAACGAAGG CGTCGCGGCA
CGTATCGACG AAGCGCGCAA GCTCGGCCGC ATTCTCGCCG ACAGCCAGGC GCTCGAGCAC
CCGGCATTCA AGGACGCGCG CGTGCGCACG CCGCTCGTGC TGCAACTCGA CGTCGCGGAC
CGTGCGAAGT ACACGCAGGA ATGGTTCGGT CCGATCTCGT TCGTCATCGC GACCGATTCG
ACTGCGCAAT CGCTCGATCT CGCCGGCTCG ATCGCGGCCG AGCATGGCGC GCTCACGCTG
TCCGTCTATA GCACGGACGA CGCCGTCGTC GAAGCGGCGC ACGAAGCGGC GGTGCGCGGC
GGCGTCGCGC TGTCGATCAA TCTGACGGGC GGCGTGTTCG TCAATCAGTC GGCGGCGTTC
TCCGACTTTC ACGGCACGGG CGCCAATCCG GCCGCGAATG CGTCGCTCGC CGACGCCGCG
TTCGTCGCGA ACCGCTTCCG CGTCGTTCAG AGCCGCCACC ATGTTGCGCC GAAGGCGGCT
CCCGCGGAAG CCGGCCAAAC GGCATAA
 
Protein sequence
MTHPLFTKHE DTLKHALSTI ETRGYWSPFA EMPSPKVYGE SANTDGEAAF KTQLDKPFEL 
DQPASGGTVG AERSPYGFAL GVRYPKSTPD ELIAAAAQAE CAWRKAGPTA WAGVCLEILA
RLNRASFEIA YSVMHTTGQA FMMAFQAGGP HAQDRALEAV AYAWQELQRI PAEAHWEKPQ
GKNPPLAMRK RYTIVPRGTG LVLGCCTFPT WNGYPGLFAD LATGNTVIVK PHPGAILPLA
ITVRIARDVL REAGFDPNIV TLLATEGNDG ALVQDLARRP EIKLIDFTGS SQNGTWLERN
AYQAQVYTEK AGVNQIVIDS VDDLKAAVKN IAFSLALYSG QMCTAPQNIY VPRDGIRTAE
GHVGFDDVAQ AIADAVQKLT GDPARSVELI GALQNEGVAA RIDEARKLGR ILADSQALEH
PAFKDARVRT PLVLQLDVAD RAKYTQEWFG PISFVIATDS TAQSLDLAGS IAAEHGALTL
SVYSTDDAVV EAAHEAAVRG GVALSINLTG GVFVNQSAAF SDFHGTGANP AANASLADAA
FVANRFRVVQ SRHHVAPKAA PAEAGQTA