Gene BURPS1106A_3573 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_3573 
SymbolpaaN 
ID4901788 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp3480926 
End bp3482632 
Gene Length1707 bp 
Protein Length568 aa 
Translation table11 
GC content66% 
IMG OID640136799 
Productphenylacetic acid degradation protein paaN 
Protein accessionYP_001067804 
Protein GI126455462 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR02288] phenylacetic acid degradation protein paaN 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCATC CTCTGTTCAC GAAGCATGAA GACACGTTGA AGCACGCGCT CTCCACGATC 
GAAACGCGCG GCTACTGGAG CCCGTTCGCC GAGATGCCGA GCCCCAAAGT GTACGGGGAA
AGCGCCAATA CAGACGGCGA AGCAGCATTC AAAGCCCAGT TGGACAAGCC CTTTGAACTC
GACCAACCCG CCTCGGGCGG AACGGTCGGC GCCGAGCGTT CGCCATACGG GTTTGCGCTC
GGCGTCCGCT ACCCGAAGTC GACGCCCGAC GAGCTCATCG CCGCCGCCGC GCAGGCGGAA
TGCGCGTGGC GCAAGGCCGG GCCGACCGCG TGGGCTGGCG TGTGTCTCGA AATTCTCGCC
CGGCTGAATC GCGCGAGCTT CGAGATCGCA TACAGCGTGA TGCACACCAC GGGACAGGCG
TTCATGATGG CGTTCCAGGC GGGCGGCCCG CACGCGCAGG ATCGCGCGCT CGAAGCCGTC
GCCTATGCAT GGCAAGAACT GCAGCGCATT CCCGCCGAAG CGCACTGGGA GAAGCCGCAG
GGCAAGAACC CGCCGCTCGC GATGCGCAAG CGCTACACGA TCGTGCCGCG CGGGACGGGG
CTCGTGCTCG GGTGCTGCAC GTTCCCGACC TGGAACGGCT ATCCCGGTCT GTTCGCCGAT
CTGGCGACCG GCAACACAGT CATCGTCAAG CCGCATCCCG GCGCGATCCT GCCGCTCGCG
ATCACCGTGC GCATCGCGCG CGACGTGCTG CGCGAAGCGG GCTTCGATCC GAACATCGTC
ACGCTGCTCG CGACCGAAGG AAACGACGGC GCACTCGTCC AGGATCTGGC GCGCCGGCCG
GAAATCAAGC TGATCGACTT CACCGGCAGC TCGCAAAACG GCACCTGGCT CGAGCGCAAT
GCGTACCAGG CGCAGGTCTA TACGGAGAAG GCGGGCGTCA ACCAGATCGT GATCGATTCC
GTCGACGACC TGAAAGCCGC CGTCAAGAAC ATCGCGTTCT CGCTTGCGCT CTACTCCGGC
CAGATGTGCA CAGCGCCGCA AAACATCTAT GTGCCGCGTG ACGGCATCCG CACCGCCGAA
GGGCACGTCG GCTTCGACGA CGTCGCGCAG GCGATCGCCG ACGCCGTGCA AAAGCTGACG
GGCGACCCGG CACGCTCGGT CGAACTCATC GGGGCGCTGC AGAACGAAGG CGTCGCGGCA
CGTATCGACG AAGCGCGCAA GCTCGGCCGC ATTCTCGCCG ACAGCCAGGC GCTCGAGCAC
CCGGCATTCA AGGACGCGCG CGTGCGCACG CCGCTCGTGC TGCAACTCGA CGTCGCGGAC
CGTGCGAAGT ACACGCAGGA ATGGTTCGGT CCGATCTCGT TCGTCATCGC GACCGATTCG
ACTGCGCAAT CGCTCGATCT CGCCAGCTCG ATCGCGGCCG AGCATGGCGC GCTCACGCTG
TCCGTCTATA GCACGGACGA CGCCGTCGTC GAAGCGGCGC ACGAAGCGGC GGTGCGCGGC
GGCGTCGCGC TGTCGATCAA TCTGACGGGC GGCGTGTTCG TCAATCAGTC GGCGGCGTTC
TCCGACTTTC ACGGCACGGG CGCCAATCCG GCCGCGAATG CGTCGCTCGC CGACGCCGCG
TTCGTCGCGA ACCGCTTCCG CGTCGTTCAG AGCCGCCACC ATGTTGCGCC GAAGGCGGCT
CCCGCGGAAG CCGGCCAAAC GGCATAA
 
Protein sequence
MTHPLFTKHE DTLKHALSTI ETRGYWSPFA EMPSPKVYGE SANTDGEAAF KAQLDKPFEL 
DQPASGGTVG AERSPYGFAL GVRYPKSTPD ELIAAAAQAE CAWRKAGPTA WAGVCLEILA
RLNRASFEIA YSVMHTTGQA FMMAFQAGGP HAQDRALEAV AYAWQELQRI PAEAHWEKPQ
GKNPPLAMRK RYTIVPRGTG LVLGCCTFPT WNGYPGLFAD LATGNTVIVK PHPGAILPLA
ITVRIARDVL REAGFDPNIV TLLATEGNDG ALVQDLARRP EIKLIDFTGS SQNGTWLERN
AYQAQVYTEK AGVNQIVIDS VDDLKAAVKN IAFSLALYSG QMCTAPQNIY VPRDGIRTAE
GHVGFDDVAQ AIADAVQKLT GDPARSVELI GALQNEGVAA RIDEARKLGR ILADSQALEH
PAFKDARVRT PLVLQLDVAD RAKYTQEWFG PISFVIATDS TAQSLDLASS IAAEHGALTL
SVYSTDDAVV EAAHEAAVRG GVALSINLTG GVFVNQSAAF SDFHGTGANP AANASLADAA
FVANRFRVVQ SRHHVAPKAA PAEAGQTA