Gene BTH_I2900 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_I2900 
SymbolpaaN 
ID3847097 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007651 
Strand
Start bp3334776 
End bp3336482 
Gene Length1707 bp 
Protein Length568 aa 
Translation table11 
GC content66% 
IMG OID637842568 
Productphenylacetic acid degradation protein paaN 
Protein accessionYP_443407 
Protein GI83718561 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR02288] phenylacetic acid degradation protein paaN 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000137012 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCATC CTCTGTTCAC GAAGCATGAA GACACGTTGA AGCACGCGCT CTCCACGATC 
GAAACGCGCG GCTACTGGAG CCCGTTCGCC GAGATGCCGA GCCCCAAAGT GTACGGGGAA
AGCGCCAACG CAGACGGCGA AGCAGCATTC AAAGCCCAAC TCTACAAACC CTTCGAACTC
GATCAGCCCG CCTCTGGCGG AACGGTCGGC GCCGAGCGCT CGCCGTACGG ATTTGCACTC
GGCATCCGCT ACCCGAAATC TTCGCCCGAC GAGTTGATCG CCGCTGCCGC CCAGGCACAG
ACCGCATGGC GCAAGGCCGG ACCGTCCGCA TGGGTCGGTG TGAGCCTTGA AATCCTCGCC
CGGCTGAATC GCGCGAGCTT CGAAATCGCA TACAGCGTGA TGCACACCAC CGGACAGGCG
TTCATGATGG CTTTCCAGGC GGGCGGGCCG CATGCGCAGG ACCGCGCGCT TGAAGCCGTC
GCCTACGCGT GGCACGAACT GCAGCGCATC CCCGCCGACG CCCATTGGGA AAAGCCGCAG
GGCAAGAATC CGCCGCTCGC GATGCACAAG CGCTACACGG TCGTGCCGCG CGGCACGGGG
CTCGTGCTCG GTTGCTGCAC GTTCCCGACC TGGAACGGCT ATCCGGGCCT GTTTGCCGAT
CTGGCAACCG GCAACGCGGT CATCGTCAAG CCGCATCCCG GCGCGATCCT GCCACTCGCG
ATCACCGTGC GCATCGCGCG CGCCGTGCTG CGTGAAGCGG GCTTCGATCC GAACGTCGTC
ACACTGCTCG CGACCGAAGG GAGCGACGGC GCGCTCGTCC AAGAGCTGGC GCTCCGCCCG
GAGATCAAGC TGATCGACTT CACCGGCAGC TCGCAAAACG GCAACTGGCT CGAACGCAAC
GCGCACCAGG CACAGGTGTA TACGGAGAAG GCGGGCGTCA ACCAGATCGT GATCGATTCC
ATCGACGATC TGAAAGCCGC CGTCAAGAAC ATCGCGTTCT CGCTCGCGCT CTACTCCGGC
CAGATGTGCA CCGCGCCGCA AAACATCTAT GTTCCGCGCG ACGGCATCCG CACCGCGGAA
GGGCACGTCA GTTTCGACGA CGTTGCACTG GCAATCGCCG GCGCCGTGCA GAAGCTGACG
GGCGACCCGG CGCGCTCGGT CGAGCTCATC GGCGCGCTCC AGAACGAAGG CATCGCGGCC
CGCATCGACG GCGCGCGCAA ACTCGGTCGC GTGCTCGCCG ACAGTCAGGC GCTCGAGCAT
CCGGCATTCA AGGACGCACG CGTGCGTACG CCGCTCGTGG TGCAGCTCGA CGTCGCGGAT
CGCGCGAAGT ACACGCAGGA GTGGTTTGGT CCGATCTCGT TCGTCATCGC GACCGACTCG
ACTGCGCAAT CGCTCGATCT CGCCGGCTCG ATCGCGGCCG AGCACGGCGC GCTGACGCTG
TCCGTCTATA GCACGGACGA TGCCGTCGTC GAAGCGGCGC ACGATGCGGC GATACGAGGC
GGCGTCGCGC TGTCGATCAA TCTGACGGGT GGCGTGTTCG TCAATCAGTC GGCGGCGTTC
TCCGACTTTC ACGGCACGGG CGCCAATCCG GCCGCGAACG CGTCGCTCGC CGATGCAGCG
TTCGTCGCGA ATCGCTTCCG CGTCGTACAG AGCCGGCACC ATGTTGCGCC GAAGGCCGTT
CCCGCGGAAG CTGGCCAAAC GGCATAA
 
Protein sequence
MTHPLFTKHE DTLKHALSTI ETRGYWSPFA EMPSPKVYGE SANADGEAAF KAQLYKPFEL 
DQPASGGTVG AERSPYGFAL GIRYPKSSPD ELIAAAAQAQ TAWRKAGPSA WVGVSLEILA
RLNRASFEIA YSVMHTTGQA FMMAFQAGGP HAQDRALEAV AYAWHELQRI PADAHWEKPQ
GKNPPLAMHK RYTVVPRGTG LVLGCCTFPT WNGYPGLFAD LATGNAVIVK PHPGAILPLA
ITVRIARAVL REAGFDPNVV TLLATEGSDG ALVQELALRP EIKLIDFTGS SQNGNWLERN
AHQAQVYTEK AGVNQIVIDS IDDLKAAVKN IAFSLALYSG QMCTAPQNIY VPRDGIRTAE
GHVSFDDVAL AIAGAVQKLT GDPARSVELI GALQNEGIAA RIDGARKLGR VLADSQALEH
PAFKDARVRT PLVVQLDVAD RAKYTQEWFG PISFVIATDS TAQSLDLAGS IAAEHGALTL
SVYSTDDAVV EAAHDAAIRG GVALSINLTG GVFVNQSAAF SDFHGTGANP AANASLADAA
FVANRFRVVQ SRHHVAPKAV PAEAGQTA