Gene Information |
Locus tag | BTH_I2900 |
Symbol | paaN |
ID | 3847097 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | + |
Start bp | 3334776 |
End bp | 3336482 |
Gene Length | 1707 bp |
Protein Length | 568 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637842568 |
Product | phenylacetic acid degradation protein paaN |
Protein accession | YP_443407 |
Protein GI | 83718561 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR02288] phenylacetic acid degradation protein paaN |
|
|
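The coordinate and length fields above are related by simple arithmetic, and a short check can catch transcription errors. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS with one terminal stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_length // 3 - 1


length = gene_length_bp(3334776, 3336482)
print(length)                     # 1707
print(protein_length_aa(length))  # 568
```

Both values agree with the Gene Length and Protein Length fields in this record.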
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0000137012 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCATC CTCTGTTCAC GAAGCATGAA GACACGTTGA AGCACGCGCT CTCCACGATC GAAACGCGCG GCTACTGGAG CCCGTTCGCC GAGATGCCGA GCCCCAAAGT GTACGGGGAA AGCGCCAACG CAGACGGCGA AGCAGCATTC AAAGCCCAAC TCTACAAACC CTTCGAACTC GATCAGCCCG CCTCTGGCGG AACGGTCGGC GCCGAGCGCT CGCCGTACGG ATTTGCACTC GGCATCCGCT ACCCGAAATC TTCGCCCGAC GAGTTGATCG CCGCTGCCGC CCAGGCACAG ACCGCATGGC GCAAGGCCGG ACCGTCCGCA TGGGTCGGTG TGAGCCTTGA AATCCTCGCC CGGCTGAATC GCGCGAGCTT CGAAATCGCA TACAGCGTGA TGCACACCAC CGGACAGGCG TTCATGATGG CTTTCCAGGC GGGCGGGCCG CATGCGCAGG ACCGCGCGCT TGAAGCCGTC GCCTACGCGT GGCACGAACT GCAGCGCATC CCCGCCGACG CCCATTGGGA AAAGCCGCAG GGCAAGAATC CGCCGCTCGC GATGCACAAG CGCTACACGG TCGTGCCGCG CGGCACGGGG CTCGTGCTCG GTTGCTGCAC GTTCCCGACC TGGAACGGCT ATCCGGGCCT GTTTGCCGAT CTGGCAACCG GCAACGCGGT CATCGTCAAG CCGCATCCCG GCGCGATCCT GCCACTCGCG ATCACCGTGC GCATCGCGCG CGCCGTGCTG CGTGAAGCGG GCTTCGATCC GAACGTCGTC ACACTGCTCG CGACCGAAGG GAGCGACGGC GCGCTCGTCC AAGAGCTGGC GCTCCGCCCG GAGATCAAGC TGATCGACTT CACCGGCAGC TCGCAAAACG GCAACTGGCT CGAACGCAAC GCGCACCAGG CACAGGTGTA TACGGAGAAG GCGGGCGTCA ACCAGATCGT GATCGATTCC ATCGACGATC TGAAAGCCGC CGTCAAGAAC ATCGCGTTCT CGCTCGCGCT CTACTCCGGC CAGATGTGCA CCGCGCCGCA AAACATCTAT GTTCCGCGCG ACGGCATCCG CACCGCGGAA GGGCACGTCA GTTTCGACGA CGTTGCACTG GCAATCGCCG GCGCCGTGCA GAAGCTGACG GGCGACCCGG CGCGCTCGGT CGAGCTCATC GGCGCGCTCC AGAACGAAGG CATCGCGGCC CGCATCGACG GCGCGCGCAA ACTCGGTCGC GTGCTCGCCG ACAGTCAGGC GCTCGAGCAT CCGGCATTCA AGGACGCACG CGTGCGTACG CCGCTCGTGG TGCAGCTCGA CGTCGCGGAT CGCGCGAAGT ACACGCAGGA GTGGTTTGGT CCGATCTCGT TCGTCATCGC GACCGACTCG ACTGCGCAAT CGCTCGATCT CGCCGGCTCG ATCGCGGCCG AGCACGGCGC GCTGACGCTG TCCGTCTATA GCACGGACGA TGCCGTCGTC GAAGCGGCGC ACGATGCGGC GATACGAGGC GGCGTCGCGC TGTCGATCAA TCTGACGGGT GGCGTGTTCG TCAATCAGTC GGCGGCGTTC TCCGACTTTC ACGGCACGGG CGCCAATCCG GCCGCGAACG CGTCGCTCGC CGATGCAGCG TTCGTCGCGA ATCGCTTCCG CGTCGTACAG AGCCGGCACC ATGTTGCGCC GAAGGCCGTT CCCGCGGAAG CTGGCCAAAC GGCATAA
|
Protein sequence | MTHPLFTKHE DTLKHALSTI ETRGYWSPFA EMPSPKVYGE SANADGEAAF KAQLYKPFEL DQPASGGTVG AERSPYGFAL GIRYPKSSPD ELIAAAAQAQ TAWRKAGPSA WVGVSLEILA RLNRASFEIA YSVMHTTGQA FMMAFQAGGP HAQDRALEAV AYAWHELQRI PADAHWEKPQ GKNPPLAMHK RYTVVPRGTG LVLGCCTFPT WNGYPGLFAD LATGNAVIVK PHPGAILPLA ITVRIARAVL REAGFDPNVV TLLATEGSDG ALVQELALRP EIKLIDFTGS SQNGNWLERN AHQAQVYTEK AGVNQIVIDS IDDLKAAVKN IAFSLALYSG QMCTAPQNIY VPRDGIRTAE GHVSFDDVAL AIAGAVQKLT GDPARSVELI GALQNEGIAA RIDGARKLGR VLADSQALEH PAFKDARVRT PLVVQLDVAD RAKYTQEWFG PISFVIATDS TAQSLDLAGS IAAEHGALTL SVYSTDDAVV EAAHDAAIRG GVALSINLTG GVFVNQSAAF SDFHGTGANP AANASLADAA FVANRFRVVQ SRHHVAPKAV PAEAGQTA
|
| |
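The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the blocks might be cleaned and a GC fraction computed. The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full gene, so the result is not expected to match the GC content field.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace between display blocks and normalise to upper case."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two ten-base blocks of the gene sequence in this record.
first_blocks = "ATGACCCATC CTCTGTTCAC"
print(len(clean_sequence(first_blocks)))  # 20
print(gc_fraction(first_blocks))          # 0.5
```

Passing the complete Gene sequence field to `gc_fraction` would give a value to compare against the GC content field.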