Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_3573 |
Symbol | paaN |
ID | 4901788 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | + |
Start bp | 3480926 |
End bp | 3482632 |
Gene Length | 1707 bp |
Protein Length | 568 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640136799 |
Product | phenylacetic acid degradation protein paaN |
Protein accession | YP_001067804 |
Protein GI | 126455462 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR02288] phenylacetic acid degradation protein paaN |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCATC CTCTGTTCAC GAAGCATGAA GACACGTTGA AGCACGCGCT CTCCACGATC GAAACGCGCG GCTACTGGAG CCCGTTCGCC GAGATGCCGA GCCCCAAAGT GTACGGGGAA AGCGCCAATA CAGACGGCGA AGCAGCATTC AAAGCCCAGT TGGACAAGCC CTTTGAACTC GACCAACCCG CCTCGGGCGG AACGGTCGGC GCCGAGCGTT CGCCATACGG GTTTGCGCTC GGCGTCCGCT ACCCGAAGTC GACGCCCGAC GAGCTCATCG CCGCCGCCGC GCAGGCGGAA TGCGCGTGGC GCAAGGCCGG GCCGACCGCG TGGGCTGGCG TGTGTCTCGA AATTCTCGCC CGGCTGAATC GCGCGAGCTT CGAGATCGCA TACAGCGTGA TGCACACCAC GGGACAGGCG TTCATGATGG CGTTCCAGGC GGGCGGCCCG CACGCGCAGG ATCGCGCGCT CGAAGCCGTC GCCTATGCAT GGCAAGAACT GCAGCGCATT CCCGCCGAAG CGCACTGGGA GAAGCCGCAG GGCAAGAACC CGCCGCTCGC GATGCGCAAG CGCTACACGA TCGTGCCGCG CGGGACGGGG CTCGTGCTCG GGTGCTGCAC GTTCCCGACC TGGAACGGCT ATCCCGGTCT GTTCGCCGAT CTGGCGACCG GCAACACAGT CATCGTCAAG CCGCATCCCG GCGCGATCCT GCCGCTCGCG ATCACCGTGC GCATCGCGCG CGACGTGCTG CGCGAAGCGG GCTTCGATCC GAACATCGTC ACGCTGCTCG CGACCGAAGG AAACGACGGC GCACTCGTCC AGGATCTGGC GCGCCGGCCG GAAATCAAGC TGATCGACTT CACCGGCAGC TCGCAAAACG GCACCTGGCT CGAGCGCAAT GCGTACCAGG CGCAGGTCTA TACGGAGAAG GCGGGCGTCA ACCAGATCGT GATCGATTCC GTCGACGACC TGAAAGCCGC CGTCAAGAAC ATCGCGTTCT CGCTTGCGCT CTACTCCGGC CAGATGTGCA CAGCGCCGCA AAACATCTAT GTGCCGCGTG ACGGCATCCG CACCGCCGAA GGGCACGTCG GCTTCGACGA CGTCGCGCAG GCGATCGCCG ACGCCGTGCA AAAGCTGACG GGCGACCCGG CACGCTCGGT CGAACTCATC GGGGCGCTGC AGAACGAAGG CGTCGCGGCA CGTATCGACG AAGCGCGCAA GCTCGGCCGC ATTCTCGCCG ACAGCCAGGC GCTCGAGCAC CCGGCATTCA AGGACGCGCG CGTGCGCACG CCGCTCGTGC TGCAACTCGA CGTCGCGGAC CGTGCGAAGT ACACGCAGGA ATGGTTCGGT CCGATCTCGT TCGTCATCGC GACCGATTCG ACTGCGCAAT CGCTCGATCT CGCCAGCTCG ATCGCGGCCG AGCATGGCGC GCTCACGCTG TCCGTCTATA GCACGGACGA CGCCGTCGTC GAAGCGGCGC ACGAAGCGGC GGTGCGCGGC GGCGTCGCGC TGTCGATCAA TCTGACGGGC GGCGTGTTCG TCAATCAGTC GGCGGCGTTC TCCGACTTTC ACGGCACGGG CGCCAATCCG GCCGCGAATG CGTCGCTCGC CGACGCCGCG TTCGTCGCGA ACCGCTTCCG CGTCGTTCAG AGCCGCCACC ATGTTGCGCC GAAGGCGGCT CCCGCGGAAG CCGGCCAAAC GGCATAA
|
Protein sequence | MTHPLFTKHE DTLKHALSTI ETRGYWSPFA EMPSPKVYGE SANTDGEAAF KAQLDKPFEL DQPASGGTVG AERSPYGFAL GVRYPKSTPD ELIAAAAQAE CAWRKAGPTA WAGVCLEILA RLNRASFEIA YSVMHTTGQA FMMAFQAGGP HAQDRALEAV AYAWQELQRI PAEAHWEKPQ GKNPPLAMRK RYTIVPRGTG LVLGCCTFPT WNGYPGLFAD LATGNTVIVK PHPGAILPLA ITVRIARDVL REAGFDPNIV TLLATEGNDG ALVQDLARRP EIKLIDFTGS SQNGTWLERN AYQAQVYTEK AGVNQIVIDS VDDLKAAVKN IAFSLALYSG QMCTAPQNIY VPRDGIRTAE GHVGFDDVAQ AIADAVQKLT GDPARSVELI GALQNEGVAA RIDEARKLGR ILADSQALEH PAFKDARVRT PLVLQLDVAD RAKYTQEWFG PISFVIATDS TAQSLDLASS IAAEHGALTL SVYSTDDAVV EAAHEAAVRG GVALSINLTG GVFVNQSAAF SDFHGTGANP AANASLADAA FVANRFRVVQ SRHHVAPKAA PAEAGQTA
|
| |