Gene Information |
Locus tag | Bcep18194_A3630 |
Symbol | |
ID | 3748808 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007510 |
Strand | - |
Start bp | 504183 |
End bp | 505889 |
Gene Length | 1707 bp |
Protein Length | 568 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637761904 |
Product | phenylacetic acid degradation protein paaN2 |
Protein accession | YP_367875 |
Protein GI | 78065106 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR02288] phenylacetic acid degradation protein paaN |
|
|
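The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon (consistent with "Is gene spliced | No"); the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 504183
end_bp = 505889
gene_length_bp = 1707
protein_length_aa = 568


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

# Both comparisons should hold if the assumptions above are right.
print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under these assumptions, 505889 - 504183 + 1 = 1707 bp, and 1707 / 3 - 1 = 568 aa, which agrees with the listed values.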
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000000209323 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.623158 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCATG CACTGTTCAC GAAGCACGAA GACACGCTGA AACACGCACT CGCCGCCATC GAGAGCCGCG GGTACTGGAG CCCGTTCGCC GAAATGCCGA GCCCCAAAGT GTACGGGGAA AGCGCGAACG CAGATGGCGA GGCCGCGTTC AAGTCGCACC TCGACAAGAC GTTCGCGCTC GACCAGCCAG CGTCCGGAGA AACGGTCGGC GCAGAGCGGT CGCCGTACGG CGTTGCACTG GGCATCCGGT ATCCGAAGTC GACGCCCGAC GAACTGATCG CCGCCGCTGC CGCCGCGCAA CGCTCGTGGC GCGAAGCCGG CCCGAGCGCC TGGATCGGCG TCAGCCTCGA AATCCTCGCG CGCCTGAATC GGGCCAGCTT CGAAATCGCC TACAGCGTGA TGCACACCAC GGGGCAGGCA TTCATGATGG CGTTCCAGGC CGGCGGCCCG CACGCGCAAG ATCGCGCGCT CGAGGCAGTC GCCTATGCAT GGGACGAACT GCGCCGCATC CCCGCCGAAG CACACTGGGA AAAGCCGCAA GGCAAGAACC CGCCGCTCGC GATGCACAAG CGCTATACGA TCGTCCCGCG CGGCACCGGG CTCGTGCTCG GCTGCTGCAC GTTCCCGACC TGGAACGGCT ACCCCGGCCT GTTCGCCGAT CTCGCAACCG GCAACACCGT GATCGTCAAA CCGCATCCGG GCGCGATCCT GCCCCTCGCC GTCACCGTCC GGATCGCTCG CGACGTGCTG CGCGAAGCCG GTTTCGATCC GAACGTCGTC ACGCTCCTCG CGACCGAACC CAACGACGGC GCACTCGTTC AGGATCTCGC GCTGCGCCCC GAGATCAAGT TGATCGACTT CACCGGCAGC ACGCAAAACG GCACGTGGCT CGAACGCCAT GCTCACCAGG CACAGGTCTA TACGGAGAAG GCTGGCGTCA ACCAGATCGT GATCGACTCG ACCGACGACC TGAAGGCCGC CGCCAAAAAC ATCGCGTTCT CGCTGGCGCT GTACTCCGGC CAGATGTGCA CGGCACCGCA GAACATCTAC GTGCCGCGCG ACGGCATCCG GACGGCGGAC GGCCATGCGA GCTTCGACGA GGTCGCACAG GCAATCGCCC TTTCCGTGCA AAAACTCACC GGCGACCCGG CACGCTCGGT CGAACTGATC GGCGCAATCC AGAACGAAGG CGTGACCGCA CGCATCGACG ATGCCCGCCA GCTCGGCCGC GTGCTCGCCG ACAGCCTGAC CCTCCAACAC CCTGCGTTCC CCGATGCCCG CGTGCGCACG CCGCTCGTGC TGCAACTCGA CGTGGCCGAT CGCGAGAAAT TCACGCAGGA ATGGTTCGGC CCGATCTCGT TCGTGATCGC GACCGACTCG ACCGTGCAAT CGCTCGACCT CGCCGGGGAA ATCGCGGCGG AACATGGCGC GCTGACGCTC TCCGTCTACA GCACCGACGA TGCGATCGTC GACGCCGCCC ACGACGCGGC GGTGCGCGGT GGCGTCGCGC TGTCGATCAA CCTGACGGGC GGTGTGTTCG TGAACCAGTC GGCAGCGTTC TCGGATTTCC ACGGCACGGG CGCGAACCCT GCAGCCAACG CGGCGCTGGC CGACCCGGCG TTCGTCGCCA ACCGGTTCCG CGTGGTGCAA AGCCGCGTCC ATGTTGCCCC GAAGGCTGCA CCTGTGGAAG CCGGCCAGCC GGCATAA
|
Protein sequence | MTHALFTKHE DTLKHALAAI ESRGYWSPFA EMPSPKVYGE SANADGEAAF KSHLDKTFAL DQPASGETVG AERSPYGVAL GIRYPKSTPD ELIAAAAAAQ RSWREAGPSA WIGVSLEILA RLNRASFEIA YSVMHTTGQA FMMAFQAGGP HAQDRALEAV AYAWDELRRI PAEAHWEKPQ GKNPPLAMHK RYTIVPRGTG LVLGCCTFPT WNGYPGLFAD LATGNTVIVK PHPGAILPLA VTVRIARDVL REAGFDPNVV TLLATEPNDG ALVQDLALRP EIKLIDFTGS TQNGTWLERH AHQAQVYTEK AGVNQIVIDS TDDLKAAAKN IAFSLALYSG QMCTAPQNIY VPRDGIRTAD GHASFDEVAQ AIALSVQKLT GDPARSVELI GAIQNEGVTA RIDDARQLGR VLADSLTLQH PAFPDARVRT PLVLQLDVAD REKFTQEWFG PISFVIATDS TVQSLDLAGE IAAEHGALTL SVYSTDDAIV DAAHDAAVRG GVALSINLTG GVFVNQSAAF SDFHGTGANP AANAALADPA FVANRFRVVQ SRVHVAPKAA PVEAGQPA
|
| |
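The gene sequence above is printed in space-separated blocks of ten bases, so it needs the whitespace removed before any computation. The following is a minimal sketch of how the GC content field could be recomputed from such a string; the helper names are hypothetical, and the sample uses only the first three blocks of the gene sequence rather than the full 1707 bp.

```python
def clean_sequence(text: str) -> str:
    """Remove whitespace used for block formatting and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three ten-base blocks of the gene sequence shown above.
sample = "ATGACCCATG CACTGTTCAC GAAGCACGAA"
sample_seq = clean_sequence(sample)
sample_gc = gc_fraction(sample_seq)
print(len(sample_seq), sample_gc)
```

Pasting the full gene sequence in place of `sample` would give a value to compare against the "GC content" row; the same `clean_sequence` helper also works for the block-formatted protein sequence.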