Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcen_4973 |
Symbol | |
ID | 4094032 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia cenocepacia AU 1054 |
Kingdom | Bacteria |
Replicon accession | NC_008061 |
Strand | - |
Start bp | 2240716 |
End bp | 2244282 |
Gene Length | 3567 bp |
Protein Length | 1188 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 638018255 |
Product | amino acid adenylation |
Protein accession | YP_624821 |
Protein GI | 107027310 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins [COG3320] Putative dehydrogenase domain of multifunctional non-ribosomal peptide synthetases and related enzymes |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain [TIGR01746] thioester reductase domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.432296 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGCGA TCATCATTGC GATCGGCGGG CAGACCCGTC TCGCCGCACG GCTGGCGGCT GAACTCGAGG CGACCGGCCA TACCGTCTGT ATCGCCGACG GACGCGACGG CCCGCTGCGC GACGCGGCCG ATGCGCTGCT GCTGCCGGCG CCGGCACCCG AACCGTTGCG TAACGCGGCC GAGTGGTCGA TCGTCTGCGG CGTGCCGGAC CCCGCGGTGC ATTGCGCCCA CGCACGGCCG CTGCATTGCG AGATCGTCGC CGATGCGTTT CCGCCGCGCG AAGTACGCGT CGCGTGGCTC ACGGACGCGG CGCCCGGCGA CACGCGCGTG CTGGCCGACG CGACGCTCGC GTTTGGCGCG GCGCTCAACG GCGCCGACCT CGCGGACGAG ATCGCCGCAT GCGCGGTCGA TCTGATGCAC GAAGTCGCGA GCGGGATCGC GCGCGAGCTG AGCGATCCGA ACGACGCGCC GCGCGCCACG CCGCCGGGGC CGTCCGATAT CGTGCTCGAT CTCGAACGGC TGGCCGGCTG GCATGCGTCT AATGACACGG CCGCCGCGTG GCCGTGCGAG CCGTCGCTGC CGGAGCTCGT GTCGCGTGCG GCGGCCGCTG CGCCGGACGC CTGCGCGCTC GTCGCGGCCG ATGGCCAGCT GACCTATCGC GAGCTGGTCA CGCGCGCCCG CCAGGTTGCC GCGCGCATCG CCGCGCGGCC GGCCGCGCCA CGTGTGGTCG CGGTGCGACT CGACAAGGGC GTCGCGCTGT ACCCGGCGAT CGTCGGCGTG CTGGGCGCCG GCGCGACCTG CGTGCCGCTC GATCCCGCGT TTCCACCCGA GCGGGCCCGC ACGATCCTGC GTGAATCGGG GGCGCAGGCG CTCGTCGTCG GCGGGGCGGT CGAGCCGGCG CTGCTCGACG GATTCGACCT CGACGTGATC GACTGCGGCG CGCACGCGGA AGCGGACGCG CACGCCTCGC CCGATGCGCT GGCCGGCCAG TGGCCGCTCG AACGGGATGC GGACCGCGAT GCGCGCTGCG CGGTCGCGAT CTACACGTCG GGGTCGACCG GCGTGCCGAA AGGCGTGATG CTGTCGCACC GGAACATCGT GCAGTTCTGC CACTGGTATC GCGCGCACGT GTCGCTCGAT GCGTCGTCGC GCGTGCTGCA GTTCTCGACG GTGGCGTTCG ATGCGTCGCT GCTCGACATG TTCCCGACCT GGCTCGCCGG CGCGACGCTG GTCGCGCCGA GCGAAGCGCA GCGCCGCGAA CTCGACGCGC TCGCGACGCT CGTCGCCGAC GCGCGCATCA CGCACGCGTT CCTGCCGCCC GCGCTGCTCG CGGCGCTGCC GGATTGCGAC TGGCCCGCGC TCGCGCACCT TGTCACGGGC GGCGACGTGT GCGACCCCGA CACGATCGCG CGCTGGAGCG CGAACCGGCG GCTGCACAAC ATCTACGGGC CGACCGAATG CACGGTGCTC GCGACGACGG GCGAACTGCG CGCCGGCGAC AGCAACCGGC GCATCGGCCG GCCGATCGCG AACGCGCGCT GCCATGTGCT CGCGGCCGAC GGCCGTCCGG CGCTCACCGG CGAGGAAGGC GAACTGTGCA TCGCCGGCGC GGGCGTCGGG CTCGGCTATC TCGGTCGGCC CGACCTGAGC GCCGAGCGTT TCGTCGCCGA TCCGTACGGC GCGCCGGGCG CGACGATGTA CCGGACCGGC GACATCGCCA GCTGGGAGCC GGACGGCACG CTGCGCTACG TGGGCCGCCG CGACACGCAA CTGAAGATTC GCGGTTTTCG CGTCGAGCCG GGCGAGATCG AGACGGCCGC GCTGGCGGCG GGGCTGTACC GCCAGTGCGC GGTGGTGCCC GACGAGCGCA AGCGCATCCG GCTGTTCGCC GCGAAGCCGG TCGACGCGGC GGCGACGCCC GACGCGCTGC GCGCCGTGCT GGCCGCCACG CTGCCCGACT ACATGGTGCC GTACGACATC ACGGCGCTCG ACGTATTGCC CGCGACGCCG AACGGCAAGA TCGACCGCGC CGCACTCGCG CGCCTGCCGG TGTCGCGTGC GGGAAGCGAC ACGCGCGACG CACCGCGCGG CGCGCTGGAG CTTCGTCTCG CCGCGATGTG GGCGACGCTG CTCGAACTCG CGCCGGACGA GATCGGCCGC GACGCGTCGT TCTTCGAACT GGGCGGCCAT TCGCTGCTGG TGTCGCGGCT GATGCTGGCG GTGAAGCGCG AGCTGGGCGG CAACGCAGCG CTCGCGCGTT TCATGGAGCG GCCGACCATC GCCGCGCTCG CGGCGCTGCT GACCGACGAG TCGGGCGAGC GCGGCGCGAA CGTGCCGGCG CGCGTGCACG ACGATCGCCG CCTGCCCGAC GACGTGCGAC TGCCGGCCGG CCAGCCGGCG GGCGACGGCT CCGGCGCGGT GCTGCTGACC GGCGCGAACG GCTTCCTGGG CTGCTTCATC CTCAGCGAAC TGATCAGCCG CACGAACCAG ATCGTCTACT GCGTCGTGCG CGGCGACGAC GATGCGAGCG CGCGCCGGCG GCTCGACGAA GCGGCGTTCG TCAACGGGCT CGGCCATCTG TGCGGGCATC CGCGCGTGCG GGTGCTGCGC GGCGATCTCG GTGCGCCGCG CCTCGGGCTG TCGGATGCGG TGTGGCAGAC GCTCGCGGCC GAGGTCGGCG CGATTCATCA CAACGGCGCA CACGTGAACC ACGTGTACGA CTACCCGTAC CTGCACGCGG AGAACGTCGG CTCGACGCTC GAACTGCTCC GCCTGTGCTG CAGCGGCCGC CGCAAGGCGC TGCATTTCGT ATCGACGCTA TCCGCGGCGA GCGCGACGGG CCCGACGGGA CGCCTGATCG AGGCGGCCCC GTCGGAGGCC GGCCCTGCCT TCGTCAACAA CGGCTACAAC CTGACGAAGT GGGTGTCCGA GCATCTCGTT GCCGAAGCGG CTGCGCGCGG GATCGACACG ACGATCCTGC GGCCCGGCAA CATCACGGGC CATTCGCGCA CGGGGCTGTG CCAGCCGGGA CGCAACCGGA TCCTGCTGCT GCTCAAGGGC GCGGTGCAGC TCGGCTGCGC GCCGCTCGCG AGCGAGGGCG GGCTGTTCGA CCTGAGCCCG GTCGACTACC TGGCGCGGGC GATCGTCGCG TGCACGCTCG ACGGTGCGCG CACCGAGCGC GTGTTCCATC TGCACAACCC GCGTCCGCTC GACTGGGCCG GCTACCTGCG CGCACTCGCG CGGCGCGGCT ATCCGCTGCG ATTCGAAGCG CCCGGGGTCT GGCGCGAGCG GCTGTTGTCG ATCGACGAAT CGAATGCGCT GTTCGACGTC GTCGCGTTCT ATCTCGACGA CCGGCAGGAC GATATCGGCG ACATGGCCGT GATCGATCAC GCGCGCACCG AGGCGACGCT GCGCCGGCTA GGCGTCACGT ATCCGGACAA GGACGACGCG CTGCTCGATG CGCATTTCGG CTACCTCGCC GAATGCGGCT TCATGCCGCC GCCGCCCGAG CCGGCGCCGC GCGCGAACGA CGCCCGCCGC GACGCCGAGC CCGAACCCGC GTGGTGA
|
Protein sequence | MKAIIIAIGG QTRLAARLAA ELEATGHTVC IADGRDGPLR DAADALLLPA PAPEPLRNAA EWSIVCGVPD PAVHCAHARP LHCEIVADAF PPREVRVAWL TDAAPGDTRV LADATLAFGA ALNGADLADE IAACAVDLMH EVASGIAREL SDPNDAPRAT PPGPSDIVLD LERLAGWHAS NDTAAAWPCE PSLPELVSRA AAAAPDACAL VAADGQLTYR ELVTRARQVA ARIAARPAAP RVVAVRLDKG VALYPAIVGV LGAGATCVPL DPAFPPERAR TILRESGAQA LVVGGAVEPA LLDGFDLDVI DCGAHAEADA HASPDALAGQ WPLERDADRD ARCAVAIYTS GSTGVPKGVM LSHRNIVQFC HWYRAHVSLD ASSRVLQFST VAFDASLLDM FPTWLAGATL VAPSEAQRRE LDALATLVAD ARITHAFLPP ALLAALPDCD WPALAHLVTG GDVCDPDTIA RWSANRRLHN IYGPTECTVL ATTGELRAGD SNRRIGRPIA NARCHVLAAD GRPALTGEEG ELCIAGAGVG LGYLGRPDLS AERFVADPYG APGATMYRTG DIASWEPDGT LRYVGRRDTQ LKIRGFRVEP GEIETAALAA GLYRQCAVVP DERKRIRLFA AKPVDAAATP DALRAVLAAT LPDYMVPYDI TALDVLPATP NGKIDRAALA RLPVSRAGSD TRDAPRGALE LRLAAMWATL LELAPDEIGR DASFFELGGH SLLVSRLMLA VKRELGGNAA LARFMERPTI AALAALLTDE SGERGANVPA RVHDDRRLPD DVRLPAGQPA GDGSGAVLLT GANGFLGCFI LSELISRTNQ IVYCVVRGDD DASARRRLDE AAFVNGLGHL CGHPRVRVLR GDLGAPRLGL SDAVWQTLAA EVGAIHHNGA HVNHVYDYPY LHAENVGSTL ELLRLCCSGR RKALHFVSTL SAASATGPTG RLIEAAPSEA GPAFVNNGYN LTKWVSEHLV AEAAARGIDT TILRPGNITG HSRTGLCQPG RNRILLLLKG AVQLGCAPLA SEGGLFDLSP VDYLARAIVA CTLDGARTER VFHLHNPRPL DWAGYLRALA RRGYPLRFEA PGVWRERLLS IDESNALFDV VAFYLDDRQD DIGDMAVIDH ARTEATLRRL GVTYPDKDDA LLDAHFGYLA ECGFMPPPPE PAPRANDARR DAEPEPAW
|
| |