Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A2789 |
Symbol | |
ID | 4904416 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 2718741 |
End bp | 2720594 |
Gene Length | 1854 bp |
Protein Length | 617 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640145892 |
Product | AMP-binding domain-containing protein |
Protein accession | YP_001076818 |
Protein GI | 126457489 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.472303 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACATC CGCGCGAAAC GACGGCCGCC GACACGTTCG CCGACTTCGT CGAACTCACG CGATATCGCG CCGCGCATCA GCCCGATCTG CCGGTCTACA CGTTCGTCAC CGGCGGCGAT CGCGACGAAC GACCTCTCAC ATGCGCGCAG TTGGACAGGC GCGCAAGCGC CGTGGCGGCA GCGCTATCGG AAATCGCCCG GCCCGGCGAG CGCGTGCTGC TGCTGTTTGC ACCCGGCATC GACTATATCG CCGCGCTGTT CGGCTGCATG TACGCGGGCG TCGTGGCCGT GCCCGCGTAT CCGGTCGAGC CCGCGCAGCC CGAGCGCACG CTCCGGCGGC TGCTCGGCAT CGTCGCGGAT TGCGCGCCCG CCGCGGTGCT GTCGACGGCG GCCGTGCGCG ACGGCATGAA CCGCGTCGAA ACCGGTTCGC CCGCGCTGCG CGGCCTGCGC TGGATCGAGA TCGACGCACT CTCGCCCGGC GACGATGCCG CGCACGGCGC GCCGCGCGCG GCCGACCCGC GCGTGCCCGT CTACCTGCAA TACACGTCGG GTTCGACCGG CGCGCCGAAG GGCGTGATGA TCAGCCACCG GAACCTGCTG CACAACTCCG CGCTGATCGC GCGCCGCTTC GAGCACGGCG CGAGCAGCCG CGGCGTGATC TGGCTGCCGC CGTATCACGA CATGGGTCTG ATCGGCGGCA TCCTGCAGCC GCTGTACGTC GGCTTTCCTG TGACGCTGAT GTCGCACGTC GATTTTCTCA AGCACCCGCT GCGCTGGCTG CGCGCGATCG GCGAGCGCCG CGCGACGACG AGCGGCGGGC CGAACTTCGC GTATCAGATG CTCGCGACGA TGCGTATCGC CGACGCCGAT TTCGACAAGC TCGACCTGCG CTCGTGGGAC GTCGCGTTCG TCGGCGCGGA GCCGATCCGG GCCGCCACGC TGCACGCGTT CGCGCAGCGC TTCGCGCGCT GCGGCTTCGA TGCGCGCGCG TTCTATCCGT GCTACGGCCT CGCCGAACAC ACGCTGTTCA TGACGGGCGG ACTGAAATCG CAGCCGCCCG TCGTCGCGAA CGAGCCAAGC GACGCGCGGC TGCCGCGCGC ATCCGACCAC GCCGACGCCC CCGGCCAGGC CGACCCGGCC GGCGACGGGC GGCAAGCAGG CGCGCGCGCC GCCGTCGGAT GCGGCGACGC GGCCAGCGAC AGTTTGGTGC TGATCGTCGA TCCCGACACG CGCGTTCCGT GCGATGATGG CCGGGTCGGC GAAATCTGGG CGCAAGGGCC GAGCGTCGCG CTCGGTTACT GGAACAATCG CGCGCTCAGC GAGCAGACCT TCGAGGCCGA GCTGCCCGGC TACGCGGGGC GATTCCTGCG CACCGGCGAT TACGGCTATC GGTCGGGCTC CGAAGTGTTC GTCACCGGGC GGCTGAAGGA CATGATGCTG ATTCGCGGCG CGAATCATTA TCCGCACGAC GTCGAGGCGA CGATCGAGGC GCTCGACGCC GAGCTGTTCC GCCCCGGCGG CTGCGCGGTG TTCGCGCTCG ATACCGGCGC GGCGCCGCAA GTGACCGTCG TGCGCGAGTT GCGGGCGCGC TATTTGAAGG CATTCGGCGA CGGCGGCCAA GAAGCCGGCC ACACGCCCGA CGCGCTGTTC GGCAGGCTGC GTCGGGCGAT CAACCTGCAT CACGGCATTG CGGTACACCA TATCGTCTTC ACGTCGCCTT CTGCGATACC GAAGACGACG AGCGGAAAGG TCCAGCGGCA CGCCTGTCGC GAACTGTTTC TCAACGACAC GCTGCCGGTG GTCACCCAGT GGCGCGCGCC GTGCGGCGCG CCGAACGACA TCCGGAACAT CTGA
|
Protein sequence | MTHPRETTAA DTFADFVELT RYRAAHQPDL PVYTFVTGGD RDERPLTCAQ LDRRASAVAA ALSEIARPGE RVLLLFAPGI DYIAALFGCM YAGVVAVPAY PVEPAQPERT LRRLLGIVAD CAPAAVLSTA AVRDGMNRVE TGSPALRGLR WIEIDALSPG DDAAHGAPRA ADPRVPVYLQ YTSGSTGAPK GVMISHRNLL HNSALIARRF EHGASSRGVI WLPPYHDMGL IGGILQPLYV GFPVTLMSHV DFLKHPLRWL RAIGERRATT SGGPNFAYQM LATMRIADAD FDKLDLRSWD VAFVGAEPIR AATLHAFAQR FARCGFDARA FYPCYGLAEH TLFMTGGLKS QPPVVANEPS DARLPRASDH ADAPGQADPA GDGRQAGARA AVGCGDAASD SLVLIVDPDT RVPCDDGRVG EIWAQGPSVA LGYWNNRALS EQTFEAELPG YAGRFLRTGD YGYRSGSEVF VTGRLKDMML IRGANHYPHD VEATIEALDA ELFRPGGCAV FALDTGAAPQ VTVVRELRAR YLKAFGDGGQ EAGHTPDALF GRLRRAINLH HGIAVHHIVF TSPSAIPKTT SGKVQRHACR ELFLNDTLPV VTQWRAPCGA PNDIRNI
|
| |