Gene Information |
Locus tag | BURPS1106A_A0653 |
Symbol | |
ID | 4905881 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 633972 |
End bp | 635597 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640143759 |
Product | 2-aminobenzoate-CoA ligase |
Protein accession | YP_001074689 |
Protein GI | 126457315 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0365] Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases |
TIGRFAM ID | |
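The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS includes its stop codon (the listed gene sequence ends in TGA). The function names are hypothetical and are not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
gene_len = feature_length(633972, 635597)
print(gene_len)                  # 1626, matches "Gene Length"
print(protein_length(gene_len))  # 541, matches "Protein Length"
```

Under these assumptions the listed Start bp, End bp, Gene Length, and Protein Length are mutually consistent: 1626 bp is 542 codons, one of which is the stop codon.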
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACAACT TTTGCCGCAC GAATCTGCCT GCACCGACCG ATCTGCCGGA GTTCGTCTTC GAGTTGCCGG GCCTGCAGTA TCCGGCGCGC ATCAACTGCG CGGCGGCGCT GCTCGACGAC GCGGTGACCC GGCGGGGCTG GGGCGAGCGC GTTGCGATCA GGACCGAGTC CGGTGCCGCC TGGTCGTATC GCGCGCTGTT CGAGCTGAGC AACCGGATCG CCAACCTGCT GGTGCGCGAC GGCGGGCTCG TGCCGGGCAA CCGGGTGCTG CTGCACGGAA CCAATCATCC GTTTCTCGCC GCCGCATGGT TCGCGATCGT CAAGGCGGGC GGCGTCGTGG TGACGACGAT GCCGCTGCTG CGCGCGGGCG AGCTGTCGAA AGTCATCGCG CAGGCGCAGG TCACGCACGC GCTGTGCGAG GCGGCGGTGT CCGCCGAGTT GCGCGCCGCG ATGGCGGCGG CGCCGGGCGT CGCGTTCGTC CGGTACTACG AGACCGACGA CGCGGCCGCG TTCGAGCCGC TGCTGCACGC GTGCCCGCGC ACGTTCGAGC CGGTCGATAC GCGCGCCGAC GAGCCGTGCA TCGTCGCGTT CACGTCGGGC ACGACGGGGC GCCCGAAGGC GACCGTGCAT TTTCATCGCG ACGTGATGGC GATCTGCCAT TGCTTTCCGC AGCACGTGCT GAAGCCGAAC GCCGACGACG TGTTCTGCGG CTCCCCGCCG CTCGCGTTCA CGTTCGGGCT CGGCGCGCTG CTGCTGTTTC CGCTGAGCGT CGGCGCGAGC GTCGTGCTGC TGCAGCGGGC GAAGCCGCAG CGGCTGCTCG CCGCGATCGG CGCGCATCGC GTGAGCATCC TCTTCACCGC GCCGGCCGCG TATCGCGCGA TGCTCGACGA GCTCGGCGAG CACGACATCG CCAGCCTGCG CAAGTGCGTG TGCGCGGGCG AGGCGCTGCC GGTGCCGACG CGCAACGCGT GGCTCGCGCG CACGGGCATT CGCATCATCG ACGGCATCGG CGCGACCGAG ATGCTGCACA TCTTCGCGTC CGCGGACGAA ACGCAGGCGA AGGAAGGCGC GATCGGCAAG GCGGTGCCCG GCTACCGGCT CGCGATCCTC GACGAGCGCG GCGAGCGCCT GCCGCCGTAT CACGTCGGCC GTCTCGCGGT GCAGGGGCCG ACCGGCTGCC GCTACCTGAA CGATGCGCGG CAGCGCGATT ACGTGCGGCA CGGCTGGAAC CTGACGGGCG ACGCCGCCTA CCTCGACGAG GACGGCTACC TGTTCTACCA GTCGCGCGCC GACGATCTGA TCATCAGCCT CGGCTACACC ATCTCGCCCG CCGAGGTGGA GGAGGCGCTG CTGAGCCACG CGGACGTGCT CGAGTGCGGT GTCGTCGGCG CGCCCGACGG GCGAGGCGGC ACGCTCGTGT GCGCGCACGT GGTGCCGCGG CCCGGCGTGC ACGGCTGCGA TGCGCTGACG GCCGCGTTGC AGCAGCACGT GAAGGCGCGG ATCGCGCCGT ACAAGTATCC GCGGCGCATC GAGTATCACG CGGCCGGGCT GCCGCGCAAC GACTCCGGCA AGCTGCAGCG CTTCAAGCTG CGGCAGGCGG CCGAGGAAGA CGTGCAGGCG GCCTGA
|
Protein sequence | MDNFCRTNLP APTDLPEFVF ELPGLQYPAR INCAAALLDD AVTRRGWGER VAIRTESGAA WSYRALFELS NRIANLLVRD GGLVPGNRVL LHGTNHPFLA AAWFAIVKAG GVVVTTMPLL RAGELSKVIA QAQVTHALCE AAVSAELRAA MAAAPGVAFV RYYETDDAAA FEPLLHACPR TFEPVDTRAD EPCIVAFTSG TTGRPKATVH FHRDVMAICH CFPQHVLKPN ADDVFCGSPP LAFTFGLGAL LLFPLSVGAS VVLLQRAKPQ RLLAAIGAHR VSILFTAPAA YRAMLDELGE HDIASLRKCV CAGEALPVPT RNAWLARTGI RIIDGIGATE MLHIFASADE TQAKEGAIGK AVPGYRLAIL DERGERLPPY HVGRLAVQGP TGCRYLNDAR QRDYVRHGWN LTGDAAYLDE DGYLFYQSRA DDLIISLGYT ISPAEVEEAL LSHADVLECG VVGAPDGRGG TLVCAHVVPR PGVHGCDALT AALQQHVKAR IAPYKYPRRI EYHAAGLPRN DSGKLQRFKL RQAAEEDVQA A
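The relationship between the two sequences above can be checked with a short script. This is a minimal sketch, not the tool that generated this record. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard code (the two tables differ in permitted start codons), and it assumes the sequences are pasted in the space-grouped form shown here. All function names are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG order; the amino-acid assignments are
# those of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace used to group residues in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGGACAACT TTTGCCGCAC GAATCTGCCT"))  # MDNFCRTNLP
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence gives a quick consistency check, and `gc_fraction` on the gene sequence can be compared with the GC content field.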