Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1710b_A1237 |
Symbol | |
ID | 3693923 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007435 |
Strand | - |
Start bp | 1547312 |
End bp | 1550386 |
Gene Length | 3075 bp |
Protein Length | 1024 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637731491 |
Product | acetyltransferase |
Protein accession | YP_336394 |
Protein GI | 76817819 |
COG category | [C] Energy production and conversion |
COG ID | [COG1042] Acyl-CoA synthetase (NDP forming) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000574609 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAATTCGA ATCTGGTCAG TGATGGGCCA CGTCGTTCGC TTGATTGCAC GGCGATTTTT TGCCCTGCGC GGCGATTGCC CGCTATTGGG CAATGCATCG AATTTGCCGG TCCGCCATTT TCGATTCCCG AGCCGGCCGG ACCGATGTTT CGAGACGGCC GTTTCAGCAC CGACGAGTCG GGCTTCGAAT GTCGTGCAGC GGGTCGTGGC GATCCGCGGG ACGGGACCTC GACGCGCGGG TTTCTTCCGC TGGCGGACGT TGCAGTCGAG CCCGGGGCGC CGTGTTCGCG CGTGCATGCA GCGGCGGAGC GCCGCGGCGA GACGCGTTGC CGCGCGAGTC GTGACGGCAG GCCAGAATCG CATCGCGCGC CCGTATCGAC GCCGTCGAGC CGCGCGCGGC ATGCCGCCGG GCGCAAGACA GGGCTCGCCG CATGCGGGCC CCCGGCTTGT TCGCTCCATC CGTTCATTTC GTGTCGCTCC GTTTCGAAAG AGAAGGCTCG CGCCGTTCAA TGCATGCGGC CGCGAAGCGG CTGGTTCGGC GGCGGCGGCG AAGTGTGTGC GTCGATGATG CGGGCGCTTG CGTGGTTCGG CGATTCGGCC GAACGGCATA TAGGCGGCGC TCGATGCTTC GCCTATCCTA TGGGCACTCG GGCGCGGTCC GGCGTGGCCG CGCGTCGATC GCCGTCGCTC AATCCTTTCA GCGGAGTTGC CGTGACCGTT CGCAATCTCG ATGCGTTGTT CCATCCGACA GCCGTTGCCG TCGTCGGCGC TTCGCCGCGC CCGGGCAGCG TCGGCGCGAT GGTGTGGGCG TGCGTGCTCG ACGGCCGGTT CGGCGGCGCG ATCTGGCCTG TCAACCCGAA GTACGGCGAA CTGAACGGCC ACAAGGTTTA TCCCTATGTC GACCAGCTGC CGAGCGCGCC GTCGGTCGCG CTCATATGCA CGCCGCCCGC GACGTGGCCC GGCATCGTGC GCAAGCTGGG CGGCCTGGGC GTGCGTGCGG CGATCATCGT CGGCGAGACT CGTTCGGGCG CAGACCGGGC CGCGCTGGAA CGGGCGCTTG CCGCGGCGAA GCCATATCTG CTGCGCGTCG TCGGGCCGGG CAGCCTGGGC GTGGTGTCGC CCGCGCTGGG CGCGCATTTC GGCGCGCCGG CGTGCATCGT CAAGGCGGGT GGCGTCGCAT GGGTGTCGCA ATCGAATGCG CTGACGAACG CGGTGCTCGG CTGGGCGCAT GCGCGCGGGC TCGGTTTCTC GCATGCGGTC GCACTCGGCG GCGAGGCGGA TGTCGACGCG GCCGACGTGC TCGACTACCT GGCGAGCGAT GCCGAGACAC GCGCGATCCT GCTCGAGCTC GACACGGTGA AATCGGCGCG CAAGTTCATG TCGGCGGCGC GCGCCGCGGC GCGCAACAAG CCGGTGCTTG CGTTGCGCGC GGGGCGCGGC GATTCGGGCG ACCTGCTTTA CACGGCCGCG TTTCAGCGCG CGGGGATGGT GCGCGTCGAC GCGCTCGACG ATCTTCTCGA CGAGATCGAG GCGCTCGGCG TCGGCCGTGC CGCGGCGACG AGCGGCGGGG TGACGCTCGT CACGAGCGAC AGGGGCGTCG CGAAGCTTGC CGTCGACGCA CTCGCGGCGG CGGGGGAAAC GCTGGCGCAA TGGCCGCGGG CGGCCGTCGA CGAAGTCGGC GGCGCGCTGC CCGCAGGCAT CGTCGCCGGC AATCCGTTGC TGCTCGGCGA CGACGCGCGG CCCGAATATT TCGGCGCGGC GCTGAAGGCG CTCGCGCAGC ATCCGCCGAC AGGCACCGTG TTCGTGGTTC ATGCGACGTC GCATAGCGCG CCTGCCGTCG ACGTCGCACG TGTGCTGATC GAATCGCGCA AGTTCGCGCG TCGCGGCATG CTCGCGTGCT TCTTCGGCGG CGTCGACGCC GCGACGCGCG ACGCGCTGCA CGTGCACGGC ATCCCCGTGC ACACGACGCC GCAGCGTCTC GCGCGCGCGC ATGCGCGCCT TGTCGATTAT CAATTGGGCC GGGAGCTGTT GATGCAGACG CCGGAGGGCA CGCCGCCGCA ACCGGCGGCG TCGATCTCGG CCGCGCGGCA CACGGCGCGC GCGGCGCTCG CGCAGGGGCG CGACGGGTTC GAAGGCGACG CGGCGCTCGA ATGGCTGGCG GGTTTCGGCA TCGAGCGCGC GACCGACGCG GACGTCGACA TGGGCGATAC GATCGTCGAC ATCACGGTCG GGATGTACGA CGACCCGAAC TTCGGGCCGG TATTCCGCTA TTCGGTGCCG CCGGCGGATG GCGTGTCCGC GCCGTTCGTC GTCTACGGGC TGCCGCCTTT CAATACCGTG CTCGCGCGCG CGGTCGTCGC GCGATCGCCG TATGCGCATC GCGCGCCGCC CGAGCCGCTT CTGCAGGCAC TGACCGCGCT TTCGCAGGCG GTGTGCGATG TGCGCGAAAT CGTCGAGATG TCGCTTGTGC TGCGCGTGCG GCCCACACGC GTGGTCGCGC TTGGGCCGCA CATCAGGCTC GCGACGGGGC GCAGCCGGCT CGCGATCGTG CCATATCCGC GCTATCTCGA GCAGCAGCTC GACTGGCGCG GAGAACGCAT CACGGTGCGT CCGATTCGCC CCGAGGACGA GGCCGCGCAT CGCGAGCTGC TGAGCGCGAT GACGCCGGAC GATCTGCGAA TGCGCTTCTT CGGTGCGATA CGCAACTTCG ACCATTCGCA GATCGCGCGG ATGACGCAGA TCGATTACGA CCGCGAGATG GCGCTGATCG CGACGCTCGA CGATGCCGAC GGCCGCGCGC ATACGCTCGG CGCGGTGCGC GCGGTGACCG ATCCGGACAA CGAAGCGACG GAGTTCGCGA TCGCGGTGCG GCCTGACCAA AAGGGCAAGG GGCTCGGACG GATGTTGATG ACGCGCATCA TCGACTACGC GCGCTCGCGC GGAACGGCGT GGATGATCGG CGAGGCACTG CGCGAGAACA CGGCGATGAT CTCGCTCGCG AAGGACAGCG GTTTTGCGGT GTCGTCGACC GAGGAGCCTG GGGTGGTGGC GTTCCGGCTG AAGCTGCAGC CGTGA
|
Protein sequence | MNSNLVSDGP RRSLDCTAIF CPARRLPAIG QCIEFAGPPF SIPEPAGPMF RDGRFSTDES GFECRAAGRG DPRDGTSTRG FLPLADVAVE PGAPCSRVHA AAERRGETRC RASRDGRPES HRAPVSTPSS RARHAAGRKT GLAACGPPAC SLHPFISCRS VSKEKARAVQ CMRPRSGWFG GGGEVCASMM RALAWFGDSA ERHIGGARCF AYPMGTRARS GVAARRSPSL NPFSGVAVTV RNLDALFHPT AVAVVGASPR PGSVGAMVWA CVLDGRFGGA IWPVNPKYGE LNGHKVYPYV DQLPSAPSVA LICTPPATWP GIVRKLGGLG VRAAIIVGET RSGADRAALE RALAAAKPYL LRVVGPGSLG VVSPALGAHF GAPACIVKAG GVAWVSQSNA LTNAVLGWAH ARGLGFSHAV ALGGEADVDA ADVLDYLASD AETRAILLEL DTVKSARKFM SAARAAARNK PVLALRAGRG DSGDLLYTAA FQRAGMVRVD ALDDLLDEIE ALGVGRAAAT SGGVTLVTSD RGVAKLAVDA LAAAGETLAQ WPRAAVDEVG GALPAGIVAG NPLLLGDDAR PEYFGAALKA LAQHPPTGTV FVVHATSHSA PAVDVARVLI ESRKFARRGM LACFFGGVDA ATRDALHVHG IPVHTTPQRL ARAHARLVDY QLGRELLMQT PEGTPPQPAA SISAARHTAR AALAQGRDGF EGDAALEWLA GFGIERATDA DVDMGDTIVD ITVGMYDDPN FGPVFRYSVP PADGVSAPFV VYGLPPFNTV LARAVVARSP YAHRAPPEPL LQALTALSQA VCDVREIVEM SLVLRVRPTR VVALGPHIRL ATGRSRLAIV PYPRYLEQQL DWRGERITVR PIRPEDEAAH RELLSAMTPD DLRMRFFGAI RNFDHSQIAR MTQIDYDREM ALIATLDDAD GRAHTLGAVR AVTDPDNEAT EFAIAVRPDQ KGKGLGRMLM TRIIDYARSR GTAWMIGEAL RENTAMISLA KDSGFAVSST EEPGVVAFRL KLQP
|
| |