Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A0269 |
Symbol | |
ID | 4887708 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 236719 |
End bp | 239526 |
Gene Length | 2808 bp |
Protein Length | 935 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640130210 |
Product | putative non-ribosomal peptide synthase |
Protein accession | YP_001061275 |
Protein GI | 126442941 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II [COG3319] Thioesterase domains of type I polyketide synthases or non-ribosomal peptide synthetases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.977076 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGCAT CCGCCCTCGA TTTACCGCGC GATTGCGAAC ACGCATTGCG CGCCGCTTCG CCGCCAAACA TCGTCGACCT GCTGTTGCGG GCCGCACGGC TGCATCCGCA TACGGGCGTG CGCTTCATCG CCGCCGAATC CGAACACAAG GGCGCCTTCG TCACGTATCC CGAGCTGCTC GACGAGGCGC GCCGCATCCT GGGCGGCCTG CGCGCGCGCG GCTATCGGTC CGGCATGAAG GTCGCGCTGC TGCTCGAGCA CGCGAGCGAT TTCATTCCGG CGTTCTGGGC CTGCGCGCTC GGCGGCTTCG TGCCGTGCCC GCTCGTGCCG ATCCGCAACG ATCCCGAGCG CTGGGCGAAG CACCTCGCGC ACGTCGATAC GCTGCTCGAC CATCCGCTGC TCGTCACCAC CGAAGCGCTG AACAACGATC TGCCGGGCGG CGCGTCGGCC GTCAACCTGA ACGCGCTGCG CGCGAGCCTG CCCGATGCGT CGACGCACGT CGCGCAACCG TCGGACCCGG CGGTTTTCGT GCTCACGTCG GGCTCCACCG GCAATTCGAA GGCGGTCGTG CTCACGCACG GCAACCTGCT CGCGTCGATG GCGGGCAAGA ACGATCGGCA GCAGCTCGCG GGCGCGGACG TCACGCTCAA CTGGATCTCG TTCGACCACG TCGCCGCGCT GCTCGAAGCG CACCTGCTGC CGCTGTACGT CGGCGCCGTG CAGCTTCACG TCGAAGCCGC GGCGGTTCTC ACCGATCCGC TGCGCTTCTT GCGGCTCGTC AGCCGCTATC GCGTGACGAT GACGTTCTCG CCGAACTTCC TGTTCGGGCA ACTGAACGCC GCGCTCGAAG CGATGGGCGA CGAGGCGCTC GCCGCGTGGC GCGGCGCGGT GGATCTGTCG TCGCTGCGGC ACGTCGTGTC GGGCGGCGAG GCGATCGTCG TCGCGACCGG GCAGCGTTTT CTCGATCTGC TCGCGCCGTG CGGCCTCGCG CGCGATGCGC TGTGGCCCGC GTTCGGGATG ACGGAGACGT GCGCCGGCTC CGTGTATTCG CGCGAGTTCC CGGAAGGCGA CGCGGGCCGC GAGTTCGCAT CGCTCGGCCT GCCGGTGGCC GGGCTGCAGA TGCGCATCGC GGACGACCGC AACAACGTGC TGCCGGAAGG CGAGGCGGGC GAGTTCCAGG TGCGCGGCCC GATGATCTTC CAGCGCTATC ACAACAATGC CGAGGCGACG CGCGCGGCGT TCACGAGCGA CGGCTGGTTC CGCACGGGCG ACCTCGGGCG CATCGAGCGC GGCCGGCTGT GGCTCGTCGG CCGCAGCAAG GACAGCATCA TCGTCAACGG CGTCAATTAC TTCAGCCACG AGCTGGAGAC GACTCTCGAG GCGCTCGACG GCGTCAAGCC CTCGTTCGTC GCGGCGTTTC CGACGCGCGG GGCCGGCGAC GAATCCGAGC AACTCGTCGT CACGTTCACG CCGTCGTTTC CGCTCGACGA CGAGGACGCG CTGTATCGCC TCGTCATCGC GATCCGCAAC AGCACGATCC TGCTGTGGGG CTTCCGGCCC GCGCTGATCC TGCCGCTGCC GGAGGACGAA TTCCCGAAGA CGAGCCTCGG CAAGACCCAG CGCGCGATCA TGCGCAAGCG CCTCGAAGCG GGCAGCTACG ACGGCTACAA GGCGCGCGTC GCCGATCTCG CGAACCGGCA GATGGGCGGC TATGTCGCGC CCGACGGGCA GACCGAGGCC GCGGTGGCCG CGATCTTCGC GCGGATGTTC CAGGTCGCGC CCGAGGCGAT CAGCGCGACC GCGAGCTTCT TCGATCTCGG CGGCACGTCG CTCGACATCC TGAAGCTCAA GCGCCACGTC GAACAGCGGC TCGGCGTGAT CGACCTGCCG ATCGTGACGA TCCTCCAGAA CCCGAGCGTG CGCGCGCTGG CCGCGCGTCT CGCCCCGGGC GAGCGCGTGG CGGCGGGCGA ATACGATCCG GTCGTGCCGT TGCAGCTCAC CGGCGGCAAG ACGCCGCTAT TCTGCGTGCA CCCCGGCGTC GGCGAGGTGC TCGTGTTCGT CAACCTCGCG AAGTACTTCG TCAACGAGCG CCCGTTCTAC GCATTGCGCG CGCGCGGCTT CAACGAAGGG GAGACGTATT TCTCCAGCTT CGACGAAATG GTGAACACGT ATGTCGACGC GATCCGCAAG CGGCAGCCGC ACGGGCCGTA CGCGGTGGCC GGCTATTCGT ACGGCGGCGC GGTCGCGTTC GAGATCGCGA AGGTGCTCGA AGCGCAGGGC GAGCGGGTGG ATTTCGTCGG CAGCTTCAAT CTGCCGCCGC ACATCAAGTA CCGGATGGAC GAGCTCGACG AGGTGGAGGG CGCGGTCAAC CTCGCGTTCT TCCTGTCGCT GATCGACAAG CAGCAGTCGC TCACGCTGCC GCCGCAACTG CGCGCGGCGA TGCCGGAGCA AGACCCGCTC GCGTACCTGA TCGACCACGC GCCGCCCGGG CGGCTCGTCG AGCTCGACCT CGATCTGCCG AAATTCCGCG CGTGGGCGGG GCTCGCGCAA TCGCTGCTCA CGCTCGGGCG TTCGTACGCG CCGTCGGGCA GCGTGCGGGC GATGTCGATC TTCTATGCGA TTCCGCTGCG CGGCACGAAG GACGACTGGC TGAACAAGGA ACTGCGCCGC TGGGACGAGT TCACGCGCGC GCCGAACCGC TATATCGACG TGGCGGGCGA ACACTACACG CTGATGGGGC CCGCGCACGT CGCGACGTTC CAGGCGGTGC TGCGGGCCGA GCTCGATCGC GCGCTCGGCG GCAAATGA
|
Protein sequence | MTASALDLPR DCEHALRAAS PPNIVDLLLR AARLHPHTGV RFIAAESEHK GAFVTYPELL DEARRILGGL RARGYRSGMK VALLLEHASD FIPAFWACAL GGFVPCPLVP IRNDPERWAK HLAHVDTLLD HPLLVTTEAL NNDLPGGASA VNLNALRASL PDASTHVAQP SDPAVFVLTS GSTGNSKAVV LTHGNLLASM AGKNDRQQLA GADVTLNWIS FDHVAALLEA HLLPLYVGAV QLHVEAAAVL TDPLRFLRLV SRYRVTMTFS PNFLFGQLNA ALEAMGDEAL AAWRGAVDLS SLRHVVSGGE AIVVATGQRF LDLLAPCGLA RDALWPAFGM TETCAGSVYS REFPEGDAGR EFASLGLPVA GLQMRIADDR NNVLPEGEAG EFQVRGPMIF QRYHNNAEAT RAAFTSDGWF RTGDLGRIER GRLWLVGRSK DSIIVNGVNY FSHELETTLE ALDGVKPSFV AAFPTRGAGD ESEQLVVTFT PSFPLDDEDA LYRLVIAIRN STILLWGFRP ALILPLPEDE FPKTSLGKTQ RAIMRKRLEA GSYDGYKARV ADLANRQMGG YVAPDGQTEA AVAAIFARMF QVAPEAISAT ASFFDLGGTS LDILKLKRHV EQRLGVIDLP IVTILQNPSV RALAARLAPG ERVAAGEYDP VVPLQLTGGK TPLFCVHPGV GEVLVFVNLA KYFVNERPFY ALRARGFNEG ETYFSSFDEM VNTYVDAIRK RQPHGPYAVA GYSYGGAVAF EIAKVLEAQG ERVDFVGSFN LPPHIKYRMD ELDEVEGAVN LAFFLSLIDK QQSLTLPPQL RAAMPEQDPL AYLIDHAPPG RLVELDLDLP KFRAWAGLAQ SLLTLGRSYA PSGSVRAMSI FYAIPLRGTK DDWLNKELRR WDEFTRAPNR YIDVAGEHYT LMGPAHVATF QAVLRAELDR ALGGK
|
| |