Gene Information |
Locus tag | BMAA1022 |
Symbol | |
ID | 3086978 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei ATCC 23344 |
Kingdom | Bacteria |
Replicon accession | NC_006349 |
Strand | + |
Start bp | 1059105 |
End bp | 1061756 |
Gene Length | 2652 bp |
Protein Length | 883 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 637564924 |
Product | putative polyketide synthase |
Protein accession | YP_105691 |
Protein GI | 53716904 |
COG category | [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism; [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
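
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. Both are assumptions about this record's conventions, and the function names are illustrative only.

```python
# Values copied from the record above; function names are hypothetical.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1059105, 1061756)
print(length, protein_length_aa(length))  # prints "2652 883"
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.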
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.160316 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCGGCGCG GCGAGGACGC GGCCGACGGC AGCCTCACCG TGTCGTTTGC GCTGCACGAG CGGCTCTGGT TTCTCGATGA ACACCGGATC TTCGACGGCG CGCCGGTGCT GCCCGGCACC GCCTGCATCG AACTCGTGCG GCGCGCGTAT TCGCTCGTGC GCCCGGGCGC GGCGGTGACG ATGCGCGACG TCTATTTTCC GACGGCGCTG ATTCTGTCGA CGGACGAATC GCGCAACGTG CGCGTCGTGT TCCGGCCGCC GGAACGCGTG TCGGACGGCG CGGCCGCCGG GGGCGACCTC GCGTTCGTGC TCGAATCGAA CGACGCGAAC GGGCCCGCCG GATGGACGCC GCACGCGAGC GGGCGCATCG GGGACGATCC GCCGGCGTGC GCCGCGCCCG CTTCGCTCGC CGCGCTTGCC GCTCTCGACG CTCCGGCCGC GCTGCGCGAG CAATGGGGGC TCACGGCGCT GGACGACGTC GCCGCGCTGT CCGCGCAGGC GTTCGCCGAT TACGGGGCGC GCTGGCACGG CGTCGACGCG CTCTGGCTCG GCGAACGGGC CGGGCTCGCG CGGCTCAGGC TGCCGGCCGC CGGCAGCGGC GACTTGCCCG ATTTCGCGCT GCATCCGGCC ATGCTCGACG TCGCCACGGC GTTCCTGCCC GCCTGCCTGC GCCCGCGCGA CGCGTCGGTG CCGTTCCGCT ACGAATCGAT CCGCATGCAC CGGCCGCTGC GCGCCGATTG CTACAGCTTC GCCGTCGAGA CCGCGCCGAA CGTGTACGAC GTCACGCTGT TCGCATGGGA CGAGGCCGCG CGGCGCGCCG ACGTGCTCGT CGCGATCGGC GGCTTCGCGC GCCGCGAGCC GGCGCACCGG GCGCGCGACG TCGCGCAGTG GTGCCGCACG GTGAGCTGGC GCGACGCGCC CGCGGCGCGC GCGTTGCCGC CCGAGCGCTG GCTCGTGTTC GGCGACGAAT GGTTCGCGCT CGCGCCCGCC GGCAGCGTGC TCGTGCGCGA GGACGACGCG TTTCGCGCGC ACGGCGACAA CGGCTATGGC GTGCGCCCGG GCGAGAAGGC CGATTGCGAC CGGCTGATCG CGCGGCTCGC CGAGCAGGGC GGCGTGCCGG CGCACGTCGT CTACGGCTGG GCGCAGACGG ACGTCGATCG CGCGTTCGCC GGGCTCGCCG CGTTGCTGCA GGCGCTGGGC GCGCATCCCG CCGATTTTCG CGTGTCGCTC GTGACGAAGG GCGCGCGCTC GGCGCGCACG CTGGACGCAT GCGCGGCCGC CGCGCCGGCG GGCTTGCTCA AGGCGGTGCG CTGGGAATAT CCGCGCATCG TGTGCCGCCA CATCGATATC GACGATGCAA GCGACGCGAC GATCGACGCG TTGCGCGCGG AGCTGTCGTC CGAGCCCGCC ACGCCGCCCG GCGCGCCGCC CGAGCTGCCG AGCAGCATCG CGCTCGCCGG CGCGCGCCGC GAGGCGCCCG GCTTCGCGGC GCTGCCGGAC GTCGCGCGCG ACGACGTCCT GCGCGACGGC GGTGCGTATC TGATCACGGG CGGCGCGAGC GGCATCGGCC TCGAGCTGGC GGCGCACATC GCGTCGCGGC GGCGGGACGT GAGGCTGGCG TTGCTGAGCC GCTCGCCGCA TGACGAAAAC GCCGCGCGGT TCGCCGCGCT GGACGAGGCC GCCGCGAGCG TGCTGCGGTT GACGGCCGAC GTCGCGCACG CCGCGCAGCT CGCCGACGCG CTGCGCACGG TGCGCGCGCG CTTCGGGCGC ATCGACGGCG TCATCCATGC GGCCGGCGTC 
GAGGCGAGCG GCCTGCTCGA AACCGGCACG CCCGACGCAT GGCGGCGCGT GATGGCGGCG AAGGTTCACG GCGCGCGACA CCTGTTCGAC CAACTGGCCG GCGATCCGCC CGATTTCATC GTGCTCTGCT CGTCGCTCGC CGCGGTCGTC GGCGGCCTCG GGCAGGCCGA CTACGCGGCG GCGAACGGTT ACATGGACGC GCTCGCGCAG CACTGGCGCC AACGCGGCGT CGCGGCCATC GCGATCGATT GGGATACGTG GTCGGACACG GGCATGGCGT TCGACCACGC GGCGCGCACG CGCCGCTCGA ATGACCGCCC GGGCGCGCTG CCCGGCCTCG CGAACCGCGA AGGGCGGGCG CTCTTCGATC TCGCGCTCGC GCACGACGCG CCGCGCATCG TTATCAGCAA GCGGGGCTTC GAACAGGACC GGCGCGACGC GCCCACGCGC GCGCGGCGCG CGGCCGCCCC GGGCGACGCG CAGGCGGCGC TCGTCGCGCT CTGGCAGGAA CTGCTGGGCG TCGAGCAGGT GGGCGTCGAC GACGACTTCT TCGATCTGGG CGGCCATTCG CTGCTCGCGA CGCAGTTGAT TTCCCGCGTG CGCGATCAGT ACGCGCGCAG CCCGACGCTC GGCGAATTCC TGGAGGAGCC GACGATCGCG CGGCTACTGC GCGCGATCGA CCATACGGGC GGCGACACGG GCGGCGACAT GAGCGGCGAC ACCGCCGGCG ACGCGCCCGA CGTCGACGAG ACGCTGCGCT ATTGCGTGGT GCCGATGGTG AAGGCCGGCA GCGGCGCGCC GTTCTTCTGC ATTCCCGGCA TGGGCGGCAA CATCACGCAG TTGCTGCCGT GA
Protein sequence | MRRGEDAADG SLTVSFALHE RLWFLDEHRI FDGAPVLPGT ACIELVRRAY SLVRPGAAVT MRDVYFPTAL ILSTDESRNV RVVFRPPERV SDGAAAGGDL AFVLESNDAN GPAGWTPHAS GRIGDDPPAC AAPASLAALA ALDAPAALRE QWGLTALDDV AALSAQAFAD YGARWHGVDA LWLGERAGLA RLRLPAAGSG DLPDFALHPA MLDVATAFLP ACLRPRDASV PFRYESIRMH RPLRADCYSF AVETAPNVYD VTLFAWDEAA RRADVLVAIG GFARREPAHR ARDVAQWCRT VSWRDAPAAR ALPPERWLVF GDEWFALAPA GSVLVREDDA FRAHGDNGYG VRPGEKADCD RLIARLAEQG GVPAHVVYGW AQTDVDRAFA GLAALLQALG AHPADFRVSL VTKGARSART LDACAAAAPA GLLKAVRWEY PRIVCRHIDI DDASDATIDA LRAELSSEPA TPPGAPPELP SSIALAGARR EAPGFAALPD VARDDVLRDG GAYLITGGAS GIGLELAAHI ASRRRDVRLA LLSRSPHDEN AARFAALDEA AASVLRLTAD VAHAAQLADA LRTVRARFGR IDGVIHAAGV EASGLLETGT PDAWRRVMAA KVHGARHLFD QLAGDPPDFI VLCSSLAAVV GGLGQADYAA ANGYMDALAQ HWRQRGVAAI AIDWDTWSDT GMAFDHAART RRSNDRPGAL PGLANREGRA LFDLALAHDA PRIVISKRGF EQDRRDAPTR ARRAAAPGDA QAALVALWQE LLGVEQVGVD DDFFDLGGHS LLATQLISRV RDQYARSPTL GEFLEEPTIA RLLRAIDHTG GDTGGDMSGD TAGDAPDVDE TLRYCVVPMV KAGSGAPFFC IPGMGGNITQ LLP
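
The GC content field can in principle be recomputed from the gene sequence. The sketch below is one way to do that for a sequence printed in space-separated blocks of ten bases, as in this record. The function name is hypothetical, and the example input is only the first three blocks of the gene sequence, so its result is not the whole-gene figure.

```python
def gc_fraction(grouped_sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence.

    Whitespace (block separators, line breaks) is ignored, so a sequence
    can be pasted in as it appears in the record.
    """
    bases = "".join(grouped_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First three blocks of the gene sequence above (30 bases, not the full gene).
prefix = "ATGCGGCGCG GCGAGGACGC GGCCGACGGC"
print(f"{gc_fraction(prefix):.1%}")
```

Passing the full gene sequence instead of the prefix would give the value to compare against the GC content field.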