Gene Information |
Locus tag | BURPS668_1539 |
Symbol | |
ID | 4883905 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 1498946 |
End bp | 1501738 |
Gene Length | 2793 bp |
Protein Length | 930 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 640127467 |
Product | alpha-amylase family protein |
Protein accession | YP_001058580 |
Protein GI | 126438912 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3280] Maltooligosyl trehalose synthase |
TIGRFAM ID | [TIGR02401] malto-oligosyltrehalose synthase |
|
|
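The coordinate and length fields in the Gene Information table can be cross-checked against each other. The short sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though the reported Gene Length is consistent with it) and that the CDS ends with a single stop codon that is not counted in the Protein Length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1498946
end_bp = 1501738
reported_gene_length = 2793      # bp
reported_protein_length = 930    # aa

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue plus one terminal stop codon.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 2793 0 930
```

Under these assumptions the three fields agree: 2793 bp is 931 codons, and dropping the stop codon leaves 930 residues.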
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.604212 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCCGC GCGCGACGCT GCGCCTGCAG TTGCATGCGG GCTTCACGTT CGACGACGCG GCCGCGCACG TCGGCTATTT CGCGCGGCTC GGCGTGAGCC ATCTGTATCT GTCGCCGATC ACGGCCGCGG AGCCGGGTTC GCGCCACGGC TACGATGTGA TCGATTATTC GACGGTCAAC CCCGAGCTCG GCGGCGAGGC GGCGTTCGTG CGGCTGATCG ATGCGTTGCG GCGGCGGGGC ATGGGCGCGA TCGTCGACAT CGTGCCGAAC CACATGGGCG TGGGCGGCTC GTCCAACCGC TGGTGGAACG ACGTGCTCGA ATGGGGCGCG CGCAGCCGCT TCGCGCGGCA TTTCGACATC GACTGGCACG CGAGCGACCC CGCGTTGCAG CGCAAGGTGC TGCTGCCCTG CCTCGGCCGC CCCTACGGCG AGGCGCTCGC CGCGGGCGAC ATCGCGCTGC GCGCGGACGC CGCGCACGGG CGGTTCGCGA TCGCATGCGC GGGCCGCACG CTGCCCGTGC AGATCGGCGC GTATCCGGAC ATCCTGCGCG CGGCGAACCG AAGCGATCTG AACGCGCTCG CCGAGCGCTT CGACGCGCCG GGCGCGCGGC CGTCGAACCA CGCACGCCTC GACGCGGCGC ACGCGGCGCT GCGCGACTAC GCCGCCGCGC GCGGGCCGGG CGCGCTCGAC GCGGTGCTGC ACGGCTTCGA TCCGCGCATC GCGCGCTCGC GCGAGATGCT GCACCGCCTG CTCGAGCAGC AGCATTACCG CCTGGCGTGG TGGCGCACCG CCACCGACGA AATCAACTGG CGCCGCTTTT TCGACATCTC GACGCTCGCC TGCATGCGCA TCGAGGACGC AGCCGTGTTC GACGACGTGC ATGCGCTGCT GTGGCGCCTC TACGCCGCGG GGCTCGTCGA CGGCGTGCGG ATCGATCACG TCGACGGGCT CGCGGATCCG CGCGGATACT GCCGGCAGTT GCGCGGCCGG CTCGCCGCGT TGCGCGACGG CGAACCGTAT ATCGTCGTCG AGAAGATCCT CGCGCCCGAC GAACGCTTGC CCGAAGACTG GCGCGTCGAC GGCACGACAG GCTACGACTT CATGAACGAC GTATCGGCGC TGCTGCACGA CGCCGCCGGC GCCGCGCCGC TCGCCGCGCT GTGGGCTGAC ATGACGGGCG CCGAGACGAC ATTCGCGCGC GAAGCGCTGG ACGGCAAGCG CCGCGTGCTC GCCCGGCAGT TCGCGGCCGA GCACGAGCGC GTCGCGCGTG CGATGCATCG GCTCGCGCGC GCATCGCGCG ACGCCCGCGA CTTCGCGCTC AATCCGATCC GCCGCGCGGT CGCCGAGCTC GCGATCCGGC TGCCGGTGTA CCGGCTGTAT CCGTCGGCGG GCGCGCCGCA GCGGACCGAT CGCGCGCTTC TCGCCGGCGC GTGGCAAGCG GCGCGCAGCG CGATCGCGCC GGCCGATCGC GACGCGCTCG ACTACGTCGC CGCGACGCTC GGCCTGCCGG GCGTCGCGCG CGCCGTCGCC GGCCTCGGCG ACCCGGCGCG GCTCGCCGCG CGCGTCGCGT TCGCGCAACT CACCGCGCCG CTCGCCGCGA AAGGCGTCGA GGACACCGCG TGCTATCGAT ACGGCAGGCT GTTGTCGCGC AACGAAGTCG GCGCGCACGC GGATGCGCTC TCGCTCGCGC CCGGCGCGTT CCACACGCGC AATCGCCGGC GGCAGCGAAC GTTCCCGGGC GCGCTGCTCG CCACCGCCAC GCACGACCAC AAGCGCGGCG AAGACGCGCG CGCGCGGCTC 
GCGGTCCTGA GCGAAGCGCA TCGCGCGTGG CGCGCGGCGG CGCTCGACTG GGCGGCGTTC AACGCCCCGC ACCGTCACGG CGCGCCCGCG GCGGCCGACC GGATACCGGG GCCCGCCGCC GAAGCGATGC TGTATCAGAC GCTCGTCGGC GCGTGGCCGC CCGCGCTCGC GCCCGACGAC GCGCCCGGCC TCGCCGCGCT GACGGACCGG GTCGAGCGCT GGCAATTGAA GGCGCTGCGC GAAGCGAAGC GCGACACCGA CTGGCTCGAA CCGAATCTCG GATACGAAGC CGGCTGCGCG GCGTTCCTGC GCGCGATCAT GACGCCGCGC GGGCCCGACG ATTTCGCTCA TCGGCTGCAC CGCCTCGTTG CGCGCATCGC GCCCGCGGGC ATCGTCAACA GCCTGTCGCA AGCCGCGCTG CGGCTGCTGT CGCCCGGCGT GCCGGATCTG TATCAGGGCG CGCAGACATG GGATCACACG CTCGTCGATC CCGACAATCG CGCCGACGTG CCGTTCGCCC GCTACGCGGC GCAGCGCATC GACGCGCCCG TCGCCGCGTA TCTGCGCGAC TGGGCCGACG GCCGCGTCAA GCACGCGCTG ATCGGCAGGC TGCTCGCGTT GCGCGCCGCG CACCCGGAGA CGTTCGCGGC GGGCGCTTAC GTGCCGCTGC ACGTGCGCGG CACGCGTCGC GGCCATGCGC TGGCGTTCGC GAGACGAGAC GCGTCGACGA CGATCGTCGT GATCGCGACG CGGCTCGCCT ACCCGCTGCT CGGCGACGCG CCGGCGCGCC CGTGCGTGGA GGCCGCATGC TGGGCGGACA CGGCGGTCGG GCTCGCGCCC GGCTTCGCCG GCCCGTGGCG CGACGTGCTG AACGACGGCA CGCTCGACGC GCCGTCGGGC ATGCTGCCGC TTGCCGCCGC GCTCGCGCAT CTGCCCGTCG CGGTGCTGAT TCGCGAGGGC GGCGCAGCGG ATACGCCGCG ACGCGGCGCT TGA
|
Protein sequence | MKPRATLRLQ LHAGFTFDDA AAHVGYFARL GVSHLYLSPI TAAEPGSRHG YDVIDYSTVN PELGGEAAFV RLIDALRRRG MGAIVDIVPN HMGVGGSSNR WWNDVLEWGA RSRFARHFDI DWHASDPALQ RKVLLPCLGR PYGEALAAGD IALRADAAHG RFAIACAGRT LPVQIGAYPD ILRAANRSDL NALAERFDAP GARPSNHARL DAAHAALRDY AAARGPGALD AVLHGFDPRI ARSREMLHRL LEQQHYRLAW WRTATDEINW RRFFDISTLA CMRIEDAAVF DDVHALLWRL YAAGLVDGVR IDHVDGLADP RGYCRQLRGR LAALRDGEPY IVVEKILAPD ERLPEDWRVD GTTGYDFMND VSALLHDAAG AAPLAALWAD MTGAETTFAR EALDGKRRVL ARQFAAEHER VARAMHRLAR ASRDARDFAL NPIRRAVAEL AIRLPVYRLY PSAGAPQRTD RALLAGAWQA ARSAIAPADR DALDYVAATL GLPGVARAVA GLGDPARLAA RVAFAQLTAP LAAKGVEDTA CYRYGRLLSR NEVGAHADAL SLAPGAFHTR NRRRQRTFPG ALLATATHDH KRGEDARARL AVLSEAHRAW RAAALDWAAF NAPHRHGAPA AADRIPGPAA EAMLYQTLVG AWPPALAPDD APGLAALTDR VERWQLKALR EAKRDTDWLE PNLGYEAGCA AFLRAIMTPR GPDDFAHRLH RLVARIAPAG IVNSLSQAAL RLLSPGVPDL YQGAQTWDHT LVDPDNRADV PFARYAAQRI DAPVAAYLRD WADGRVKHAL IGRLLALRAA HPETFAAGAY VPLHVRGTRR GHALAFARRD ASTTIVVIAT RLAYPLLGDA PARPCVEAAC WADTAVGLAP GFAGPWRDVL NDGTLDAPSG MLPLAAALAH LPVAVLIREG GAADTPRRGA
|
| |
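The relationship between the Gene sequence and Protein sequence fields can be spot-checked with a minimal translation sketch. It assumes the gene sequence is printed in the coding orientation (it begins with ATG and ends with TGA even though the Strand field is "-"), so no reverse complement is applied. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts, so a standard codon table is used here. The `translate` helper and the two excerpts (the first 30 and last 33 nucleotides of the record) are chosen for illustration.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, with bases in T, C, A, G order.
# Table 11 uses the same assignments; it only permits additional start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, ignoring whitespace between blocks."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Excerpts copied from the Gene sequence field.
first_30 = "ATGAAGCCGC GCGCGACGCT GCGCCTGCAG"
last_33 = "GGCGCAGCGG ATACGCCGCG ACGCGGCGCT TGA"

print(translate(first_30))  # MKPRATLRLQ
print(translate(last_33))   # GAADTPRRGA*
```

The first excerpt reproduces the opening ten residues of the Protein sequence field, and the second reproduces its final ten residues followed by the stop codon, shown as `*`.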