Gene BURPS668_1539 details


Gene Information

Locus tag: BURPS668_1539
Symbol:
ID: 4883905
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 1498946
End bp: 1501738
Gene Length: 2793 bp
Protein Length: 930 aa
Translation table: 11
GC content: 74%
IMG OID: 640127467
Product: alpha-amylase family protein
Protein accession: YP_001058580
Protein GI: 126438912
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3280] Maltooligosyl trehalose synthase
TIGRFAM ID: [TIGR02401] malto-oligosyltrehalose synthase
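
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; both assumptions are consistent with the values listed here but are not stated in the record. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length = gene_length_bp(1498946, 1501738)
print(length)                     # 2793
print(protein_length_aa(length))  # 930
```

Both results agree with the Gene Length and Protein Length fields.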


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.604212
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCCGC GCGCGACGCT GCGCCTGCAG TTGCATGCGG GCTTCACGTT CGACGACGCG 
GCCGCGCACG TCGGCTATTT CGCGCGGCTC GGCGTGAGCC ATCTGTATCT GTCGCCGATC
ACGGCCGCGG AGCCGGGTTC GCGCCACGGC TACGATGTGA TCGATTATTC GACGGTCAAC
CCCGAGCTCG GCGGCGAGGC GGCGTTCGTG CGGCTGATCG ATGCGTTGCG GCGGCGGGGC
ATGGGCGCGA TCGTCGACAT CGTGCCGAAC CACATGGGCG TGGGCGGCTC GTCCAACCGC
TGGTGGAACG ACGTGCTCGA ATGGGGCGCG CGCAGCCGCT TCGCGCGGCA TTTCGACATC
GACTGGCACG CGAGCGACCC CGCGTTGCAG CGCAAGGTGC TGCTGCCCTG CCTCGGCCGC
CCCTACGGCG AGGCGCTCGC CGCGGGCGAC ATCGCGCTGC GCGCGGACGC CGCGCACGGG
CGGTTCGCGA TCGCATGCGC GGGCCGCACG CTGCCCGTGC AGATCGGCGC GTATCCGGAC
ATCCTGCGCG CGGCGAACCG AAGCGATCTG AACGCGCTCG CCGAGCGCTT CGACGCGCCG
GGCGCGCGGC CGTCGAACCA CGCACGCCTC GACGCGGCGC ACGCGGCGCT GCGCGACTAC
GCCGCCGCGC GCGGGCCGGG CGCGCTCGAC GCGGTGCTGC ACGGCTTCGA TCCGCGCATC
GCGCGCTCGC GCGAGATGCT GCACCGCCTG CTCGAGCAGC AGCATTACCG CCTGGCGTGG
TGGCGCACCG CCACCGACGA AATCAACTGG CGCCGCTTTT TCGACATCTC GACGCTCGCC
TGCATGCGCA TCGAGGACGC AGCCGTGTTC GACGACGTGC ATGCGCTGCT GTGGCGCCTC
TACGCCGCGG GGCTCGTCGA CGGCGTGCGG ATCGATCACG TCGACGGGCT CGCGGATCCG
CGCGGATACT GCCGGCAGTT GCGCGGCCGG CTCGCCGCGT TGCGCGACGG CGAACCGTAT
ATCGTCGTCG AGAAGATCCT CGCGCCCGAC GAACGCTTGC CCGAAGACTG GCGCGTCGAC
GGCACGACAG GCTACGACTT CATGAACGAC GTATCGGCGC TGCTGCACGA CGCCGCCGGC
GCCGCGCCGC TCGCCGCGCT GTGGGCTGAC ATGACGGGCG CCGAGACGAC ATTCGCGCGC
GAAGCGCTGG ACGGCAAGCG CCGCGTGCTC GCCCGGCAGT TCGCGGCCGA GCACGAGCGC
GTCGCGCGTG CGATGCATCG GCTCGCGCGC GCATCGCGCG ACGCCCGCGA CTTCGCGCTC
AATCCGATCC GCCGCGCGGT CGCCGAGCTC GCGATCCGGC TGCCGGTGTA CCGGCTGTAT
CCGTCGGCGG GCGCGCCGCA GCGGACCGAT CGCGCGCTTC TCGCCGGCGC GTGGCAAGCG
GCGCGCAGCG CGATCGCGCC GGCCGATCGC GACGCGCTCG ACTACGTCGC CGCGACGCTC
GGCCTGCCGG GCGTCGCGCG CGCCGTCGCC GGCCTCGGCG ACCCGGCGCG GCTCGCCGCG
CGCGTCGCGT TCGCGCAACT CACCGCGCCG CTCGCCGCGA AAGGCGTCGA GGACACCGCG
TGCTATCGAT ACGGCAGGCT GTTGTCGCGC AACGAAGTCG GCGCGCACGC GGATGCGCTC
TCGCTCGCGC CCGGCGCGTT CCACACGCGC AATCGCCGGC GGCAGCGAAC GTTCCCGGGC
GCGCTGCTCG CCACCGCCAC GCACGACCAC AAGCGCGGCG AAGACGCGCG CGCGCGGCTC
GCGGTCCTGA GCGAAGCGCA TCGCGCGTGG CGCGCGGCGG CGCTCGACTG GGCGGCGTTC
AACGCCCCGC ACCGTCACGG CGCGCCCGCG GCGGCCGACC GGATACCGGG GCCCGCCGCC
GAAGCGATGC TGTATCAGAC GCTCGTCGGC GCGTGGCCGC CCGCGCTCGC GCCCGACGAC
GCGCCCGGCC TCGCCGCGCT GACGGACCGG GTCGAGCGCT GGCAATTGAA GGCGCTGCGC
GAAGCGAAGC GCGACACCGA CTGGCTCGAA CCGAATCTCG GATACGAAGC CGGCTGCGCG
GCGTTCCTGC GCGCGATCAT GACGCCGCGC GGGCCCGACG ATTTCGCTCA TCGGCTGCAC
CGCCTCGTTG CGCGCATCGC GCCCGCGGGC ATCGTCAACA GCCTGTCGCA AGCCGCGCTG
CGGCTGCTGT CGCCCGGCGT GCCGGATCTG TATCAGGGCG CGCAGACATG GGATCACACG
CTCGTCGATC CCGACAATCG CGCCGACGTG CCGTTCGCCC GCTACGCGGC GCAGCGCATC
GACGCGCCCG TCGCCGCGTA TCTGCGCGAC TGGGCCGACG GCCGCGTCAA GCACGCGCTG
ATCGGCAGGC TGCTCGCGTT GCGCGCCGCG CACCCGGAGA CGTTCGCGGC GGGCGCTTAC
GTGCCGCTGC ACGTGCGCGG CACGCGTCGC GGCCATGCGC TGGCGTTCGC GAGACGAGAC
GCGTCGACGA CGATCGTCGT GATCGCGACG CGGCTCGCCT ACCCGCTGCT CGGCGACGCG
CCGGCGCGCC CGTGCGTGGA GGCCGCATGC TGGGCGGACA CGGCGGTCGG GCTCGCGCCC
GGCTTCGCCG GCCCGTGGCG CGACGTGCTG AACGACGGCA CGCTCGACGC GCCGTCGGGC
ATGCTGCCGC TTGCCGCCGC GCTCGCGCAT CTGCCCGTCG CGGTGCTGAT TCGCGAGGGC
GGCGCAGCGG ATACGCCGCG ACGCGGCGCT TGA
 
Protein sequence
MKPRATLRLQ LHAGFTFDDA AAHVGYFARL GVSHLYLSPI TAAEPGSRHG YDVIDYSTVN 
PELGGEAAFV RLIDALRRRG MGAIVDIVPN HMGVGGSSNR WWNDVLEWGA RSRFARHFDI
DWHASDPALQ RKVLLPCLGR PYGEALAAGD IALRADAAHG RFAIACAGRT LPVQIGAYPD
ILRAANRSDL NALAERFDAP GARPSNHARL DAAHAALRDY AAARGPGALD AVLHGFDPRI
ARSREMLHRL LEQQHYRLAW WRTATDEINW RRFFDISTLA CMRIEDAAVF DDVHALLWRL
YAAGLVDGVR IDHVDGLADP RGYCRQLRGR LAALRDGEPY IVVEKILAPD ERLPEDWRVD
GTTGYDFMND VSALLHDAAG AAPLAALWAD MTGAETTFAR EALDGKRRVL ARQFAAEHER
VARAMHRLAR ASRDARDFAL NPIRRAVAEL AIRLPVYRLY PSAGAPQRTD RALLAGAWQA
ARSAIAPADR DALDYVAATL GLPGVARAVA GLGDPARLAA RVAFAQLTAP LAAKGVEDTA
CYRYGRLLSR NEVGAHADAL SLAPGAFHTR NRRRQRTFPG ALLATATHDH KRGEDARARL
AVLSEAHRAW RAAALDWAAF NAPHRHGAPA AADRIPGPAA EAMLYQTLVG AWPPALAPDD
APGLAALTDR VERWQLKALR EAKRDTDWLE PNLGYEAGCA AFLRAIMTPR GPDDFAHRLH
RLVARIAPAG IVNSLSQAAL RLLSPGVPDL YQGAQTWDHT LVDPDNRADV PFARYAAQRI
DAPVAAYLRD WADGRVKHAL IGRLLALRAA HPETFAAGAY VPLHVRGTRR GHALAFARRD
ASTTIVVIAT RLAYPLLGDA PARPCVEAAC WADTAVGLAP GFAGPWRDVL NDGTLDAPSG
MLPLAAALAH LPVAVLIREG GAADTPRRGA
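
The relationship between the gene sequence and the protein sequence can be spot-checked by translating the first and last lines of the gene sequence. This is a minimal sketch, not the database's own pipeline: it builds the codon table by hand, using the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (it differs in permitted start codons). The names `translate`, `gc_percent`, `FIRST_60_NT`, and `LAST_33_NT` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; whitespace is ignored, trailing partial codons dropped."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases; whitespace is ignored."""
    seq = "".join(dna.split()).upper()
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence, copied from the record above.
FIRST_60_NT = "ATGAAGCCGC GCGCGACGCT GCGCCTGCAG TTGCATGCGG GCTTCACGTT CGACGACGCG"
LAST_33_NT = "GGCGCAGCGG ATACGCCGCG ACGCGGCGCT TGA"

print(translate(FIRST_60_NT))  # MKPRATLRLQLHAGFTFDDA
print(translate(LAST_33_NT))   # GAADTPRRGA*
```

The first result matches the first 20 residues of the protein sequence, and the second matches its last 10 residues followed by the stop codon. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.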