Gene BURPS668_A1739 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A1739 
Symbol 
ID4886243 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp1685024 
End bp1686304 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content69% 
IMG OID640131677 
ProductABC-type sugar transport system, periplasmic component 
Protein accessionYP_001062734 
Protein GI126445207 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAATACGA TACGATTGCT TGGCGCCGCC GCGCTCGTCG GCGCATGCGC GCCGCTCGCG 
GCGGCGGCCG CGACGCCAGT CTGCAAGGTG CCGACGCTGA AGGTCCTCGC GCAGAAGAGC
CTCGGGCTCT CGGTGATGGA GAAATCGCTG CCCGACTACG AGAAGACGAG CGGCACCCGG
ATCGAGATCA ATTACTTCGG CGAGAACGAC CGCCGTGCGA AATCGCGTCT CGACGCGTCG
ACGGGCGCGG GCTCGTATCA GATCTACTAC GTCGACGAGG CGAACGTCGC CGAATTCGCA
TCGGCCGGCT GGATCGCGCC GCTCCTCAAG TACTACCCGA AGGAATACGA TTACGACGAC
TTCCTGCCGG GCCGCCGCGC GGTGGCGAGC TACAAGGGCG TCGCGTACTT CGCGCCGCTC
ATCGGCGGGG GCGATTTCCT GTTCTACCGG CGCGACCTCC TCGACGCCGC GCACCTGCCG
GTGCCGAAGA CGCTGGACGA ACTCGTCGCC GCGGTCCGCA AGCTGAACGC GCCGCCGAAG
CTGTACGGCT GGGTCGCGCG CGGCCAGCGC GGCTCGGGCA TGAACGTGTG GCGCTGGGCG
CCGTTCATGC TCGCGCAGGG CGGCGCATGG ACCGACCCGC ACGGCCAGCC GGCGTTCAAC
TCGCCCTCCG CGGTGCAGGC GACCGAGCGC TACCGCGATC TCTTCAAGTA CGCGCCGCCG
GGCGCCGCGA CCTACGACTG GAGCAACGCG CTCGAAGCGT TCCGCTCGGG CAAGGTCGCG
TTCATGATCG AATCGACGCC GTTCGCCGAC TGGATGGAGG ACCCATCCAA GTCGAGCGTC
GCGGGCAAGG TCGGCTACGC GAGGCCGCCC GCGCCGCTGC CGTCGGCCGC TTACGGGCAC
GGGCTCGCGA TCTCGTCGGT CGGCGCGAAG GACGACTGCG CGCGGCAGGC GGCGGGCCGC
TTCATCGCAT GGGCGACGAG CAAGGAGCAG GAGCAGGCGC GGCTGCGCAA CGGCGTGTTC
AGCGACTACA ACCGCACGAG CACGATCGGC AGCGACTACT TCAGGCAGCA CGTGAAGCCG
CAGATCCTCG CCGGCCTGAA CGATACGAAC CCGGTGACGA AGGCGACGAT CTGGGCGACG
CCGCAATGGC CCGATATCGG CGACAACCTC GGCGTCGCGC TCGAGGAAGT CTTCACCGGC
ACGCAGACCG ACGTGCGCGG CGCGCTCGAC GACGCGGCGC AGTACGCGAA GGACGCGATG
GCGCACGGCG CGCGCAAGTG A
 
Protein sequence
MNTIRLLGAA ALVGACAPLA AAAATPVCKV PTLKVLAQKS LGLSVMEKSL PDYEKTSGTR 
IEINYFGEND RRAKSRLDAS TGAGSYQIYY VDEANVAEFA SAGWIAPLLK YYPKEYDYDD
FLPGRRAVAS YKGVAYFAPL IGGGDFLFYR RDLLDAAHLP VPKTLDELVA AVRKLNAPPK
LYGWVARGQR GSGMNVWRWA PFMLAQGGAW TDPHGQPAFN SPSAVQATER YRDLFKYAPP
GAATYDWSNA LEAFRSGKVA FMIESTPFAD WMEDPSKSSV AGKVGYARPP APLPSAAYGH
GLAISSVGAK DDCARQAAGR FIAWATSKEQ EQARLRNGVF SDYNRTSTIG SDYFRQHVKP
QILAGLNDTN PVTKATIWAT PQWPDIGDNL GVALEEVFTG TQTDVRGALD DAAQYAKDAM
AHGARK