Gene BURPS1106A_A1426 details


Gene Information

Locus tag: BURPS1106A_A1426
Symbol:
ID: 4904347
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 1390276
End bp: 1391286
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 70%
IMG OID: 640144532
Product: putative carbohydrate ABC transporter, periplasmic sugar-binding protein
Protein accession: YP_001075460
Protein GI: 126457631
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1879] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
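
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1390276, 1391286)
print(length, protein_length_aa(length))  # 1011 336
```

Both results agree with the Gene Length and Protein Length values listed in the record.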


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.134002
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGTTCG CCGTCGCGCT CGCGATCGGC GCCGCCCCCG CTTGCGCGTC GTCCGCCGCC 
GGCGCTGCGC CGCCCGGGCC GCGCGCGGGC CATGCGCCGC TGTCGCTCGC CGGCAAGCGG
ATCGGCATCA CGGCGGCCGG CACCGATCAC TACTGGGATC TGCAGGCGTA CCAGGGCGCG
GTAGACGAAG TGAAGCGCCT CGGCGGCACG CCGATCGCGC TCGACGCCGG CCGCAACGAC
AGCCGCCAGA TCGCGCAGAT CCAGACGCTG ATCGCGCAAC AGCCCGATGC GATCATCGAG
CAGCTCGGCA CCGCATCCGT GCTCGAGCCG TGGCTCAGGA AAATCCGGCA AGCGGGCATC
CCGCTTTTCA CGATCGACAC CGCGTCGCCG TCGAGCCTGA ACGTCGTCAC GTCGGACAAT
TTCGCGATCG GCTCGCAGCT CGCGCTGAAG CTCGTCAACG ATATCCGCGG CGAAGGCAAC
GTCCTCGTGT TCAACGGCTT CTACGGCGTG CCCGTGTGCG CGATCCGCTA CGACCAGCTG
AAAGCCGTGC TGAAGTGGTA TCCGAAGGTG AAGATCATCG AGCCCGAGCT GCGCGACGTG
ATTCCGAACA CGGCGCAGAA CGCGTACGCG CAGATCAGCC AGTTGCTGCA GAAGTATCCG
AAAGGCACGA TCTCGGCGAT CTGGGCCGCG TGGGACATTC CGCAGGTCGG CGCGACGCAG
GCGGTCGACG CGGCCGGCCG ACGCGAGATC CGCACGTACG CGGTGGACGG CAGCCCCGAG
GCGGTCGCGC TCGTGAGGAA TCCGACCTCG AGCGCGGCGG CCGTCGTCGC GCAGCAGCCG
GCGCTGATCG GCCGCACCGC CGTGCGCAAC GTCGCGCGCT ATCTGGCGGG CGACCGATCG
CTGCCCGCGT ACACGTTCGT GCCGTCGGTG CTCGTCACGA AGGACGACGC GGGTGTCGCG
CGGCCTGCGC TCGGGCAGAC GCCGGCCGCC GCCGGGCTCG CGCGGCGATG A
 
Protein sequence
MLFAVALAIG AAPACASSAA GAAPPGPRAG HAPLSLAGKR IGITAAGTDH YWDLQAYQGA 
VDEVKRLGGT PIALDAGRND SRQIAQIQTL IAQQPDAIIE QLGTASVLEP WLRKIRQAGI
PLFTIDTASP SSLNVVTSDN FAIGSQLALK LVNDIRGEGN VLVFNGFYGV PVCAIRYDQL
KAVLKWYPKV KIIEPELRDV IPNTAQNAYA QISQLLQKYP KGTISAIWAA WDIPQVGATQ
AVDAAGRREI RTYAVDGSPE AVALVRNPTS SAAAVVAQQP ALIGRTAVRN VARYLAGDRS
LPAYTFVPSV LVTKDDAGVA RPALGQTPAA AGLARR
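
The relationship between the gene sequence, the protein sequence, and the GC content field can be explored with a short script. This is a minimal sketch, not the method the database used. It assumes sequences are given as space-separated blocks of 10 characters, as shown above. It also relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard table and differs only in its permitted start codons. Since this gene starts with ATG, the sketch does not handle alternative starts. The helper names are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGCTGTTCG CCGTCGCGCT CGCGATCGGC"
print(translate(first_blocks))  # MLFAVALAIG
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `gc_content` gives a fraction that can be compared with the GC content field in the record.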