Gene BURPS1106A_1921 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_1921 
Symbol 
ID4899324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp1875847 
End bp1877454 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content70% 
IMG OID640135151 
Productcarbohydrate ABC transporter ATP-binding protein 
Protein accessionYP_001066186 
Protein GI126453379 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1129] ABC-type sugar transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGTGAGCG GCCCGCGCGA CGAGGCCGAC GACATGGACG AAGCGAGCGG CGCGGCGCGC 
GCGCCTGACG AAGCGAGCGA GGAGGCGATG GACACGATAC TGGCGCTCAC CGGCATCACG
AAGCGGTTTC CCGGCGTCGT CGCGCTGCGC GGAATCGATC TGCGCGTCGC GCGCGGCGAG
ATTCACGCGC TGCTCGGCGA GAACGGCGCG GGCAAGTCGA CGCTGATGAA GATCCTCTGC
GGGATCCACC CGCCGGACGA GGGCGTGATC GCGCTCGACG GCGAGCCGCG GCGCTTCGCG
AACCATCACG ACGCGATCGC GGCGGGCGTC GGCATCGTGT TTCAGGAGTT CAGCCTGATT
CCGGAGCTGA ACGCGGTCGA CAATCTGTTC CTCGGCCGCG AGTGGCGCGG CCGGCTCGGG
CTGCGCGAGC GCGCGCGGAT GCGCCGCGCG GCGGCGGACA TCTTCGCGCG GCTCGACGTC
GCGATCGATC TGTCGGCGCC GGTGCGCGAG CTGTCGGTCG CGCAGCAGCA GTTCGTCGAG
ATCGGCAAGG CGCTGTCGCT CGACGCGCGG CTGCTGATCC TCGACGAGCC GACCGCGACG
CTCACGCCCG CCGAAGCCGC GCACCTGTTC GGCGTGATGC GCGAGCTCAA GCGCCGGGGC
GTCGCGATGA TCTTCATCTC GCACCACCTC GACGAGATCT TCGAGGTGTG CGACCGGATC
ACCGTGCTGC GCGACGGGCA ATACGTCGGC ACGACCGAGG TCGCGCGCAC CGATGTCGGC
GCGCTCGTCG AGATGATGGT GGGCCGGCGC ATCGAGCAGA GCTTTCCGCC GAAGCCGCGT
CTTGCGCGCG ACGCCGCGCC CGTGCTCGAG GTGGACGCGC TGCAGGTGCG CGAGAACGGC
CCCGTGAACC GCTTCGCGCT GCGCGAGGGC GAGATTCTCG GCTTCGCGGG CCTCGTCGGC
TCCGGGCGCA CGTCGAGCGC GCTCGCGCTG ATCGGCGCGA AGCCCGCGCG CGTGCGGCGA
ATGCGCGTGC GCGGCCGCCC GGTGCGCGTC GCCGATCCCG CCGCCGCGCT CGCCGCGGGC
ATCGGCCTCC TGCCGGAGAG CCGCAAGACG CAGGGGCTCA TCCCCGCGTT CTCGATCCGG
CACAACATCG CGATCAACAA CCTCGGCAAG CACCGCCGGC TGCGCTGGTT CGTCGACGCG
GCGGCCGAGA CGCGCACGAC GCTCGAGCTG ATGCAGCGGC TCGGCGTGAA GGCGCCGACG
CCGCACACGC GCGTCGACAC GCTCTCGGGC GGCAATCAGC AGAAGGTCGT GATCGCGCGC
TGGCTCAACC ATCACACGCG GATCCTGATC TTCGACGAGC CGACGCGCGG CATCGACATC
GGCGCGAAGG CGGAAATCTA TCAACTGATG CGCGAGCTGA GCGCGCGCGG CTATTCGATC
GTGCTGATCT CGAGCGAGTT GCCGGAGATC GTCGGCCTGT GCGATCGCGT CGCGGTGTTC
CGGCAGGGCC GTATCGAGGC GATGCTCGAA GGCGAGGCGA TCGAGCCGAA CACGGTGATG
ACCTATGCGA CTTCCGACGT ACGCGGAGCG AATCATGAAC ATGCATGA
 
Protein sequence
MVSGPRDEAD DMDEASGAAR APDEASEEAM DTILALTGIT KRFPGVVALR GIDLRVARGE 
IHALLGENGA GKSTLMKILC GIHPPDEGVI ALDGEPRRFA NHHDAIAAGV GIVFQEFSLI
PELNAVDNLF LGREWRGRLG LRERARMRRA AADIFARLDV AIDLSAPVRE LSVAQQQFVE
IGKALSLDAR LLILDEPTAT LTPAEAAHLF GVMRELKRRG VAMIFISHHL DEIFEVCDRI
TVLRDGQYVG TTEVARTDVG ALVEMMVGRR IEQSFPPKPR LARDAAPVLE VDALQVRENG
PVNRFALREG EILGFAGLVG SGRTSSALAL IGAKPARVRR MRVRGRPVRV ADPAAALAAG
IGLLPESRKT QGLIPAFSIR HNIAINNLGK HRRLRWFVDA AAETRTTLEL MQRLGVKAPT
PHTRVDTLSG GNQQKVVIAR WLNHHTRILI FDEPTRGIDI GAKAEIYQLM RELSARGYSI
VLISSELPEI VGLCDRVAVF RQGRIEAMLE GEAIEPNTVM TYATSDVRGA NHEHA