Gene BURPS1106A_3048 details


Gene Information

Locus tag: BURPS1106A_3048
Symbol:
ID: 4899418
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 2977346
End bp: 2978464
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 66%
IMG OID: 640136274
Product: carbohydrate ABC transporter ATP-binding protein
Protein accession: YP_001067287
Protein GI: 126454160
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3839] ABC-type sugar transport systems, ATPase components
TIGRFAM ID:
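
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS of the given length, excluding one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the "Start bp" and "End bp" fields of this record.
length = gene_length_bp(2977346, 2978464)
print(length, protein_length_aa(length))  # prints: 1119 372
```

Both results agree with the "Gene Length" and "Protein Length" fields listed in the record.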


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.061373
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAAGCC TTTCCATCCG TGACGTGTAC AAGACCTACC CGAACGGCGT GCCCGTCCTG 
AAGGGCGTCG ACATCGACAT CGAGGACGGT CAGTTCCTGA TTCTCGTCGG CGGCTCGGGC
TGCGGGAAGT CGACGCTGCT CAACATGATC GCGGGCCTCG AGACCGTGAC GAAGGGCGAG
ATCCGGATCG GCGATAAGGT CGTCAACGAT CTGTCGCCGA AGGATCGCGA CATCGCGATG
GTGTTCCAGT CGTACGCGCT CTATCCGTCG ATGACGGTGC GCGAGAACAT CTCGTTCGGG
CTGAACATCC GCAAGGTGCC GAAGAACGAG CAGAAGCAGA TCGTCGATCG CGTCGCCGCG
ATGCTGCAGA TCGAGCACCT GCTCGAGCGC AAGCCGGGGC AGCTCTCGGG CGGCCAGCGG
CAGCGCGTCG CGATGGGCCG CGCGCTCGCG CGCGACCCGG CGCTGTTCCT GTTCGACGAG
CCGCTGTCGA ACCTCGACGC GAAGTTGCGC ATCGAGATGC GCGCCGAGAT CAAGCTCTTG
CATCAGCGCC TCGGCACGAC GATCGTCTAC GTGACGCACG ACCAGATCGA GGCGATGACG
CTCGGCGACC GGATCGCGGT GATGAAGGAC GGTGTCGTTC AGCAGTTCGG CGCGCCGCAG
GACATCTACG ATTCGCCGTC GAACCTGTTC GTCGCCGGCT TCATCGGCGC GCCGCCGATG
AACTTCATCA ACGGCAAGCT CGTCGAGCAG GGCAGCGGCG TGGGCATCGA GCTCGATACG
GGCGCGATGC GCGGCGTGCT GAACCTGCCG TTCGACGCGA AGCGGATGAA CGGCCACGTC
GGCCGCGACG TGATCCTCGG CCTGCGGCCG GAGCGGATCA CCGATGCGCG TAGCGCGCAC
AACGGCGAGG GCGCGCGCCT GCAGCCCGTC GACGTGACGG TCGACGTGAC CGAGCCGACG
GGCCCCGACA CGCACGTGTT CGCCCAGGTC AACGGCAAGC GGATCGTGAG CCGCGTGCAT
CCGGCCGCGA ACCCGCAGCC GCAGCAGAAG CTGTCGCTGT TGTTCGACGT ATCGAAGGCG
GTGCTGTTCG ATCCGTCGAC GGAGGCGCGG ATCGCGTGA
 
Protein sequence
MASLSIRDVY KTYPNGVPVL KGVDIDIEDG QFLILVGGSG CGKSTLLNMI AGLETVTKGE 
IRIGDKVVND LSPKDRDIAM VFQSYALYPS MTVRENISFG LNIRKVPKNE QKQIVDRVAA
MLQIEHLLER KPGQLSGGQR QRVAMGRALA RDPALFLFDE PLSNLDAKLR IEMRAEIKLL
HQRLGTTIVY VTHDQIEAMT LGDRIAVMKD GVVQQFGAPQ DIYDSPSNLF VAGFIGAPPM
NFINGKLVEQ GSGVGIELDT GAMRGVLNLP FDAKRMNGHV GRDVILGLRP ERITDARSAH
NGEGARLQPV DVTVDVTEPT GPDTHVFAQV NGKRIVSRVH PAANPQPQQK LSLLFDVSKA
VLFDPSTEAR IA
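
The gene and protein sequences above are printed in blocks of ten characters, so they need to be cleaned before any analysis. The following is a minimal sketch in plain Python of how the blocked gene sequence could be joined, its GC fraction computed, and its translation compared with the protein sequence. The function names are hypothetical. The codon table is written out by hand; translation table 11 assigns the same amino acids as the standard code, and the sketch ignores alternative start codons, which does not matter here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for ten-character blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGGCAAGCC TTTCCATCCG TGACGTGTAC AAGACCTACC CGAACGGCGT GCCCGTCCTG"
dna = clean_sequence(first_line)
print(translate(dna))  # prints: MASLSIRDVYKTYPNGVPVL
print(round(gc_fraction(dna), 2))
```

The translated fragment matches the first twenty residues of the protein sequence. Passing the full gene sequence through the same functions gives the complete protein and a GC fraction to compare with the "GC content" field.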