Gene BURPS1710b_A0214 details


Gene Information

Locus tag: BURPS1710b_A0214
Symbol:
ID: 3692175
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1710b
Kingdom: Bacteria
Replicon accession: NC_007435
Strand:
Start bp: 323769
End bp: 325139
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 68%
IMG OID: 637730468
Product: L-arabinose ABC transporter, periplasmic L-arabinose-binding protein
Protein accession: YP_335373
Protein GI: 76819353
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1879] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
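
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 323769
end_bp = 325139

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.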


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.532172
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCTTTTCC CGCGACGCCC CGCCGCCTTC GACCTTCGCC CTTCGACCTT CGACCTTCGA 
CCTTCGACCT TCGACCTTCG ACCTCAGACT TCAGACCTTC GACGTCCGAC GTCCGACGTC
GGACCTTCGA CCCCGACGCC CGACATCGGA CATCGGACAT CGGACATCGG ACATCGGACA
TCCGGCATCC AAACTCGACA TCCGCCCCAC CCCTCGCCTC CCCACCGCCC GATCCCATTC
TTTAGGAACC CCTCGCAATA TTTGGAAGCC CGCCGACGCG CCTCCTACAA TCAGCCGCAC
GACATGCATG CCGTTGCGCG GCTCGCAAGG CCGCATCCAT ATCGAAAACG GAGCACGAAG
GAGACGCTCA TGGGATTGCG CTGGCCCCAA GCCGCCCTCG TCTGCGCGAG CCTCGCCGCC
GGTTTGTCGG CGGCGGCGCC CGCGCATGCG CAAGGCGCGG CCCCGGTGAA GATCGGCTTC
GTCGTCAAGC AGCCCGACGA CCCGTGGTTT CAGGACGAAT GGCGCTTCGC CGAGCAGGCG
GCGAAGGACA AGCACTTCAC GCTCGTGAAG ATCGCCGCGC CGAGCGGCGA GAAGGTGTCG
ACCGCGCTCG ACAGCCTCGC CGCGCAAAAG GCGCAGGGTG TGATCATCTG CGCGCCCGAC
GTGAAGCTCG GCCCCGGCAT CGCCGCGAAG GCGAGGCGCT ACGGGATGAA GCTGATGTCG
GTCGACGATC AGCTCGTCGA CGGGCGCGGC GCGCCGCTCG CCGACGTGCC GCACATGGGC
ATTTCCGCAT ACCGGATCGG CCGGCAGGTC GGCGACGCGA TCGCCGCCGA GGCGAAGCGG
CGCGGCTGGA ATCCGGCCGA GGTCGGCGTG CTGCGCCTCG CGTACGACCA GTTGCCGACC
GCGCGCGAGC GCACGACGGG CGCGGTCGAT GCGCTGAAGG CCGCGGGCTT CGCGGCGGCG
AACGTCGTCG ACGCGCCGGA GATGACGGCC GATACCGAAG GCGCGTTCAA CGCCGCGAAC
ATCGCGTTCA CCAAGCATCG GAACTTCAAG CACTGGGTGG CGTTCGGATC GAATGACGAC
ACGACGGTCG GCGCGGTGCG CGCGGGCGAG GGGCGCGGCA TCGGGGCGGA CGACATGATC
GCGGTCGGCA TCAACGGCAG CCAGGTCGCG CTGAACGAAT TCGCGAAACC GAAGCCGACG
GGCTTTTTCG GCTCGATCCT GCTGAATCCG CGGCTGCACG GCTACGACAC GTCGGTCAAC
ATGTACGACT GGATCACGCA GAACCGCGCG CCGCCGCCGG TCGTGCTGAC GTCCGGCACG
CTGATCACGC GCGCGAACGA AAAGACGGCG CGCGCGCAGC TCGGGCTGTG A
 
Protein sequence
MLFPRRPAAF DLRPSTFDLR PSTFDLRPQT SDLRRPTSDV GPSTPTPDIG HRTSDIGHRT 
SGIQTRHPPH PSPPHRPIPF FRNPSQYLEA RRRASYNQPH DMHAVARLAR PHPYRKRSTK
ETLMGLRWPQ AALVCASLAA GLSAAAPAHA QGAAPVKIGF VVKQPDDPWF QDEWRFAEQA
AKDKHFTLVK IAAPSGEKVS TALDSLAAQK AQGVIICAPD VKLGPGIAAK ARRYGMKLMS
VDDQLVDGRG APLADVPHMG ISAYRIGRQV GDAIAAEAKR RGWNPAEVGV LRLAYDQLPT
ARERTTGAVD ALKAAGFAAA NVVDAPEMTA DTEGAFNAAN IAFTKHRNFK HWVAFGSNDD
TTVGAVRAGE GRGIGADDMI AVGINGSQVA LNEFAKPKPT GFFGSILLNP RLHGYDTSVN
MYDWITQNRA PPPVVLTSGT LITRANEKTA RAQLGL
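
The sequences above are printed in blocks of ten characters, so the spacing has to be removed before computing a length or a GC fraction. The following is a minimal sketch of that cleanup. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the demonstration uses only the first two blocks of the gene sequence rather than the full listing.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence listed above.
example = "GTGCTTTTCC CGCGACGCCC"
print(len(clean_sequence(example)))
print(round(gc_fraction(example), 2))
```

Applying the same helpers to the complete gene sequence would be one way to check it against the Gene Length and GC content fields of the record.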