Gene BURPS668_A2070 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A2070 
Symbol 
ID4887998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp2008056 
End bp2009474 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content68% 
IMG OID640132008 
Productmajor facilitator transporter 
Protein accessionYP_001063065 
Protein GI126444242 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.376866 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTTCAA CGCAAAGCGC GCCGCGCGCA ACGGCGCCGG CAACGACGAT CGACGCCGGC 
GTCATCTCGG CGCGCCTCGA TCGCCTGCCG CCCACGCGCA GCGTCTGGAA ACTCGTCGCG
CTGCTGAGTC TCGGCTTCTT CTTCGAGCTC TACGATCTGC TGTACAGCGG CTACGTCGCG
CCCGGCCTCG TGAAGGGCGG CATCCTGAGC GCGACGACGC GCGGGCTGTT CGGCACGACG
GGCGTCGCGA GCTTCATCGC CGCGCTGTTC GCGGGGCTCT TCATCGGCAC GATCGCGTGC
GGCTTTCTCG CCGACCGCTT CGGCCGCCGC GCGGTGTTTA CGTGGTCGCT GCTGTGGTAC
ACGGCCGCGA ACCTCGTGAT GGCGTTCCAG GATACCGCCG GGGGCCTCAA TTTCTGGCGC
TTCGTCGTCG GGCTGGGGCT CGGCGTCGAA ATGGTGACGA TCGGCACATA TATCTCGGAG
TTGGTCCCGA AACAGATTCG CGGCCGCGCG TTCGCGTGCG AGCAGGCGGT CGGCTTCGTC
GCGGTGCCCG TGGTGGCGCT GCTCGCGTAT CTGCTGGTGC CGCATGCGCC GTTCGGCCTC
GACGGCTGGC GCTGGGTCGT GCTGATCGGC GCGCACGGCG CGATCTTCGT CTGGTGGATT
CGCCGCCAGT TGCCGGAAAG CCCGCGCTGG CTCGCGCAGC AGGGCCGGCT TGACGAAGCC
GAGCGCGTGC TCGCCGCGCT CGAGGCGAAG GTCGAGGCCG AGTACGGCCG GCCGCTGCCG
CCGCCCGCGC CCGCCGAGCC CATCGCGCCG CGCGGCCGGT TCGCCGACAT GTGGGTGCCG
CCGTACCGCA GGCGCACGGT GATGATGACG ATCTTCAACG TGTTTCAGAC GGTGGGCTTC
TACGGCTTCG CGAACTGGGT GCCGACGCTG CTGATCAAGC AGGGGATCAC CGTCACGACG
AGCCTCATGT ATTCGAGCGT GATCGCGCTC GCCGCGCCGA TCGGGCCGTT GATCGGCCTT
GCGATCGCCG ACCGCTTCGA GCGCAAGACG GTGATCGTCG CGATGGCGGG CGCGGCGATG
ATCGCGGGGC TGCTGTTCAG CCACGCGTCG GCCGCGTGGC TGCTCGTCGC GCTCGGCGTA
TGCCTCACGC TCGCGAACAA CATCATGTCG TACAGCTTCC ATGCCTATCA GGCCGAGCTG
TTTCCGACCG CGATCCGCGC GCGCGCGGTC GGCTTCGTCT ATTCGTGGAG CCGCTTTTCG
GCGATCTTCA CGTCGTTCGC GATCGCGGCC GTGCTGAAGG GATTCGGCAC GCCCGGCGTG
TTCGTGTTCA TCGCGGGCGC GATGGCGATC GTGATGGCGT CGATCGGGCT GATGGGGCCG
CGCACGAAAG GCGTCGCGCT CGAAGCGATA TCGCGTTGA
 
Protein sequence
MASTQSAPRA TAPATTIDAG VISARLDRLP PTRSVWKLVA LLSLGFFFEL YDLLYSGYVA 
PGLVKGGILS ATTRGLFGTT GVASFIAALF AGLFIGTIAC GFLADRFGRR AVFTWSLLWY
TAANLVMAFQ DTAGGLNFWR FVVGLGLGVE MVTIGTYISE LVPKQIRGRA FACEQAVGFV
AVPVVALLAY LLVPHAPFGL DGWRWVVLIG AHGAIFVWWI RRQLPESPRW LAQQGRLDEA
ERVLAALEAK VEAEYGRPLP PPAPAEPIAP RGRFADMWVP PYRRRTVMMT IFNVFQTVGF
YGFANWVPTL LIKQGITVTT SLMYSSVIAL AAPIGPLIGL AIADRFERKT VIVAMAGAAM
IAGLLFSHAS AAWLLVALGV CLTLANNIMS YSFHAYQAEL FPTAIRARAV GFVYSWSRFS
AIFTSFAIAA VLKGFGTPGV FVFIAGAMAI VMASIGLMGP RTKGVALEAI SR