Gene BURPS1710b_A1472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A1472 
Symbol 
ID3692471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp1793033 
End bp1795183 
Gene Length2151 bp 
Protein Length716 aa 
Translation table11 
GC content71% 
IMG OID637731726 
Productputative permease 
Protein accessionYP_336629 
Protein GI76818324 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4591] ABC-type transport system, involved in lipoprotein release, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGCGCAC GGGAGGCGCG TGATGGCGAT CGTCCGCGTG GAGAACGTCA GCAAGACCTA 
CCTGCTCGAC AGCGTGAAGG TGACCGGGCT CGCCGGCGTG TCGGTCGACA TCGAAGCGAA
CCGCTTCACC GTGCTGAGCG GCCCGTCCGG CAGCGGCAAG ACGACGCTGC TCAACATGAT
CGGCTGCATC GACCGGCCGG ACGCGGGGCG CGTCATTGTG GCCGGCCAGG ACACCGCGGC
GCTCTGCGAC GACGCGCTGT CGGATTTTCG CGCGCGCGAG GTCGGGCACG TGTTCCAGTC
GTTCAATCTG CTGCCCGTGC TGTCCGCGTA CGAGAACGTC GAGTATCCGC TGCTGATGGC
GCGCACGCCG GCGCGCGAGC GCGCGCGTCG CGTCGCGTAT CTGCTCGATG CGGTGGGGCT
TGCCGGCAAG GGCGCGCATC GGCCGAGCCA GCTGTCGGGC GGCCAGCGCC AGCGCGTCGC
GATCGCGCGC GCGCTCGCGG CGGGGCCGTC GCTCGTGCTC GCCGACGAGC CGACCGCGAA
CCTCGACAGC GTCACGGGCC GCGCGATCAT CGCGCTGATG CGCGAGATGC AGCGCGAGAG
CGGCGTGTCG TTCATCGTGT CGTCGCACGA TCCGCAAGTG CTGGAGGCGG CCGACGACGT
CGTGCAAATT CGCGACGGGC GCATCGTCGA GCGCCGCCGC ATCGCGCAGG AGGCGCAAGG
CTGATGCATA CGTTCATGTT GGCGCTGCGC AACCTGCAGC GGAACCGGCG GCGCTCGATC
ACGACGCTGC TCGCGATGGT CGTCGGCGTG TGCGCGGTGC TGCTGTTCGG CGGCTTCAGC
AAGGACATCA CGTTCGGATT GCAGACCGAT TTCGTGCGGC GCAGCGGCCA TTTGCAGATT
CAGCGTCACG GCTATTACCA GTACGGCAGC GGCAATCCGG TCGCCTACGG GATCGGCGGC
TATGCGCGGC TGATCGCGCA ACTGCGCGAC GATCCGGTGC TCGCGCCGAT GATCGCGGTG
ATCACCCCGA CGCTGCAGTT CGGCGGGATC GCCGGCAACT TCGAGGCGGG CGTGTCGCGC
ACGGTGCTCG CGCAGGGCGC GATCGCCGAC GAGCAGGACC GGATGCAGCA ATGGAACGAC
TACGGTTTTC CGTTGACGCC GAAGCCGTAT CCGCTCGTCG GCACCGCGCC GGATGCGGCG
ATCGTGGGCA ACGGCGTCGC GCGTGTGCTG CATCTGTGCG CGCCGCTGCG CGTGCCCGAT
TGCGGCGACG GCGACGTCGC GAGGCAGGCC GCGCCGCAAG CGGCGTCGGG CGCGTCGGGC
GCGGATGCGC CGGCCGACGT GCTCGCGCTC GCGGCGGCCG AGGCGCACGC GCAGGAAAGC
GGGCGCGGCG AGCGCGCGGG CGCGGCGCAC ATCGAGGTGC TCGCGGCGAA CGCGTACGGC
GCGCCGAACG TCGGTGGCTT CACGGTTGCG AAGGCCGAGC AGCAGGGCGT GAAGGAGCTC
GACGACGTTT ATCTCGCGAT GCACTTGCCG CGCGCGCAGC GGCTCGTCTA CGGCGGCGAC
GCGCCGCGCG TGACGGCGAT CGAGATCCAG CTCCGGCATA CCGCGCAGCT TCCCGCCGCG
CGTGCGCGGC TCGACCGGCT GTTCGGCGGC CGCTTCGAGG GCCAGCCGCT CGACGTGCTC
GATTTCGCCG CGCTGAACCC GTTCTACGAC CAGACGAACC GGATGTTCTC GATGATCTTC
GGCTTCGTGT TCGTGCTGAT CGGCGCGATC GTGCTGTTCG TCGTCAGCAA CACGATGAGC
ACGGCGATCC TCGAGCGAAC CGTGGAGATC GGCACGCTGC GCTCGATGGG CGTGCGCCGC
GGCGGCATCC AGGCGCTGTT CGTCTGCGAG GGCGCGCTGC TCGGCGTCGT CGGCGCGTCG
ATCGGCGTGC TCGTCGCGCT CGCGCTCGCG TTCGTGGTCA ATCACAGCGG GCTCGCGTGG
ACGCCGCCCG CGCGGATCGA CTCGGTCGCG CTGACGGTGC GCGTGTGGGG CGAATGGCGG
CTCATCGCGC TGACGTTCGT CGGGCTCGCG TTCGTCGCGG GATTCTCGGC GTGGCTGCCC
GCGCGCCACG CGGCGCGGCT GTCGATCGTC GACGCGCTGC GCTATGCGTG A
 
Protein sequence
MRAREARDGD RPRGERQQDL PARQREGDRA RRRVGRHRSE PLHRAERPVR QRQDDAAQHD 
RLHRPAGRGA RHCGRPGHRG ALRRRAVGFS RARGRARVPV VQSAARAVRV RERRVSAADG
AHAGARARAS RRVSARCGGA CRQGRASAEP AVGRPAPARR DRARARGGAV ARARRRADRE
PRQRHGPRDH RADARDAARE RRVVHRVVAR SASAGGGRRR RANSRRAHRR APPHRAGGAR
LMHTFMLALR NLQRNRRRSI TTLLAMVVGV CAVLLFGGFS KDITFGLQTD FVRRSGHLQI
QRHGYYQYGS GNPVAYGIGG YARLIAQLRD DPVLAPMIAV ITPTLQFGGI AGNFEAGVSR
TVLAQGAIAD EQDRMQQWND YGFPLTPKPY PLVGTAPDAA IVGNGVARVL HLCAPLRVPD
CGDGDVARQA APQAASGASG ADAPADVLAL AAAEAHAQES GRGERAGAAH IEVLAANAYG
APNVGGFTVA KAEQQGVKEL DDVYLAMHLP RAQRLVYGGD APRVTAIEIQ LRHTAQLPAA
RARLDRLFGG RFEGQPLDVL DFAALNPFYD QTNRMFSMIF GFVFVLIGAI VLFVVSNTMS
TAILERTVEI GTLRSMGVRR GGIQALFVCE GALLGVVGAS IGVLVALALA FVVNHSGLAW
TPPARIDSVA LTVRVWGEWR LIALTFVGLA FVAGFSAWLP ARHAARLSIV DALRYA