Gene BURPS1106A_3104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_3104 
Symbol 
ID4902320 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp3025257 
End bp3026885 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content71% 
IMG OID640136330 
Productputative amino acid ABC transporter, permease protein 
Protein accessionYP_001067342 
Protein GI226830772 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0559] Branched-chain amino acid ABC-type transport system, permease components 
TIGRFAM ID[TIGR03409] urea ABC transporter, permease protein UrtB 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCATGC CTTTGAGCCG CGCCGCTCGT GCGCTCGCGG CGCTCGCCGC CTGCGCGGCG 
TTTTCGTTCG CCGCGCCGCG TGCGGCGCTC GCCGTCACGG CGGCCGACGT CGCCGCGCTC
GCCGGCGACG ACTTCGATGC GAAGCGCGCC GCGATCGACC GTCTCGCCGC CGGGCACGAC
GCCGCCGCGG CCGCGCTGCT GAACGCGCTC GCGAACGGCG ACGCGCTCGC GACCGACGCC
GGGCGCATCC TGATTCAGCA TGGCGACGCC GCGCGCGACG CACTGACGAA CGCGCCCGCG
CAGGCGGGCG ATGCGCAGCC GGTGATGCTC AACAACCTGC TGCGCGCGAA GATCGCGAAC
GCACTGTCGG GGCTCGATCT CGCGTCGCCC GACATCGACA CGCGCCGCCG CGCGATCGAT
GCGCTGCTCA AGCGCCCCGA TGCCGCGCTC AAGCCGATGA TCGACGCCGC GCGTGCGAAG
GAAACCGATC CCGTGCTCAA GCGCCGCCTC GACGCGCTAT GGGCGATCGC CGCGCTGCGC
GACGCCGATC CCGCGAAGCG CCTCGAAGCG GTGCGGCTCG TCGCCGCGCG AAGCGATCTC
GACATGATCG AGCAACTGCG CCCGCTCGTC GCGAAGCGGC CCGACGGCGG CGACGCGGAA
CCCGATGCGC GCGTGCGCGA GGCCGCGCAG CAGGGGCTCG GCGCGCTCTA TGCGATCCAG
CGCCGCGGCG AAATCGCGGG CACGCTGTTC GCGGGGCTCT CGCTCGGCAG CGTGCTGCTG
CTCGCCGCGC TCGGCCTCGC GATCACGTAC GGCCTCATCG GCGTCATCAA CATGGCGCAC
GGCGAGTTCC TGATGATCGG CGCGTATGCG ACCTACGTCG TGCAGACGCT CGTGCAGCGC
TATCTGCCCG GCGCGTTCGA CTGGTATCCG CTCGCCGCGA TTCCCGTGTC GTTCGCCGCG
GCCGCCGCGC TCGGCATCGT GCTCGAGCGC ACGGTGCTCA GGCACCTGTA TGGCCGCCCG
CTCGAGACGC TGCTCGCGAC GTTCGGCGTG AGCCTCATCC TGATCCAGGC GACGCGGATG
ATCTTCGGCG CACAGAACGT GCAGGTCGTC AATCCGTCGT GGATGAGCGG CGGCGTGACC
GTGATGCAGA ACCTGATCCT GCCGTACAAC CGCCTCGCGA TCCTCGCGTT CGCGCTCGTC
GTCGTCGGCA TCGCGTGGGC CGTGCTGACG AAAACGCGCC TCGGCCTGTT CGTGCGCGCG
GTCACGCAGA ACCGCCGGAT GGCCGCGTGC GTCGGCGTGA AGACGGCGCG CGTCGATTCG
TATGCGTTCG CGTTCGGCGC GGGCATCGCG GGCCTCGGCG GCTGCGCGCT GTCGCAGATC
GGCAACGTCG GCCCGGATCT CGGCCAGAGC TACATCGTCG ATTCGTTCAT GGCGGTCGTG
CTGGGCGGCG TCGGCCAGAT CGCGGGCACG GTGCTCGGGG GCTTCGGCCT CGGGCTCGTC
AGCAAGGCGA TCGAGCCGTT CTGGGGCGCG GTGCTCGCGA AGATCGCCGT GCTCGTGATG
ATCGTGCTGT TCATCCAGAA ACGCCCGCAA GGGATGTTCG CCCTGAAGGG CCGCAGCGCG
GAGGCATGA
 
Protein sequence
MPMPLSRAAR ALAALAACAA FSFAAPRAAL AVTAADVAAL AGDDFDAKRA AIDRLAAGHD 
AAAAALLNAL ANGDALATDA GRILIQHGDA ARDALTNAPA QAGDAQPVML NNLLRAKIAN
ALSGLDLASP DIDTRRRAID ALLKRPDAAL KPMIDAARAK ETDPVLKRRL DALWAIAALR
DADPAKRLEA VRLVAARSDL DMIEQLRPLV AKRPDGGDAE PDARVREAAQ QGLGALYAIQ
RRGEIAGTLF AGLSLGSVLL LAALGLAITY GLIGVINMAH GEFLMIGAYA TYVVQTLVQR
YLPGAFDWYP LAAIPVSFAA AAALGIVLER TVLRHLYGRP LETLLATFGV SLILIQATRM
IFGAQNVQVV NPSWMSGGVT VMQNLILPYN RLAILAFALV VVGIAWAVLT KTRLGLFVRA
VTQNRRMAAC VGVKTARVDS YAFAFGAGIA GLGGCALSQI GNVGPDLGQS YIVDSFMAVV
LGGVGQIAGT VLGGFGLGLV SKAIEPFWGA VLAKIAVLVM IVLFIQKRPQ GMFALKGRSA
EA