Gene BMA10247_A2052 details

Gene Information

Locus tag: BMA10247_A2052
Symbol:
ID: 4891273
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia mallei NCTC 10247
Kingdom: Bacteria
Replicon accession: NC_009079
Strand:
Start bp: 1988448
End bp: 1990463
Gene Length: 2016 bp
Protein Length: 671 aa
Translation table: 11
GC content: 67%
IMG OID: 640148313
Product: solute/sodium symporter (SSS) family protein
Protein accession: YP_001079224
Protein GI: 126446259
COG category: [R] General function prediction only
COG ID: [COG4147] Predicted symporter
TIGRFAM ID: [TIGR00813] transporter, SSS family; [TIGR03648] probable sodium:solute symporter, VC_2705 subfamily
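
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS is complete and ends in a stop codon. The function names are illustrative and are not part of any database API.

```python
# Coordinates copied from the record above.
start_bp = 1988448
end_bp = 1990463


def gene_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS: one per codon, excluding the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(start_bp, end_bp)
print(length, protein_length(length))  # prints: 2016 671
```

Both results agree with the Gene Length and Protein Length fields listed above.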


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.483179
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGACGA ATCGGCTGGT TCGCGCGTAC GCGCTGTACA CGATCGGCTT CGCCGCGTTC 
GTGCTGCTGC TGTGGCGGAT CGAGCGCGCG ACCGGGCCGG GCGTGTGGAT CGGCTATGTG
TTCCTGTTCG TGCCGATCGC GGTCTATGCG GTGATCGGGC TGCTGTCGCG GACATCGGAC
CTCGTCGAAT ACTACGTGGC CGGGCGGCGC GTGCCGTCCG CGTTCAACGG CATGGCGACC
GCCGCCGACT GGCTGTCCGC GGCGTCGTTC ATCGGCCTCG CGGGCTCGCT GTACGCGACC
GGTTACGACG CGCTCGCGTA CCTGATGGGC TGGACGGGTG GTTTCTGCCT CGTTGCGTTC
CTGCTCGCGC CATACGTGCG CAAGCTCGCG CGCTACACGA TTCCCGATTT TCTGGGCACG
CGCTTTTCGA GCACGCTCGT CAGGGCGCTC GCGGCGATCG CCGCGATCCT GTGCTCGTTC
GTCTACCTGG TCGCGCAGAT ACAGGGCATC GGCCTCATCG CGACGCGCTT CATCGGCGTC
GATTTCTCGA TCGGCATCTT CTGCGGCCTC GCGGGGATTC TCGTGTGCTC GTTTCTGGGC
GGGATGCGGG CGGTCACGTG GACGCAGGTC GCGCAGTACA TCATCCTGAT CATTGCGTTC
CTGCTGCCGG TTTCGCTGAT CGCGATGAAG AACGGCCTCG GGCCCGTGCC GCAATTCAAT
TACGGCCATC TGATGTCGCG GGTCGAAACG CTCGAGGGCG AGATGCGCGA CGCGCCGCAG
GAGCGGCAGG TGCGCGAGAC GTATCGTCGG CAGGCGGGCG CGATCCAGGC GAGGCTCGAC
CGGCTGCCGG CGTCTTACGA CGAGGCGCGC GCGAAGCTGG TCGATCAGGT CGCCGAGCTG
CGCCGGCACA ACGGGCCGCT GCGCGAGATC AACCAGCGCG AACGCGCACT TGCCGAGTTT
CCGCGCGATC CGGCGGCGGC GCGGGTGGTG TGGGAGCAGG CGCGCGACGA ACTGCTCGCG
CGCGCGGCGG CCCCGGTGCC GATGCACGAG CCGTTTCCGG CCGCGAGCGG CGACGATCGC
CGCCCGCGCG GGCGCAACTT CCTCGCGCTG CTGCTGTGCC TGTCGCTCGG CACGGCGAGC
CTGCCGCACA TCCTGACGCG CTACAACACG ACGACCTCGG TGGCGGCTGC GCGGCGCTCG
GTGGGCTGGA CGCTGTTTTT CATCGCGCTG TTCTATTTGA CGGTGCCGGT GCTCGCGGTG
CTGATCAAGT ACGAGATCCT GACGAATCTG GTGGGGCGGC CGTTCGCCGA TCTGCCCGCG
TGGATCACGC AATGGCACCG TTTCGAGCCG GGGCTCATCG GCGTTACCGA CCTGTTGCGC
GACGGCATTG TCCATTGGTC GGAGATCCAG ATGCAGCCCG ATATCGTCGT GCTCGCGGCG
CCGGAGATTG CGGGGCTGCC GTATGTGGTG TCGGGGCTGA TCGCGGCGGG CGCGCTTGCC
GCGGCGCTGT CGACGGCGGA CGGGCTGCTG CTGACGATCG CGAACGCGCT GTCGCACGAC
GTCTACTACC ACATGGTTGC GCCGGACGCG TCGAGCCAGC GGCGCGTGAC GATCTCGAAG
GTGCTGTTGC TCGGCGTCGC GCTGTTTGCG TCGTATGTTG CGTCGCTGAA TACGGGGAAG
ATTCTGTTTC TTGTCGGGGC GGCGTTCTCG CTCGCGGCGT CGAGTTTCTT TCCGGTGCTC
GTGCTGGGCG TGTTCTGGAA GCGCACGACG ACGCGCGGCG CGGTGGCGGG GATGATGACG
GGGCTTGGCG TGTGCGTGTA CTACATCGTG TCGACGTATC CGTTCTTCAC GCAGATCACG
GGCTTCGCGG GGCCGGGCTG GCTCGGCATC GAGCCGATCA GCTTGGGCGT GTTCGGCGTG
CCGGCCGGGT TTGCGACGGC GATCGTCGTG AGCCTGCTGG ATCGGCGGCC GGATGCGTAC
ACGAACGCGC TTGTCGACTA TATCCGGCAC CCGTGA
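
A GC content figure such as the one reported for this gene can be recomputed from the gene sequence. The sketch below assumes the sequence is pasted as plain text in the layout used here (blocks of 10 bases separated by spaces and line breaks). The helper name `gc_content` is hypothetical.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases in a DNA sequence, ignoring whitespace."""
    # Remove the spaces and line breaks used to lay out the sequence in blocks.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First block of the gene sequence above: 5 of its 10 bases are G or C.
print(gc_content("ATGCTGACGA"))  # prints: 0.5
```

Passing the full gene sequence to the same function and rounding to a whole percent gives a value that can be compared with the GC content field.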
 
Protein sequence
MLTNRLVRAY ALYTIGFAAF VLLLWRIERA TGPGVWIGYV FLFVPIAVYA VIGLLSRTSD 
LVEYYVAGRR VPSAFNGMAT AADWLSAASF IGLAGSLYAT GYDALAYLMG WTGGFCLVAF
LLAPYVRKLA RYTIPDFLGT RFSSTLVRAL AAIAAILCSF VYLVAQIQGI GLIATRFIGV
DFSIGIFCGL AGILVCSFLG GMRAVTWTQV AQYIILIIAF LLPVSLIAMK NGLGPVPQFN
YGHLMSRVET LEGEMRDAPQ ERQVRETYRR QAGAIQARLD RLPASYDEAR AKLVDQVAEL
RRHNGPLREI NQRERALAEF PRDPAAARVV WEQARDELLA RAAAPVPMHE PFPAASGDDR
RPRGRNFLAL LLCLSLGTAS LPHILTRYNT TTSVAAARRS VGWTLFFIAL FYLTVPVLAV
LIKYEILTNL VGRPFADLPA WITQWHRFEP GLIGVTDLLR DGIVHWSEIQ MQPDIVVLAA
PEIAGLPYVV SGLIAAGALA AALSTADGLL LTIANALSHD VYYHMVAPDA SSQRRVTISK
VLLLGVALFA SYVASLNTGK ILFLVGAAFS LAASSFFPVL VLGVFWKRTT TRGAVAGMMT
GLGVCVYYIV STYPFFTQIT GFAGPGWLGI EPISLGVFGV PAGFATAIVV SLLDRRPDAY
TNALVDYIRH P
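
The protein sequence can be checked against the gene sequence by translating it codon by codon. This is a minimal sketch using only the Python standard library. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, and it does not model the alternative start codons of table 11, which is harmless here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Amino acids in NCBI order: codons enumerated over T, C, A, G with the
# first base varying slowest. "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop layout whitespace
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCTGACGA ATCGGCTGGT TCGCGCGTAC"))  # prints: MLTNRLVRAY
```

The output matches the first ten residues of the protein sequence, and translating the last line of the gene sequence in the same way gives TNALVDYIRHP before the TGA stop codon.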