Gene BURPS668_A0630 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A0630 
Symbol 
ID4886655 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp589895 
End bp591910 
Gene Length2016 bp 
Protein Length671 aa 
Translation table11 
GC content68% 
IMG OID640130570 
Productsolute/sodium symporter (SSS) family protein 
Protein accessionYP_001061630 
Protein GI126445184 
COG category[R] General function prediction only 
COG ID[COG4147] Predicted symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family
[TIGR03648] probable sodium:solute symporter, VC_2705 subfamily 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.449761 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGACGA ATCGGCTGGT TCGCGCGTAC GCGCTGTACA CGATCGGCTT CGCCGCGTTC 
GTGCTGCTGC TGTGGCGGAT CGAGCGCGCG ACCGGGCCGG GCGTGTGGAT CGGCTATGTG
TTCCTGTTCG TGCCGATCGC GGTCTATGCG GTGATCGGGC TGCTGTCGCG GACATCGGAC
CTCGTCGAAT ACTACGTGGC CGGGCGGCGC GTGCCGTCCG CGTTCAACGG CATGGCGACC
GCCGCCGACT GGCTGTCCGC GGCGTCGTTC ATCGGCCTCG CGGGCTCGCT GTACGCGACC
GGTTACGACG CGCTCGCGTA CCTGATGGGC TGGACGGGCG GTTTCTGCCT CGTTGCGTTC
CTGCTCGCGC CATACGTGCG CAAGCTCGCG CGCTACACGA TTCCCGATTT TCTGGGCACG
CGCTTTTCGA GCACGCTCGT CAGGGCGCTC GCGGCGATCG CCGCGATCCT GTGCTCGTTC
GTCTACCTGG TCGCGCAGAT ACAGGGCATC GGCCTCATCG CGACGCGCTT CATCGGCGTC
GATTTCTCGA TCGGCATCTT CTGCGGCCTC GCGGGGATTC TCGTGTGCTC GTTTCTGGGC
GGGATGCGGG CGGTCACGTG GACGCAGGTC GCGCAGTACA TCATCCTGAT CATTGCGTTC
CTGCTGCCGG TTTCGCTGAT CGCGATGAAG AACGGCCTCG GGCCCGTGCC GCAATTCAAT
TACGGCCGGC TGATGTCGCG GGTCGAAACG CTCGAGGGCG AGATGCGCGA CGCGCCGCAG
GAGCGGCAGG TGCGCGAGAC GTATCGTCGG CAGGCGGGCG CGATCCAGGC GAGGCTCGAC
CGGCTGCCGG CGTCTTACGA CGAGGCGCGC GCGAAGCTGG TCGATCAGGT CGCCGAGCTG
CGCCGGCACA ACGGGCCGCT GCGCGAGATC AACCAGCGCG AACGCGCGCT TGCCGAGTTT
CCGCGCGATC CGGCGGCGGC GCGGGTGGTG TGGGAGCAGG CGCGCGACGA ACTGCTCGCG
CGCGCGGCGG CCCCGGTGCC GATGCACGAG CCGTTTCCGG CCGCGAGCGA CGACGATCGC
CGCCCGCGCG GGCGCAACTT CCTCGCGCTG CTGCTGTGCC TGTCGCTCGG CACGGCGAGC
CTGCCGCACA TCCTGACGCG CTACAACACG ACGACCTCGG TGGCGGCTGC GCGGCGCTCG
GTGGGCTGGA CGCTGTTTTT CATCGCGCTG TTCTATTTGA CGGTGCCGGT GCTCGCGGTG
CTGATCAAGT ACGAGATCCT GACGAATCTG GTGGGGCGGC CGTTCGCCGA TCTGCCCGCG
TGGATCACGC AATGGCACCG TTTCGAGCCG GGGCTCATCG GCGTTGCCGA CCTGTTGCGC
GACGGCATTG TCCATTGGTC GGAGATCCAG ATGCAGCCCG ATATCGTCGT GCTCGCGGCG
CCGGAGATCG CGGGGCTGCC GTATGTGGTG TCGGGGCTGA TCGCGGCGGG CGCGCTTGCC
GCGGCGCTGT CGACGGCGGA CGGGCTGCTG CTGACGATCG CGAACGCGCT GTCGCACGAC
GTCTACTACC ACATGGTTGC GCCGGACGCG TCGAGCCAGC GGCGCGTGAC GATCTCGAAG
GTGCTGTTGC TCGGCGTCGC GCTGTTTGCG TCGTATGTTG CGTCGCTGAA TACGGGGAAG
ATTCTGTTTC TTGTCGGGGC GGCGTTCTCG CTCGCGGCGT CGAGTTTCTT TCCGGTGCTC
GTGCTGGGCG TGTTCTGGAA GCGCACGACG ACGCGCGGCG CGGTGGCGGG GATGATGACG
GGGCTTGGCG TGTGCGTGTA CTACATCGTG TCGACGTATC CGTTCTTCAC GCAGATCACG
GGCTTCGCGG GGCCGAGCTG GCTCGGCATC GAGCCGATCA GCTCGGGCGT GTTCGGCGTG
CCGGCCGGGT TTGCGACGGC GATCGTCGTG AGCCTGCTGG ATCGGCGGCC GGATGCGTAC
ACGAACGCGC TTGTCGACTA TATCCGGCAC CCGTGA
 
Protein sequence
MLTNRLVRAY ALYTIGFAAF VLLLWRIERA TGPGVWIGYV FLFVPIAVYA VIGLLSRTSD 
LVEYYVAGRR VPSAFNGMAT AADWLSAASF IGLAGSLYAT GYDALAYLMG WTGGFCLVAF
LLAPYVRKLA RYTIPDFLGT RFSSTLVRAL AAIAAILCSF VYLVAQIQGI GLIATRFIGV
DFSIGIFCGL AGILVCSFLG GMRAVTWTQV AQYIILIIAF LLPVSLIAMK NGLGPVPQFN
YGRLMSRVET LEGEMRDAPQ ERQVRETYRR QAGAIQARLD RLPASYDEAR AKLVDQVAEL
RRHNGPLREI NQRERALAEF PRDPAAARVV WEQARDELLA RAAAPVPMHE PFPAASDDDR
RPRGRNFLAL LLCLSLGTAS LPHILTRYNT TTSVAAARRS VGWTLFFIAL FYLTVPVLAV
LIKYEILTNL VGRPFADLPA WITQWHRFEP GLIGVADLLR DGIVHWSEIQ MQPDIVVLAA
PEIAGLPYVV SGLIAAGALA AALSTADGLL LTIANALSHD VYYHMVAPDA SSQRRVTISK
VLLLGVALFA SYVASLNTGK ILFLVGAAFS LAASSFFPVL VLGVFWKRTT TRGAVAGMMT
GLGVCVYYIV STYPFFTQIT GFAGPSWLGI EPISSGVFGV PAGFATAIVV SLLDRRPDAY
TNALVDYIRH P