Gene BURPS668_1164 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_1164 
Symbol 
ID4882502 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp1141888 
End bp1143438 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content66% 
IMG OID640127092 
Productsolute/sodium symporter (SSS) family protein 
Protein accessionYP_001058213 
Protein GI126439622 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCTGA CCGCGACCTT CGTCTTCGTG CTGTTCTTCG TCGGCGTGAC GATCATGGGT 
TTTCTCGCCG CGAACTGGCG GCGCGGCAAC CTCGCGCATC TCGACGAATG GGGGCTCGGC
GGCCGGCGCT TCGGCACGGT CGTCACGTGG TTCCTGCTCG GCGGCGATCT CTACACCGCG
TACACGTTCG TCGCCGTGCC GGCGCTCGTG TTCGGCGCGG GCGCGATGGG CTTCTTCGCG
CTGCCGTACA CGATCCTCAT CTATCCGTTC GCGTTCGTCG TATTCCCGAA GCTCTGGAGC
ATCGCGAAGC GTCACGGCTA CGTGACGGCC GCCGATTTCG TCAACGCGCG CTACGGCAGC
CGCTCGCTCG CGCTCGCCGT CGCGGTGACG GGCATCGTCG CGACGATGCC GTACATCGCG
CTGCAGCTCG TCGGCATCGA GGTGGTGATC GGCGGGCTCG GCTTCGACAC CAAGGGCTTC
ATCGGCGATC TGCCGCTCAT CATCGCGTTC GCGATCCTCG CCGCTTACAC GTACACGTCG
GGGCTGCGTG CGCCCGCGAT GATCGCGATC GTCAAGGACA TCCTGATCTA CATCACGATC
GCCGCGGCCG TGATCGTGAT TCCGGCGAAG CTGGGCGGCT TCGGGCACAT CTTCGGCGCG
GTGCCGCCCG CGAAGCTGCT GTTGAAAGCG CCCGACGCGG CGAGCCTGAA CGGCTTCAGC
GCGTACACGA CGCTCGCGAT CGGCTCGGCG CTCGCGCTGT TCCTGTATCC GCACTCGGTG
ACGGCGATCC TGTCGTCGTC GTCGGGCAAC ACGATCCGCC GCAACATGGC GATGCTGCCC
GCGTACTCGT TCGTGCTCGG CCTGCTGGCG CTGCTCGGCT ACATGGCGCT CGCATCGGGC
GTGAAGGACA TGCCGGAATA CGCGCCGTAC TTCAAGGCGT TCGGCCCGAA TTTCGCGGTG
CCGGCGTTGT TCCTGCATTT CTTCCCGTCG TGGTTCGTCG GCGTCGCGTT CGCCGCGATC
GGGATCGGCG CGCTCGTGCC GGCGGCGATC ATGTCGATCG CGGCCGCGAA CCTGTACACG
CGCAACATCC ATCGCGAGTT CGTCAACCGC AACATGACGC ACGATCAGGA AACGCACGTC
GCGAAGCTCG TGTCGCTGAT CGTGAAGGTC GGCGCGGTCG CGTTCATTCT CGGGCTGCCG
CTCACCTACG CGATCCAGCT GCAACTGCTC GGCGGGATCT GGATCATCCA GACGCTGCCC
GCGATCGTGC TCGGGCTCTA TACGCGCGTG CTCGACTATC GCGGGCTGCT CGCCGGCTGG
GCGGCGGGGC TCGTCTGCGG CACGTGGATG GCGATTTCGC TGAAGCTCGC GAGCTCGATC
TTCACGATCC ATCTGTTCGG CCATGCGATT CCGGGCTACG CGGCCGTCTG GGCGCTGGCC
GTGAATCTCG TCGTGTCGAT CGTGGCCAGC GTGCTGGTTC GCGCGTTCGG GATCGCGCAC
GCGGAAGATC GCACGCGGCC GGAGGATTAT CTCGACGTCG TCGAGAGTTG A
 
Protein sequence
MNLTATFVFV LFFVGVTIMG FLAANWRRGN LAHLDEWGLG GRRFGTVVTW FLLGGDLYTA 
YTFVAVPALV FGAGAMGFFA LPYTILIYPF AFVVFPKLWS IAKRHGYVTA ADFVNARYGS
RSLALAVAVT GIVATMPYIA LQLVGIEVVI GGLGFDTKGF IGDLPLIIAF AILAAYTYTS
GLRAPAMIAI VKDILIYITI AAAVIVIPAK LGGFGHIFGA VPPAKLLLKA PDAASLNGFS
AYTTLAIGSA LALFLYPHSV TAILSSSSGN TIRRNMAMLP AYSFVLGLLA LLGYMALASG
VKDMPEYAPY FKAFGPNFAV PALFLHFFPS WFVGVAFAAI GIGALVPAAI MSIAAANLYT
RNIHREFVNR NMTHDQETHV AKLVSLIVKV GAVAFILGLP LTYAIQLQLL GGIWIIQTLP
AIVLGLYTRV LDYRGLLAGW AAGLVCGTWM AISLKLASSI FTIHLFGHAI PGYAAVWALA
VNLVVSIVAS VLVRAFGIAH AEDRTRPEDY LDVVES