Gene BURPS1106A_1172 details


Gene Information

Locus tag: BURPS1106A_1172
Symbol:
ID: 4900541
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 1152197
End bp: 1153747
Gene Length: 1551 bp
Protein Length: 516 aa
Translation table: 11
GC content: 66%
IMG OID: 640134402
Product: solute/sodium symporter (SSS) family protein
Protein accession: YP_001065451
Protein GI: 126453342
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG0591] Na+/proline symporter
TIGRFAM ID:
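
The start, end, gene length and protein length values above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(1152197, 1153747)
print(length)                     # 1551
print(protein_length_aa(length))  # 516
```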


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.422897
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCTGA CCGCGACCTT CGTCTTCGTG CTGTTCTTCG TCGGCGTGAC GATCATGGGT 
TTTCTCGCCG CGAACTGGCG GCGCGGCAAC CTCGCGCATC TCGACGAATG GGGGCTCGGC
GGCCGGCGCT TCGGCACGGT CGTCACGTGG TTCCTGCTCG GCGGCGATCT CTACACCGCG
TACACGTTCG TCGCCGTGCC GGCGCTCGTG TTCGGCGCGG GCGCGATGGG CTTCTTCGCG
CTGCCGTACA CGATCCTCAT CTATCCGTTC GCGTTCGTCG TATTCCCGAA GCTCTGGAGC
ATCGCGAAGC GTCACGGCTA CGTGACGGCC GCCGATTTCG TCAGCGCGCG CTACGGCAGC
CGCTCGCTCG CGCTCGCCGT CGCGGTGACG GGCATCGTCG CGACGATGCC GTACATCGCG
CTGCAGCTCG TCGGCATCGA GGTGGTGATC GGCGGGCTCG GCTTCGACAC CAAGGGCTTC
ATCGGCGATC TGCCGCTCAT CATCGCGTTC GCGATCCTCG CCGCTTACAC GTACACGTCG
GGGCTGCGCG CGCCCGCGAT GATCGCGATC GTCAAGGACA TCCTGATCTA CATCACGATC
GCCGCGGCCG TGATCGTGAT TCCGGCGAAG CTGGGCGGCT TCGGGCACAT CTTCGGCGCG
GTGCCGCCCG CGAAGCTGCT GTTGAAAGCG CCCGACGCGG CGAGCCTGAA CGGCTTCAGC
GCGTACACGA CGCTCGCGAT CGGCTCGGCG CTCGCGCTGT TCCTGTATCC GCACTCGGTG
ACGGCGATCC TGTCGTCGTC GTCGGGCAAC ACGATCCGCC GCAACATGGC GATGCTGCCC
GCGTACTCGT TCGTGCTCGG CCTGCTGGCG CTGCTCGGCT ACATGGCGCT CGCATCGGGC
GTGAAGGACA TGCCGGAATA CGCGCCGTAC TTCAAGGCGT TCGGCCCGAA TTTCGCGGTG
CCGGCGTTGT TCCTGCATTT CTTCCCGTCG TGGTTCGTCG GCGTCGCGTT CGCCGCGATC
GGGATCGGCG CGCTCGTGCC GGCGGCGATC ATGTCGATCG CGGCCGCGAA CCTGTACACG
CGCAACATTC ATCGCGAGTT CGTCAACCGC AACATGACGC ACGATCAGGA AACGCACGTC
GCGAAGCTCG TGTCGCTGAT CGTGAAGGTC GGCGCGGTCG CGTTCATTCT CGGGCTGCCG
CTCACCTACG CGATCCAGCT GCAACTGCTC GGCGGGATCT GGATCATCCA GACGCTGCCC
GCGATCGTGC TCGGCCTCTA TACGCGCGTG CTCGACTATC GCGGGCTGCT CGCCGGCTGG
GCGGCGGGGC TCGTCTGCGG CACGTGGATG GCGATCTCGC TGAAGCTCGC GAGCTCGATC
TTCACGATCC ATCTGTTCGG CCATGCGATT CCGGGCTACG CGGCCGTTTG GGCGCTGGCC
GTGAATCTCG TCGTGTCGAT CGTGGTCAGC GTGCTGGTTC GCGCGTTCGG GATCGCGCAC
GCGGAAGATC GCACGCGGCC GGAGGATTAT CTCGACGTCG TCGAGAGTTG A
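
The gene sequence is printed in blocks of ten bases, so it has to be joined before its length or GC content can be compared with the values in the Gene Information section. The following is a minimal sketch of that clean-up; the helper names are hypothetical, and the demonstration uses only the first two blocks of the sequence rather than the full gene.

```python
def strip_blocks(text: str) -> str:
    """Join a block-formatted sequence by removing spaces and line breaks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two ten-base blocks of the gene sequence above.
seq = strip_blocks("ATGAATCTGA CCGCGACCTT")
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.5
```

Applying the same two functions to the whole sequence gives figures that can be compared with the listed gene length and GC content.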
 
Protein sequence
MNLTATFVFV LFFVGVTIMG FLAANWRRGN LAHLDEWGLG GRRFGTVVTW FLLGGDLYTA 
YTFVAVPALV FGAGAMGFFA LPYTILIYPF AFVVFPKLWS IAKRHGYVTA ADFVSARYGS
RSLALAVAVT GIVATMPYIA LQLVGIEVVI GGLGFDTKGF IGDLPLIIAF AILAAYTYTS
GLRAPAMIAI VKDILIYITI AAAVIVIPAK LGGFGHIFGA VPPAKLLLKA PDAASLNGFS
AYTTLAIGSA LALFLYPHSV TAILSSSSGN TIRRNMAMLP AYSFVLGLLA LLGYMALASG
VKDMPEYAPY FKAFGPNFAV PALFLHFFPS WFVGVAFAAI GIGALVPAAI MSIAAANLYT
RNIHREFVNR NMTHDQETHV AKLVSLIVKV GAVAFILGLP LTYAIQLQLL GGIWIIQTLP
AIVLGLYTRV LDYRGLLAGW AAGLVCGTWM AISLKLASSI FTIHLFGHAI PGYAAVWALA
VNLVVSIVVS VLVRAFGIAH AEDRTRPEDY LDVVES
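
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. As a rough illustration, the sketch below translates short fragments of the gene with a codon table built from the standard genetic code, whose codon-to-amino-acid assignments are the same as those of translation table 11. It is an assumption-laden simplification: it does not handle the alternative start codons that table 11 permits, and the names `CODON_TABLE` and `translate_cds` are illustrative only.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# assignments and differs only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(seq: str) -> str:
    """Translate an in-frame DNA fragment, stopping at the first stop codon."""
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate_cds("ATGAATCTGACCGCGACCTTCGTCTTCGTG"))  # MNLTATFVFV
print(translate_cds("GTCGAGAGTTGA"))                    # VES
```

The two outputs match the first ten and last three residues of the protein sequence listed above.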