Gene BURPS1106A_0352 details


Gene Information

Locus tag: BURPS1106A_0352
Symbol:
ID: 4901396
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 324489
End bp: 325529
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 71%
IMG OID: 640133582
Product: sodium/bile acid symporter family protein
Protein accession: YP_001064635
Protein GI: 126452084
COG category: [R] General function prediction only
COG ID: [COG0385] Predicted Na+-dependent transporter
TIGRFAM ID:
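
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive (an assumption that is consistent with the numbers in this record) and that the CDS ends in a single stop codon; the function names are illustrative only:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(324489, 325529)
print(length_bp)                  # 1041
print(protein_length(length_bp))  # 346
```

Both values agree with the Gene Length and Protein Length fields of the record.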


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.446247
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGCGTT CCCGCTTCGT TCCCGACAAC TTCACGCTCG CGCTCGTCGG CACCGTCGTG 
CTCGCGAGCT TCCTGCCGTG CCGCGGCGAG GCCGCGCACG CGTTCAACTG GGCGACCGAC
ATCGCGGTCG GCCTGCTGTT CTTCCTGCAC GGCGCGAAGC TCTCGCGCGA AGCGATCGTC
GCGGGCGCGA CGCACTGGCG GCTGCATGCG CTCGTGCTGC TCAGCACGTT CGCGCTGTTC
CCGCTGCTCG GCCTGGCGCT CAAGCCCGTG CTCACGCCGC TCGTCACGCC CGCGCTGTAC
GCCGGCGTGC TGTTTCTCTG CACGCTGCCG TCGACGGTGC AGTCGTCGAT CGCGTTCACG
TCGATCGCCA AGGGCAACGT GCCGGCGGCC GTCTGCTCGG CGTCCGCGTC GAGCCTGCTC
GGCATCTTCG TCACGCCGGC GCTCGTCGGC GTGATGGTGT CGACGCAGGG CACGGGCGCG
ACGGCGTCGC CGTGGAGCAC GATCGGCGCG ATCGTGATGC AACTGCTCGT GCCGTTCGTC
GCCGGCCAGT TGCTGCGGCC GGTGATCGGC CGCTGGATCG AGCGCAATCG CGGCGTGCTG
CGCTTCGTCG ATCAGGGCTC GATCCTGCTC GTCGTCTACG TCGCGTTCAG CGAAGCGGTG
AACGAGGGGC TCTGGCACCA GATCCCGCCG ACGGCGCTCG CGGGCCTCGC CGTCGTCAAC
GTCGTGTTGC TCGCGATCGC GCTCGCGGTC ACGACGGTCG TCAGCAAGCG GCTCGGTTTC
AACCGCGCGG ACCAGATCAC GATCATCTTC TGCGGCTCGA AGAAGAGCCT CGCGGCCGGC
GTGCCGATGG CGAAGGTAAT CTTCGCCGCG CACGCGGTGG GCGCGGTCGT GCTGCCGCTG
ATGCTGTTCC ATCAGATTCA GCTGATGACC TGCGCGGCGC TCGCGCAGCG CTGGGGCGCG
CGCGACACGA GCCGCGAACG GCGGGCGGAC GCGCCCGGCG CCGGGGCGCT CGGTTCGGGC
GCGAGCGCGG CGAAGCGCTG A
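
The gene sequence is laid out in space-separated blocks of 10 bases, so it has to be joined before it can be measured. A small sketch of how the blocks could be cleaned and a GC percentage computed; the helper names are hypothetical, and the record does not say how its own GC content figure was derived or rounded:

```python
def clean_sequence(raw: str) -> str:
    """Join blocked sequence text by dropping all whitespace and upper-casing."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# The final line of the gene sequence as printed in this record.
last_line = clean_sequence("GCGAGCGCGG CGAAGCGCTG A")
print(len(last_line))  # 21
```

Passing the full cleaned gene sequence to `gc_percent` gives a value that can be compared with the GC content field.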
 
Protein sequence
MARSRFVPDN FTLALVGTVV LASFLPCRGE AAHAFNWATD IAVGLLFFLH GAKLSREAIV 
AGATHWRLHA LVLLSTFALF PLLGLALKPV LTPLVTPALY AGVLFLCTLP STVQSSIAFT
SIAKGNVPAA VCSASASSLL GIFVTPALVG VMVSTQGTGA TASPWSTIGA IVMQLLVPFV
AGQLLRPVIG RWIERNRGVL RFVDQGSILL VVYVAFSEAV NEGLWHQIPP TALAGLAVVN
VVLLAIALAV TTVVSKRLGF NRADQITIIF CGSKKSLAAG VPMAKVIFAA HAVGAVVLPL
MLFHQIQLMT CAALAQRWGA RDTSRERRAD APGAGALGSG ASAAKR
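
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below is a simplified illustration, not the database's own method: it builds the standard codon assignments, which translation table 11 shares, and it does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that case does not arise here). The function name is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a cleaned DNA string in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, with the block spacing removed.
print(translate("ATGGCGCGTTCCCGCTTCGTTCCCGACAAC"))  # MARSRFVPDN
```

The output matches the first block of the protein sequence, and translating the last four codons (GCG AAG CGC TGA) gives AKR followed by the stop, matching the end of the protein.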