Gene BURPS1106A_3744 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_3744 
Symbol 
ID4899281 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp3652636 
End bp3654147 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content68% 
IMG OID640136970 
Productsodium:alanine symporter family protein 
Protein accessionYP_001067974 
Protein GI126452834 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1115] Na+/alanine symporter 
TIGRFAM ID[TIGR00835] amino acid carrier protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.748793 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGGCT TTGTGCATGC GTTGATCGAC GGGATCAACG GCATTCTCTG GAATTACGTG 
CTGATTGCGC TGCTGCTCGG CGCGGGCGCG TGGTTCACGC TGCGCTTCAG GATGATTCAG
CTGCGCGCGC TGTTCCTCAG CATGCGCCTT GTCGGCAGCA AGGGCGAGCC GGGCAGCATC
TCGTCGTTCC AGGCGTTCGC GACCGGGCTC GCGAGCCGCG TCGGCACGGG CAACATCGCG
GGCGTCGCGG TTGCGATGAC GGTGGGCGGG CCGGGCGCGA TCTTCTGGAT GTGGATGACG
GCGCTCGTCG GGATGTCGTC CGCGTTCGTC GAGGCGACGC TCGCGCAAAT CTTCAAGGTG
TCGCATCATG ACGGCACGTA TCGCGGCGGC CCCGCGTATT ACATCCAGAT CGGGCTGCGC
TCGCGCGGTT TCGGCGTGCT GTTCTCGCTG TCGCTGATCC TCGCGTTCGG CTTCGTGTTC
AACGCGGTGC AGGCGAACGC GATCGCCGAG GCGTTCAACA CGTCGTTCGG CGTGAGCCGC
GCCGCCGTCG GGCTCGCACT CGTCGCGCTG ACGGCGCCGA TCATCTTCGG CGGGATTCGG
CGGATCGCGC ACGTCGCGCA GGTGATCGTG CCCGTGATGG CGATCGGCTA TCTCGCGCTC
GCGGTCTATG CGGTCGCGAC GCACGTCGCG CTCGTGCCGG ACATGATCGT GCTGATCGTG
AAGAGCGCGT TCGGCCTCGA GCAAGCGGCG GGCGGCCTGT CCGGCTACGC GGTGAGCCAG
GCGGTGTCGA TCGGCGTGAA ACGCGGCCTG TTCTCGAACG AGGCCGGCAT GGGCAGCGCG
CCGAACGCGG CCGCGACCGC GAGCACGCGG CATCCGGTCA CGCAGGGGCT GATCCAGATG
CTCGGCGTGT TCGTCGACAC GATCGTGATC TGCAGCGCGA CCGCGTTCGT GATCCTGTTG
TCGGGGCAGT ACGAGCCGGG CACGTCGATG GCCGGCGCGG CGCTCACGCA GCGCGCGATC
TCGAGCCACG TCGGCGACTG GGGAGGCATC TACATGGCGG TGGCGATCTT CTTCCTCGCG
TTTTCTTCGG TGATCGGCAA CTACGCGTAT GCGGAAGGCA ACGTCGGCTT CGTCACGAGC
CGGCGCGGCG CGTTGCTGGT GTTCCGGCTC GCGGTGCTCG GCATGGTGAT GTTCGGCAGC
GTCGGGCAAC TGCCGCTCGT GTGGGCAATG GCCGATACCA GCATGGGGCT GATGGCGCTC
ATCAATCTGG TCGCGATCCT GCTGCTTGGC AAGTACGCGC TCGCCGCATG GCGCGATTAT
CAGCGCCAGC GCGCGGCGGG CGTGGCCGAC CCGGTGTTCA CCCGCCAGAC GATTCCCGCG
CTCGCGAAGG TGCTCCCGGA CGACGTGTGG GGCGAGCACG GGCCGTTGCC GCAAGGCGAC
AAGCTCGCGG CGGGGCGCGC CTCGGGTGCC GACGCGGTAC GGGCGGCGGG GCCGGTCGGC
TCCGCGCGAT GA
 
Protein sequence
MEGFVHALID GINGILWNYV LIALLLGAGA WFTLRFRMIQ LRALFLSMRL VGSKGEPGSI 
SSFQAFATGL ASRVGTGNIA GVAVAMTVGG PGAIFWMWMT ALVGMSSAFV EATLAQIFKV
SHHDGTYRGG PAYYIQIGLR SRGFGVLFSL SLILAFGFVF NAVQANAIAE AFNTSFGVSR
AAVGLALVAL TAPIIFGGIR RIAHVAQVIV PVMAIGYLAL AVYAVATHVA LVPDMIVLIV
KSAFGLEQAA GGLSGYAVSQ AVSIGVKRGL FSNEAGMGSA PNAAATASTR HPVTQGLIQM
LGVFVDTIVI CSATAFVILL SGQYEPGTSM AGAALTQRAI SSHVGDWGGI YMAVAIFFLA
FSSVIGNYAY AEGNVGFVTS RRGALLVFRL AVLGMVMFGS VGQLPLVWAM ADTSMGLMAL
INLVAILLLG KYALAAWRDY QRQRAAGVAD PVFTRQTIPA LAKVLPDDVW GEHGPLPQGD
KLAAGRASGA DAVRAAGPVG SAR