Gene Bphy_4149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBphy_4149 
Symbol 
ID6245677 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia phymatum STM815 
KingdomBacteria 
Replicon accessionNC_010623 
Strand
Start bp1139220 
End bp1140185 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content59% 
IMG OID642595909 
Productaliphatic sulfonate ABC transporter periplasmic ligand-binding protein 
Protein accessionYP_001860316 
Protein GI186472974 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
[TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0138672 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTCAAC ATCGCATGTT CACACGCCGA ACCTTCCTGG CAGGAACCGG CGCACTCCTG 
GCCTCGACCG CGTTTTCTTC ATTCGCCGAT AGCCGTGCAA AAGAAATTCG CATCGGATAT
CAAAAGGCCG CAAGCACACT GGTGCTTTTG AAGGCACATG GAACGCTCGA AAAGCGCTTC
GCCGCTCAGG GCGTCAGTGT GAAATGGACG GAATTTCCTG CTGGTCCGCA GTTGCTTGAA
GGACTGAATG TCGGTTCGGT CGACTTCGGT TACGTTGGGG AGGCGCCTCC CGTCATCGCG
CAAGCCGCTG GCGCCAATTT CGTGTACACC GCGTATGAAA TTCCAACGCC GCAAGCCGAA
GGCATTCTTG TCCATCGCGA CGCACCGATT CAATCCGTTG CGGACCTGAA GGGGAAGCGC
GTAGCGTTTA ACAAGGGCTC CGACGTTCAT TGGTTTCTCG TCGCCGCGTT ACAGAAAGCC
GGCGTGAGCT ACCCCGATAT TCAGCCCGTT TTTCTGCCGC CCGCCGATGC GCGGGCGGCG
TTCGAGCGCG GGGCAATCGA TGCATGGGCC ATTTGGGATC CGTTCCTCGA AGCAGCAAAG
CGGCAATCGA ACGCGAGACT TTTGACCGAC GGTACGGGCA TCGTCAATCA CCACCAGTTC
TTTCTCAGCG CGCGCTCTTT CGCGCAGCAA AACCGGGGGC TGCTCGATGC CGTCGTTACC
GAAGTCGGGA AGGAAGGCGC GTGGGTTCGT GGACACTACG CAGAGGCGGC GGCACAGCTC
GCGCCGATTC AGGGGCTCGA CGCGAATGTC ATCGAAGCGG GCCTGCGACA CTATGCTCAT
GTCTACAAGC CGATCGATGC GGGTGTGCTG GCTGAACAGC AAAAGATCGC CGATGCGTTC
ACTGAGCTTC GCATCATTCC GACGAAGATC GTGACGAAGG AAGCGGTGCT CGACGCGAAG
GCTTGA
 
Protein sequence
MSQHRMFTRR TFLAGTGALL ASTAFSSFAD SRAKEIRIGY QKAASTLVLL KAHGTLEKRF 
AAQGVSVKWT EFPAGPQLLE GLNVGSVDFG YVGEAPPVIA QAAGANFVYT AYEIPTPQAE
GILVHRDAPI QSVADLKGKR VAFNKGSDVH WFLVAALQKA GVSYPDIQPV FLPPADARAA
FERGAIDAWA IWDPFLEAAK RQSNARLLTD GTGIVNHHQF FLSARSFAQQ NRGLLDAVVT
EVGKEGAWVR GHYAEAAAQL APIQGLDANV IEAGLRHYAH VYKPIDAGVL AEQQKIADAF
TELRIIPTKI VTKEAVLDAK A