Gene BURPS668_3957 details


Gene Information

Locus tag: BURPS668_3957
Symbol:
ID: 4883662
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 3858866
End bp: 3860005
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 65%
IMG OID: 640129885
Product: putative periplasmic substrate-binding protein
Protein accession: YP_001060950
Protein GI: 126441874
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
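
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_from_cds(cds_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
gene_length = span_length(3858866, 3860005)
protein_length = protein_length_from_cds(gene_length)
print(gene_length, protein_length)  # prints: 1140 379
```

Both results agree with the Gene Length (1140 bp) and Protein Length (379 aa) fields.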


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.877185
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAACACA CGATGAAAAA GCTGGCAGGC GCGACGTTCG TCGCGGTCAT GTCGCTCGCG 
GGGACGGCGC ACGCGGATGA CGTCAAGATC GGCTTTGCTG CGCCGATGAC GGGCGCGCAG
GCGCACTACG GCAAGGATAT GCAGAACGGG ATCGTGCTCG CGATCGAGGA CATCAACGCG
ACGAAGCCGA CGATCGGCGG CAAGCCGGTG AAGTTCGTGC TCGACACGCA GGACGATCAG
GCCGACCCGC GCACCGGCAC GACGGTCGCG CAGAAGCTCG TCGACGACGG CATCAAGGGC
ATGCTCGGCC ACTTCAACTC GGGCACGACG ATTCCGGCTT CGCGCATCTA CGCGAACGCG
GGCATCCCGC AGATCGCGAT GGCGACGGCG CCCGAGTACA CGACGCAGGG CTACAAGACG
ACCTTCCGGA TGATGACGTC CGACACGCAG CAGGGCTCGG TCGCGGGCAC GTTCGCGGTG
AAGGATCTCG GCATGAAGAA GATCGCGATC GTCGACGATC GCACCGCTTA CGGCCAGGGC
CTTGCCGACC AGTTCGAGAA GGCGGCGAAG GCGGCGGGCG CGACGATCGT CGATCGTGAA
TTCACGAACG ACAAGGCTGT CGACTTCAAG GCGATCCTGA CGAAGCTCAA GGCGACGAAG
CCGGACCTCG TCTACTACGG CGGCGCGGAT TCGCAGGCCG CGCCGATGGC CAAGCAGATG
AAGTCGCTCG GCGTCACGGC GCCGCTGATG GGCGGCGAGA TGGTGCACAC GCCGACCTTC
CTGAAGATCG CGGGCGACGC GGCCGAAGGC TCGATCGCTT CGCTCGCCGG CCTGCCGCTC
GCCGAAATGC CCGGCGGCAA GGCGTACGCG GACAAGTACA AGAAGCGCTT CGGCGAAGAC
GTGCAGACGT ACTCGCCGTA TGCGTACGAC GGCGCGATGG CGATGTTCAG CGCGATGAAG
AAGGCAAACT CGACGGACCC GGCGAAGTAT CTGCCGCTGC TCGCGAAGAC CGACATGGCG
GGCGTGACGT CGACGCACAT CGCGTATGAC GCGAAGGGCG ACCTGAGGAA CGGCGGCATC
ACGATGTACA AGGTCGAGAA GGGCGAATGG AAGCCCCTGA AGAGCATCGG CGGCAAGTAA
 
Protein sequence
MQHTMKKLAG ATFVAVMSLA GTAHADDVKI GFAAPMTGAQ AHYGKDMQNG IVLAIEDINA 
TKPTIGGKPV KFVLDTQDDQ ADPRTGTTVA QKLVDDGIKG MLGHFNSGTT IPASRIYANA
GIPQIAMATA PEYTTQGYKT TFRMMTSDTQ QGSVAGTFAV KDLGMKKIAI VDDRTAYGQG
LADQFEKAAK AAGATIVDRE FTNDKAVDFK AILTKLKATK PDLVYYGGAD SQAAPMAKQM
KSLGVTAPLM GGEMVHTPTF LKIAGDAAEG SIASLAGLPL AEMPGGKAYA DKYKKRFGED
VQTYSPYAYD GAMAMFSAMK KANSTDPAKY LPLLAKTDMA GVTSTHIAYD AKGDLRNGGI
TMYKVEKGEW KPLKSIGGK
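
The relationship between the gene sequence and the protein sequence above can be reproduced with a short script. This is a minimal sketch, not the database's own pipeline: the helper names `clean`, `translate`, and `gc_percent` are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in permitted start codons), and it assumes the sequence is given 5' to 3' on the coding strand, as listed.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 21 bases and last 12 bases of the gene sequence listed above.
print(translate("ATGCAACACA CGATGAAAAA G"))  # prints: MQHTMKK
print(translate("GGCGGCAAGTAA"))             # prints: GGK
```

The two fragments translate to the first seven and last three residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` gives a way to compare against the Protein Length and GC content fields.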