Gene BURPS668_1907 details

Gene Information

Locus tag: BURPS668_1907
Symbol:
ID: 4884221
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 1868955
End bp: 1869902
Gene Length: 948 bp
Protein Length: 315 aa
Translation table: 11
GC content: 68%
IMG OID: 640127835
Product: carbohydrate ABC transporter periplasmic sugar-binding protein
Protein accession: YP_001058942
Protein GI: 126440180
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1879] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
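
The start, end, and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1868955
end_bp = 1869902

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted
# in the protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 948 0 315
```

Under these assumptions the result agrees with the listed gene length (948 bp) and protein length (315 aa).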


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.0516898
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGCACA CCGCCTTGTC GACGCCGGCT TCCGGCCGCC GGGCCGCCCG CGCGCTGCGC 
GCCGCCGCGC TTTCGCTCGC GCTCGGCGCG GCGAGCGCCG CGCACGCGGC GCCGCTGAAG
ATCGGCATGA CGTTCCAGGA ACTGAACAAC CCGTACTTCG TGACGATGCA AAAGGCGCTC
GACGAGGCGG CCGCGTCGAT CGGCGCGCAG GTGATCGTCA CCGACGCGCA TCACGACGTG
AGCAAGCAGG TGAGCGACGT CGAGGACATG CTGCAGAAGA AGATCGACAT CCTGCTCGTG
AACCCGACCG ATTCGACGGG CATCCAGTCG GCCGTCGTGT CGGCGAAGAA GGCGGGCGCC
GTCGTCGTCG CGGTGGACGC GAACGCGAAC GGCCCGGTCG ACGCGTTCGT CGGCTCGAAG
AATTTCGACG CGGGCGCGAT GTCGTGCGAC TACCTCGCGA AGGCGATCGG CGGCGGCGGC
GAAGTCGCGA TCCTCGACGG CATCCCGGTC GTGCCGATTC TCGAGCGCGT GCGCGGCTGC
CGCGCGGCGC TCGCGAAATT CCCGAACGTG AGGATCGTCG ACGTGCAGAA CGGCAGGCAG
GAGCGCGCGA GCGCGCTCGC CGTCACCGAG AACATGATCC AGGCGCACCC GTTGCTCAAG
GGCGTCTTCA GCGTCAACGA CGGCGGCTCG ATGGGCGCGC TGTCCGCGAT CGAGGCGTCG
GGCCGCGACA TCAAGCTCAC GAGCGTCGAC GGCGCGCCGG AGGCGATCGC GGCGATGCAG
AAGCCGAACT CGAAGTTCAT CGAGACGTCC GCGCAGTTCC CGCGCGACCA GATTCGCCTC
GCGATCGGCA TCGGGCTCGC GAAGAAGTGG GGCGCCAATG TGCCGAAGGC GATTCCGGTC
GACGTGAAGC GGATCGACAA GGGCAACGCG AAGACGTTCA GTTGGTGA
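
The record lists a GC content of 68% for this gene. A value of that kind can be recomputed from the sequence text; the sketch below assumes GC content means the share of G and C bases over the full gene sequence, and the helper name `gc_percent` is hypothetical. It strips the spaces that separate the 10-base blocks before counting.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in bases)
    return 100.0 * gc / len(bases)


# First row of the gene sequence above, used only as a short demonstration;
# pass the complete sequence to compare against the listed figure.
first_row = "ATGATGCACA CCGCCTTGTC GACGCCGGCT TCCGGCCGCC GGGCCGCCCG CGCGCTGCGC"
print(round(gc_percent(first_row), 1))
```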
 
Protein sequence
MMHTALSTPA SGRRAARALR AAALSLALGA ASAAHAAPLK IGMTFQELNN PYFVTMQKAL 
DEAAASIGAQ VIVTDAHHDV SKQVSDVEDM LQKKIDILLV NPTDSTGIQS AVVSAKKAGA
VVVAVDANAN GPVDAFVGSK NFDAGAMSCD YLAKAIGGGG EVAILDGIPV VPILERVRGC
RAALAKFPNV RIVDVQNGRQ ERASALAVTE NMIQAHPLLK GVFSVNDGGS MGALSAIEAS
GRDIKLTSVD GAPEAIAAMQ KPNSKFIETS AQFPRDQIRL AIGIGLAKKW GANVPKAIPV
DVKRIDKGNA KTFSW
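
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below is one way to check that correspondence. It relies on the fact that table 11 assigns the same amino acids to codons as the standard table (the two differ in which codons may act as starts); the function name `translate` is illustrative, and the sketch does not handle alternative start codons, which is not an issue here because the gene starts with ATG.

```python
from itertools import product

# Amino acids for the 64 codons enumerated in T, C, A, G order.
# Translation tables 1 and 11 share these assignments.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()  # drop the spaces between blocks
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence above.
print(translate("ATGATGCACA CCGCCTTGTC GACGCCGGCT"))  # MMHTALSTPA
print(translate("AAGACGTTCA GTTGGTGA"))               # KTFSW
```

The two fragments reproduce the first ten and last five residues of the listed protein sequence; passing the full gene sequence allows the same comparison over all 315 residues.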