Gene BURPS668_1837 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_1837 
Symbol 
ID4884356 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp1801396 
End bp1802457 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content71% 
IMG OID640127765 
Productpermease 
Protein accessionYP_001058872 
Protein GI126439253 
COG category[R] General function prediction only 
COG ID[COG0701] Predicted permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAGCT CCCGTTCCTA TACACCGCAT CCGGCGCTCG GCCTCGCGAC GTTCGTCGTG 
CTGGCCGTCG CGGGCCTCTT CTACGTGAAG TGGTTTCCGT ATTATCACAA GGCGTTCGCC
GCGGCCGAAC ACCATTCGAT CGGCCAGTCG ATCCTGATGG GCGCCGCGGC GCACGCGCCG
CAGCCGTCGC TGCAGGCGGC GCTCGATTAC GCATGGGCGT ACGGCAAGGC GATCTGGCAG
GCGATGGTGC TGGGCCTGCT GCTCGGTTCG GCCGTGCAGG CGCTGCTGCC CGCGCACTGG
GTCGCGCGCG CGCTCGGCGG CACGGGCTTC GGCAGCGTCG CGGCGGGGGG GCTGCTGGCG
CTTCCCGGCA TGATGTGCAC GTGCTGCGCG GCGCCCGTCG TCGCGGGCCT GCGCGAGCGC
GACGCGTCGC CGGGCGGCGC GCTCGCGTTC TGGCTCGGCA ACACCGTGCT CAATCCCGCC
GCGCTCGTGT TCATGGGCTT CGTGCTCGGC TGGCACTGGA GCGCGCTGCG GCTCGTGCTC
GGCGTCGCGA TGGTGTTCGG CGTCGGTTAT CTGGTCAATC GCCTGGCCGG CGCGCAGCCG
CGCGTCGTCG ACGATGCGCT TCGCGCGAAG CTCGTCGCCG AGCAGGCGGC GGTCGGCAAC
GCGTTCGTGC GGTGGATGAA GATCTTCGCG CGGATGACCG TGCGCCTCGT GCCCGAATAC
CTGGTGCTCG TGCTGCTGCT CGGCGCGGCG CGCGCATGGC TGTTTCCGCA CATCGGGCCG
GACATCGGCA ACGGGGTCGG CTGGATCGTC GCGTTCGCGA TCGCCGGCAT GCTGTTCGTG
ATTCCGACCG CGGGCGAGGT GCCGATCATC CAGGCGATGC TCTCGCTCGG CATGGGCGTT
GGTCCGGCGG GTGCGCTGCT GATGACGCTG CCGCCCGTCA GCGTGCCGTC GCTCGCGATG
TTGGCGCGTT CGTTCAAGCC GGCGACGCTC GCGCTCGTCG CGGCGCTCGT CGTCGCGTTC
GGCGTGGTCG GCGGGCTGGC CGCCGTCGCG CTGGGGTTCT GA
 
Protein sequence
MSSSRSYTPH PALGLATFVV LAVAGLFYVK WFPYYHKAFA AAEHHSIGQS ILMGAAAHAP 
QPSLQAALDY AWAYGKAIWQ AMVLGLLLGS AVQALLPAHW VARALGGTGF GSVAAGGLLA
LPGMMCTCCA APVVAGLRER DASPGGALAF WLGNTVLNPA ALVFMGFVLG WHWSALRLVL
GVAMVFGVGY LVNRLAGAQP RVVDDALRAK LVAEQAAVGN AFVRWMKIFA RMTVRLVPEY
LVLVLLLGAA RAWLFPHIGP DIGNGVGWIV AFAIAGMLFV IPTAGEVPII QAMLSLGMGV
GPAGALLMTL PPVSVPSLAM LARSFKPATL ALVAALVVAF GVVGGLAAVA LGF