Gene BURPS668_A2642 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A2642 
Symbol 
ID4886692 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp2538068 
End bp2539126 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content64% 
IMG OID640132579 
Producthypothetical protein 
Protein accessionYP_001063635 
Protein GI126442642 
COG category[R] General function prediction only 
COG ID[COG0628] Predicted permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.00776691 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATAGCG GGAACGATCA CCAAAAATTC TTCCACCTGT TGCTTCTCGT CGTCACCGTC 
GGTCTTTGCT GGATATTGAC GCCGTTCTTC GGCGCGATCT TCTGGGGGAC CATTCTCGCG
ATCCTGTTCC AGCCCGTGCA GCGCTGGCTC GCCGCACGCT TCGGCAAGCG CCGCAATCTC
GCCGCGCTCG TCACGCTGTC GCTCATCATC CTGATCGTGA TCCTGCCGCT TGCGTTCGTG
ACCGCGACAC TCGTGCAGGA GATCGCGTAC GCGTATCAGC AGATCAAGAC GATGCAGCCG
AACATGACGC AGTACTTCCA GGAGTTCATG CACGCGCTGC CGAGCTCCGT GCATCGCGTG
CTGCACAATT ACGGGCTCAC CGACATCGCC GGCATCCAGA AGAAGCTGAC CGACGGCGCG
GCCGCGATCA GCCAGTTCGT GGCCGCGCAG GCGCTCAGCA TCGGGCAGAA CACGTTCCAG
TTCGTCGTGA GCTTCGGCGT GATGCTGTAC CTCGTGTTCT TCCTGTTGCG CGACGGCGGC
GAGATCGGCC GCCGCGTGCG GCGCGCGCTG CCGCTCGACG AAGAGCACAA GCAGCATCTG
CTGACGAAGT TCACGACGGT CGTGCGCGCG ACCGTCAAGG GCAACATCGC GGTCGCGGCC
GTGCAGGGCG CGCTCGGAGG CCTGATCTTC TGGATTCTCG GGATCGAGGG CGTGATTCTG
TGGGGCGCGC TGATGGCGTT CCTGTCGCTG CTGCCCGCGA TCGGCGCGGG GCTCGTATGG
GTGCCGGCCG CCGGCTATTT CGCGGTGACC GGGCAAATCT GGAAATGCGT GATTCTCGTC
GCGTTCTGCG TGGGCGTGAT CGGGCTCGTC GATAACCTGC TGCGGCCGAT CCTCGTCGGC
AAGGACACGA AGATGCCCGA TTGGGTCGTG CTGATCTCGA CGCTCGGCGG CATGGCGCTG
TTCGGCATCA ACGGCTTCGT GATCGGCCCG CTCGTCGCCG CGCTGTTCAT GGCGAGCTGG
GACATCTTCG CGCGCACCGA GCAGACCGAC TGGGAATGA
 
Protein sequence
MDSGNDHQKF FHLLLLVVTV GLCWILTPFF GAIFWGTILA ILFQPVQRWL AARFGKRRNL 
AALVTLSLII LIVILPLAFV TATLVQEIAY AYQQIKTMQP NMTQYFQEFM HALPSSVHRV
LHNYGLTDIA GIQKKLTDGA AAISQFVAAQ ALSIGQNTFQ FVVSFGVMLY LVFFLLRDGG
EIGRRVRRAL PLDEEHKQHL LTKFTTVVRA TVKGNIAVAA VQGALGGLIF WILGIEGVIL
WGALMAFLSL LPAIGAGLVW VPAAGYFAVT GQIWKCVILV AFCVGVIGLV DNLLRPILVG
KDTKMPDWVV LISTLGGMAL FGINGFVIGP LVAALFMASW DIFARTEQTD WE