Gene BURPS668_A3087 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A3087 
Symbol 
ID4886867 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp2927396 
End bp2928652 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content69% 
IMG OID640133023 
Producthypothetical protein 
Protein accessionYP_001064078 
Protein GI126442905 
COG category[S] Function unknown 
COG ID[COG4655] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTCGCG TCACTTCCTC GTCAGGCGGC GCCCGTCCAT GCGGGCGCCG CCGCCAGCGC 
GGCGTCGTGT CGATTCTCGT CGCGCTGATG CTCGCGGTGC TGATCGGCTT CGTCGGCCTC
GCGCTGGATC TCGGCAAGCT CTACGTGACG CGCAGCGAGC TGCAGAACAG CGCGGACGCG
TGCGCGCTCG CCGCGGCGCG GGATCTGACG GGTGCCATCA ATCTGTCCGT GCCGGAGGCG
GCCGGCATCA CCGCCGGCCA CCTCAACTAC GCGTTGTTCG AGCAGTTTCC GGTTCAGATG
CAGACGAACT CGAACGTCAC GTTCAGCGAT TCGCTGAGCA ATCCGTTTCA ACCGAAGAAC
GCGATCGCGT CGCCTTCGTC GATCAAGTAC GTGAAGTGCA CGACATCGCG CACGGGCATC
GTCAACTGGT TCATCCAGAC GCTCAACCTG GTGCCGGGCG TGACCGTGGC GAACGCGTCG
GTGTCCGCGA CGGCCGTGGC GACCGTCGGC GCCGCGCAGA CCACCTGCGC GATTCCGGTG
TTCATCTGCA AGGCCGGCAC GCAGACGAGC CCGCCCGTGG CCGGCGCGAC CTACAACATC
GGCGACTGGC TCTCCGCGAA GACGGGCTCG CCGCCGTCGT TCGGCGCGGG CAACTTCGGC
TGGTCGGCGC TCGACGGCTC GAACAGCGCG TCGTCGATCG CCAACGAGCT GACGGGCAAC
TACTGCGCGC TGCCCGCCAC CGGCTCGCAG GTCGGCACGC CCGGCGACAA GGCGGCGACG
ACCAACGCGT ACAACACGCG CTTCGGCATC TACGCGAATC CGTACAAGAA CCCGTCGTAC
GGCACGCCCG ACTTCACCGG CTTCGCCTAC GACGCGACCA CATGGCCCTC GCAGAGCAAC
GCGTATTCGG ACTTCGTCAG CAAGCGCCTG GCGTTCGCGA GCTATCAGGG CGACCTGATC
ACCGGCATCA ACACGGGCGG CTCGTACAAC CCGAGCTACT ACGCGGCGGG CGCCGACCGC
AGGCTCGCGC TCGCGCCCGA GGTGGACTGC TCGGTGCTGC TGAGCGGCCA CAGCGCGCCC
GTGCTCTCGT GGGATTGCGT GCTGATGCTC GACCCGATGG GCTCCGGCGG CAGCGCGACG
CCCGTGCATC TCGAGTACCG CGGCTCGTCG ACCGCGTCCG GCAGCCCGTG CGCGACGCAA
GGCACGCCGG GCAACGGCAG CTCGGTCGGC CCGCAGGTGC CCGTGCTGCT CCAATGA
 
Protein sequence
MSRVTSSSGG ARPCGRRRQR GVVSILVALM LAVLIGFVGL ALDLGKLYVT RSELQNSADA 
CALAAARDLT GAINLSVPEA AGITAGHLNY ALFEQFPVQM QTNSNVTFSD SLSNPFQPKN
AIASPSSIKY VKCTTSRTGI VNWFIQTLNL VPGVTVANAS VSATAVATVG AAQTTCAIPV
FICKAGTQTS PPVAGATYNI GDWLSAKTGS PPSFGAGNFG WSALDGSNSA SSIANELTGN
YCALPATGSQ VGTPGDKAAT TNAYNTRFGI YANPYKNPSY GTPDFTGFAY DATTWPSQSN
AYSDFVSKRL AFASYQGDLI TGINTGGSYN PSYYAAGADR RLALAPEVDC SVLLSGHSAP
VLSWDCVLML DPMGSGGSAT PVHLEYRGSS TASGSPCATQ GTPGNGSSVG PQVPVLLQ