Gene BURPS668_2349 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_2349 
Symbol 
ID4884195 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp2324337 
End bp2325752 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content69% 
IMG OID640128277 
Producthypothetical protein 
Protein accessionYP_001059381 
Protein GI126442036 
COG category[S] Function unknown 
COG ID[COG5267] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.475225 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGGCGG CGATGCAGAC CCTGCTCGAC GCGGACGACG CGCGCTTTCT GCTGACGCGC 
ACCGGCTTTT CGCCGCCGCC GCGCGCGCTC GCGCGCTTCG TCGGCATGAC GCGCGCGCAG
GCGCTCGCCG AACTGCTCGA CGGCGCCCGC ACGCAAAGCG TGACGCCGCC GCCCGACTGG
GTGCGCGAGC CGCCGCCGTC GCGCGCGATG CGCGCCGCGT TCTCGCCGGA CGAGCGGCGC
GCGGAACAAC AGCTTCGCAA TCGCCGCTAC GACGAGCTGC GCGCATGGTG GGTGCGCGAG
ATGATCGTGA CGCCTTCGCC GCTCACCGAG CGCATGACGC TCTTCTGGCA CAACCACTTC
ACGTCCGGCC AGGACAAGGT ACCGTTTCCG CAAACGATTG CGGCTCAGCA TGCGCTGCTG
CGCGCCAACG CGCTCGGCAA TTTCGGCGCG ATGCTGCACG GCGTCGCGAA GGATCCGGCG
ATGCTGCAGT ATCTCGATGG CGCGAGCAAT CGCAAGGGCC GCCCGAACGA GAACTTCGCG
CGCGAGGCGA TGGAACTTTT CACGCTCGGC GAAGGCCACT ATACGCAGCG CGACGTGTCC
GAGGCCGCGC GCGCGTACAC CGGCTGGGGG CTCGATCCCG ATGCGCTCAC GTACGTGTTC
CGGCCGAACG TTCACGACGA CGGCGTGAAG ACCGTGCTCG GCGAAACCGG GCGCTTCGAT
GGCGACGCGG TGCTCGACAT CCTGCTCGGG CGCCCCGAGA CCGCGCGCTT CGTCGTCGCG
AAGCTGTGGC GCGAATTCGT CTCCGATGCG CCGGATGCGG GCGAGGTCGA GCGCATCGCC
GCGCGCTTGC GGCAGAGCGA TTACGACATC CGCGCGGCGC TCACGGAGCT GTTTTCGTCC
GACGCATTCT GGGCCGAGCG CAACCGCGGC GTGCTCGTCA AGTCGCCGGC GGAATTCGTG
GTCGGCACGG TGAGGCTGTT CGACGTCGAT TACGTCGATG CCGCGCCGTT CGCGAACACG
TTGCGCGCGC TGGGTCAGAA CCTGTTCTAT CCGCCGAACG TGAAGGGCTG GCCGGGCGGC
GTGAGCTGGA TCAACAGCGC GACGCTGCTT GCGCGCAAGC AGTTCGTCGA GCAGATGATG
CGCGCGACCG AGGCGCCCGG CATGCGTGCG GCGCCCGTTT CCCGCGACAT GGCGGGCCAG
CCGGCGCCGA CGCGGCGCGG CGCGATGCGC TTCGATCTCG ACGCGTGGCT TGCCGCGTAC
CGGACGAAGC CGCAGGCGCA GCCGGATCTG TCGACGGAGC TGCAACTGCA GCACGCGGTG
CTGCCGATTT CGCCGGCCGC GGCGATCGAG GCGGGGGCGA CGAGCGGCGC GTATTTGCAG
GCCCTGTTGA TGGACCCGGC GTATCAACTG AAGTGA
 
Protein sequence
MPAAMQTLLD ADDARFLLTR TGFSPPPRAL ARFVGMTRAQ ALAELLDGAR TQSVTPPPDW 
VREPPPSRAM RAAFSPDERR AEQQLRNRRY DELRAWWVRE MIVTPSPLTE RMTLFWHNHF
TSGQDKVPFP QTIAAQHALL RANALGNFGA MLHGVAKDPA MLQYLDGASN RKGRPNENFA
REAMELFTLG EGHYTQRDVS EAARAYTGWG LDPDALTYVF RPNVHDDGVK TVLGETGRFD
GDAVLDILLG RPETARFVVA KLWREFVSDA PDAGEVERIA ARLRQSDYDI RAALTELFSS
DAFWAERNRG VLVKSPAEFV VGTVRLFDVD YVDAAPFANT LRALGQNLFY PPNVKGWPGG
VSWINSATLL ARKQFVEQMM RATEAPGMRA APVSRDMAGQ PAPTRRGAMR FDLDAWLAAY
RTKPQAQPDL STELQLQHAV LPISPAAAIE AGATSGAYLQ ALLMDPAYQL K