Gene BURPS668_2523 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_2523 
Symbol 
ID4883472 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp2493468 
End bp2494868 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content72% 
IMG OID640128451 
Producthypothetical protein 
Protein accessionYP_001059550 
Protein GI126440109 
COG category[S] Function unknown 
COG ID[COG4529] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACGA CGACGGTTGC GATTATCGGT GCGGGGTTTT GCGGGGCGAC GCTGGCGACG 
CATCTGCTGC GAAGGCCGCC GGTGCGGCCG ATGCGGGTGC TGCTGATCAA CCGGTCGGGC
GCGATGGCGC GCGGCGTGGC GTACGGCACG CGCGCGCTCG GCCATCTGCT GAACGTGCCC
GCCGGCCGGA TGAGCGCGGT GGCCGGCGAC GATGACGACT TCTATCGATA CGCGAGCGGG
CGCGATCCGC GCGTCGCGCG CGGCAGCTTC GTGCCGCGGC GGATCTACGG CGACTACCTC
GAGGCGCGCC TGACCGAGGC GATCGAGCAG GCGCACGCGG GCATCGAATT TCGTAGCGTG
GTGGGCAGCG CGGTGAGAAT CGCGCCCGTC GACGGCGGCG CGCGCGGCGC GATCACGATG
GACGGCGGCG CGGTGATCGA GGCCGACCGC GTCGTGCTGA GCAGCGGCAA CGAAATGCGC
CGCGATCCGT TCATCGCCGA ATCGCAACGC AAGTTCTACG ACAGCCATAC CTACGTTCGC
GATCCATGGC GGCCGGGCGC GCTGCGCGGC ATCGCGCCCG ATACGCCGGT GCTGCTCGTG
GGCAGCGGGC TCACGATGAT GGACGTGGTG CTCGATTTGC GCGCCCGGGG CCACGCGGCG
CCGATTCACG TGGTGTCGCG CCACGGGTTG ATGCCGCTCG CGCACCGTGA GATGGACGCG
CCGCCGTCCT ACGACGATCG GCTGGCGGCC CGTATGCTCG CGCGCGCGGA CGTGCGCCAT
TACGTGCGCG CGGTGCGCGA CGCGATTCGC CGAGGCGGCG ACTGGCGAGA CGTGATCGGT
TCGCTGCGCG CGGCGACGCC GGCGCTGTGG CGCCAGTTGC CGAGCGACGA GCGCCGGCGC
TTCCTGCGCC ATGTCAGGCC GTACTGGGAC GTGCATCGCC ACCGCTGCGC GCCCGAGCCG
GCCGCACGGC TGCAAGCGGA ATTCGAGCGA GGCGGCGTCG CGGCCGTCGC GGGGCGGGTG
ACGGGCTACA GCGAACATCC GAACGGCGTC GGCGTGACGG TGCGCCGGCG CGGCGCGGCC
GTCGACGAGC GTCTCGAGGT GGGCGCGGTC GTCAACTGCA CGGGGCCGGC ACCGGACTTC
AGCGCGCGGG CGGGATCGCT GCTCGGCAAC CTGTATGCGG ACGGGCTGAT CGTGCCGGAT
GCGATCGGCA TGGGGTTCGA GATCGCCGAC GACGGCGCGG TGCTCGATCG CGACGGCTCG
CCGTCGGCGT GGCTGCGTTA TGTCGGACCG TTGCTGCAGG CGCGCGATTG GGAGGCGACG
GCGGTGCCGG AACTGCGGCA GTACGTGCAG CGGCTCGCCG ATACGCTGCT CGCGCCGCGC
GACGAACGGG CGCTGACCTA G
 
Protein sequence
MSTTTVAIIG AGFCGATLAT HLLRRPPVRP MRVLLINRSG AMARGVAYGT RALGHLLNVP 
AGRMSAVAGD DDDFYRYASG RDPRVARGSF VPRRIYGDYL EARLTEAIEQ AHAGIEFRSV
VGSAVRIAPV DGGARGAITM DGGAVIEADR VVLSSGNEMR RDPFIAESQR KFYDSHTYVR
DPWRPGALRG IAPDTPVLLV GSGLTMMDVV LDLRARGHAA PIHVVSRHGL MPLAHREMDA
PPSYDDRLAA RMLARADVRH YVRAVRDAIR RGGDWRDVIG SLRAATPALW RQLPSDERRR
FLRHVRPYWD VHRHRCAPEP AARLQAEFER GGVAAVAGRV TGYSEHPNGV GVTVRRRGAA
VDERLEVGAV VNCTGPAPDF SARAGSLLGN LYADGLIVPD AIGMGFEIAD DGAVLDRDGS
PSAWLRYVGP LLQARDWEAT AVPELRQYVQ RLADTLLAPR DERALT