Gene BURPS668_A2761 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A2761 
Symbol 
ID4887777 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp2631985 
End bp2633175 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content68% 
IMG OID640132697 
Productsulfotransferase domain-containing protein 
Protein accessionYP_001063753 
Protein GI126443187 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCATG CAGCAACCGG CGCGGCCATC GGCACGCCCA TCGGCGCGCC CATCGGCGCC 
GTTCGTCCCC GCCCCGTGCT GATGATTCCG CTGCGGCGCT GCGGCAGCCA CGCGCTGCGG
CTGCGGCTGA ATCTCAATTC GCAGTTCTAT TCGCCATATC CGCTGCACAT CGTCGACTTC
ATGCCGCTGT TGCCGCTGTA CGGCGATCTC GCCGACGACC GCGCGTATTT CCGGCTCGTC
GCCGACATCG TCGGCCTGCA GGCGGCGAGC ATGGTCAAGT GGCCCGGCGT CGCGTTCGAT
CCGGTCGAGA TCTTCGACGC GCTTCGGCAC GCGCCGCGCA GCGCCCATCG CATCGTCTGG
GAGCTGCTGC TGCGCGCGGG CGAGCGCGAA GGCGCGCGCG TCGTGATGGA CAAGTCGCTC
GACAGCGTGC ACTACGCCGA CGAGCTGATG ACGCTGTATC CGGACATGCT GTTCCTGAAC
GTCGTGCGCG ATCCGCGCGC GCAGGTCGCG TCGATGAACC GCGCGATCAT TCACGATTTC
GATACGCTGC TCAACGCGCA GGCGTGGGTG GCCGCGCATC GCGCGGCCGA TGTCCTGATC
GCGCGCCATC CGCAGCGCGT GCTGACGATT CGCTACGAGG ATTTCCTGTC GGATCAGGCG
CACACGTTGC AGCGCGTATG CGCGTTCTTC GGCATCGATT TCCTGCCGCG GATGCTCGAC
ATCGCGAATT CGCCGGAGGC GCGGCATATC TCGCGCATGT CCGAGCTGTG GGCGTCGAAC
TGTTTCGCGC CGATCGCCGC GAATGCGGAC AAGTTCAAGC AGCAGCTATC GACTGCCGAG
ATCGCGACGA TCGAGACGCT CGCGCACGAA TACATGCAAC GCTACGGCTA TCAGCAGATG
ACCGACGCGA CCGCGATGCC CGACGCGTTC GCCGCCGCCG CCGCGCGCCG CCGCTCCGAC
GCGCGGCGAC GGCACGCATG GCGCGAGCTC GAGCAGTCGA ATTTCCGTGA TTTCGTGCTG
CGCCGGCATC GCGCCGACTA TCTCGAGACG GTGCGCGCCC GCTTGCAGCG GCATGCGAGC
GCGCAGGCGG ATTCGCGTGC CGATTTGCGT GCCGATTCGC CGGCCGATTC GCCGGCCGGC
GCGCCCGGGC GGCGCGATAC GCTGACCGCG GCCTTCGACG TAACCGACTG A
 
Protein sequence
MTHAATGAAI GTPIGAPIGA VRPRPVLMIP LRRCGSHALR LRLNLNSQFY SPYPLHIVDF 
MPLLPLYGDL ADDRAYFRLV ADIVGLQAAS MVKWPGVAFD PVEIFDALRH APRSAHRIVW
ELLLRAGERE GARVVMDKSL DSVHYADELM TLYPDMLFLN VVRDPRAQVA SMNRAIIHDF
DTLLNAQAWV AAHRAADVLI ARHPQRVLTI RYEDFLSDQA HTLQRVCAFF GIDFLPRMLD
IANSPEARHI SRMSELWASN CFAPIAANAD KFKQQLSTAE IATIETLAHE YMQRYGYQQM
TDATAMPDAF AAAAARRRSD ARRRHAWREL EQSNFRDFVL RRHRADYLET VRARLQRHAS
AQADSRADLR ADSPADSPAG APGRRDTLTA AFDVTD