Gene BURPS1106A_A2230 details


Gene Information

Locus tag: BURPS1106A_A2230
Symbol:
ID: 4905850
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 2216503
End bp: 2217681
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 68%
IMG OID: 640145335
Product: hypothetical protein
Protein accession: YP_001076263
Protein GI: 126456473
COG category: [S] Function unknown
COG ID: [COG3287] Uncharacterized conserved protein
TIGRFAM ID:
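
The coordinate and length fields in this record are mutually consistent, which a short sketch can confirm. It assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2216503, 2217681)
print(length)                     # 1179
print(protein_length_aa(length))  # 392
```

Both values match the Gene Length and Protein Length fields listed in the record.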


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGGGGA GATCTGTCGT GACCATGTTG TCTTCCACGA TTCCCGCCGT GCACTCGACG 
TGCGCGCACG CGCGCGACGC CGTGCGGGAA GTGCATGCTG CGCTCGCGAA CTGCGACGCC
GAACTGGTGT TGTTCTTCTG CTCGAGCCGC TTCGATCTCG ACGCGCTCGC GGACGAGATG
CGCGAGCGGT TTCGCGGCAC GCGCGTGATC GGCTGCACGA CAGCCGGCGA GATCGGGCCG
GCCGGCTACC GCAACGACAG TCTCGTCGCG GTTGCGCTGC CGCGCGCGCT GTTTACCGTC
GAAACCGCGC TGCTCGAGGA TCTGCAGACG TTTACGATCG CAAGCGGGCA TGCCTGCGCG
CTCGACGCGC TGCACGATCT GGAGCGACGC GCGCCGCGCG CGAGCGGCGC GAATTCGTTC
GCGCTGCTGT TGATCGACGG ATTGTCGGTG CGCGAGGAGC CCGTCACGCG CACGCTGCAG
GGCGCGCTCG GCGACATCGC GCTCGTCGGC GGCTCGGCGG CCGACGATCT GCGTTTCGAG
CGAACCGCGA TCTTCTACGA CGGGCGGTTC CGCGACGATT GCGCGGCGCT GATCGTCGCG
TCGACCGCGC TGCCGTTTCG CACGTTCAAG ACCCAGCATT TCCGCTGCGG CACCGAGCGG
CTCGTCGTCA CGCAGGCGGA TGCGGAACGC CGCACCGTCA GCGAGATCAA CGGGCTGCCC
GCCGCGGAGG AATACGCGCG CCTCATCGGC GCGCGCGTCG AGGATCTCAG CCCCGGCCAC
TTCGCGGCGG CGCCCGTCGT CGTGCTGATC GACGGCACCG ATTACGTGCG ATCGATCCAG
AAGCTCAACC CGGACGGCAG CCTCACGTTC TACTGCGCGA TCGAGGAGGG CCTCGTGCTG
CGCGTGGCGC GCGCGCTCGA TCTCGTCGAC AACCTGCAGG CGACGTTCGG CGATTTGCGC
GACTCGTTCG GCGAGCCGCA GCTCGTGCTC GCGTGGGATT GCATCCTGCG CCATCTCGAG
ATGATGCAGC GGGGCACGCG CGATACCGCG GCGGAGGTGC TGAAGGCGAA CCATGCCGTC
GGCTTCAGCA CCTACGGCGA ACAGTACGGC GGCGTTCACG TGAACCAGAC GCTCACCGGC
ATCGTCTTCA GTCGCGCGCC GGAGCCCGAC CGTGGCTGA
 
Protein sequence
MKGRSVVTML SSTIPAVHST CAHARDAVRE VHAALANCDA ELVLFFCSSR FDLDALADEM 
RERFRGTRVI GCTTAGEIGP AGYRNDSLVA VALPRALFTV ETALLEDLQT FTIASGHACA
LDALHDLERR APRASGANSF ALLLIDGLSV REEPVTRTLQ GALGDIALVG GSAADDLRFE
RTAIFYDGRF RDDCAALIVA STALPFRTFK TQHFRCGTER LVVTQADAER RTVSEINGLP
AAEEYARLIG ARVEDLSPGH FAAAPVVVLI DGTDYVRSIQ KLNPDGSLTF YCAIEEGLVL
RVARALDLVD NLQATFGDLR DSFGEPQLVL AWDCILRHLE MMQRGTRDTA AEVLKANHAV
GFSTYGEQYG GVHVNQTLTG IVFSRAPEPD RG
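
The sequences above are printed in blocks of 10 characters separated by spaces, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup together with a GC-content calculation of the kind summarised in the GC content field. The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by removing all whitespace."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base blocks of the gene sequence, used only as a small example.
fragment = clean_sequence("ATGAAGGGGA GATCTGTCGT")
print(len(fragment))         # 20
print(gc_percent(fragment))  # 50.0
```

Passing the complete gene sequence through the same two functions would give a figure comparable to the GC content field, subject to how the database rounds the value.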