Gene BMA10229_A3175 details

Gene Information

Locus tag: BMA10229_A3175
Symbol:
ID: 4791772
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia mallei NCTC 10229
Kingdom: Bacteria
Replicon accession: NC_008836
Strand:
Start bp: 3214260
End bp: 3215660
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: hypothetical protein
Protein accession: YP_001029115
Protein GI: 124385214
COG category:
COG ID:
TIGRFAM ID:
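
The Start bp, End bp, Gene Length, and Protein Length fields are related by simple arithmetic. The sketch below checks that relationship. It assumes, without confirmation from this record, that the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and exist only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3214260
END_BP = 3215660
GENE_LENGTH_BP = 1401
PROTEIN_LENGTH_AA = 466


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Compare the derived values against the fields listed in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both comparisons hold for the values listed here: the coordinate span equals the stated gene length, and the codon count minus the stop codon equals the stated protein length.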


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.333375
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCACGA CGACGGTTGC GATTATCGGT GCGGGGTTTT GCGGGGCGAC GCTGGCGACG 
CATCTGCTGC GAAGGCCGCC GGTGCGGCCA ATGCGGGTGC TGCTGATCAA CCGGTCGGGC
GCGATGGCGC GCGGCGTGGC GTACGGCACG CGCGCGCTCG GCCATCTGCT GAACGTGCCC
GCCGGCCGGA TGAGCGCGGT GGCCGGCGAC GATGACGACT TCTATCGATA CGCGAGCGGG
CGCGATCCGC GCGTCGCGCG CGGCAGCTTC GTGCCGCGGC GGATCTACGG CGACTACCTC
GAGGCGCGCC TGACCGAGGC GATCGAGCAG GCGCACGCGG GCATCGAATT TCGTAGCGTG
GTGGGCAGCG CGGTGAGAAT CGCGCCCGTC GACGGCGGCG CGCGCGGCGC GATCACGATG
GACGGCGGCG CGGTGATCGA GGCCGACCGC GTCGTGCTGA GCAGCGGCAA CGAAATGCGC
CGCGATCCGT TCATCGCCGA ATCGCAACGC AAGTTCTACG ACAGCCATGC CTACGTTCGC
GATCCATGGC GGCCGGGCGC ACTGCGCGGC ATCGCGCCCG ATACGCCGGT GCTGCTCGTG
GGCAGCGGGC TCACGATGAT GGACGTGGTG CTCGATTTGC GCGCCCGGGG CCACGCGGCG
CCGATTCACG TGGTGTCGCG CCACGGGTTG ATGCCGCTCG CGCACCGTGA GATGGACGCG
CCGCCGTCCT ACGACGATCG GCTGGCGGCC CGCATGCTCG CGCGCGCGGA CGTGCGCCAT
TACGTGCGCG CGGTGCGCGA CGCGATTCGC CGAGGCGGCG ACTGGCGAGA CGTGATCGGT
TCGCTGCGCG CGGCGACGCC GGCGCTGTGG CGCCAGTTGC CGAGCGACGA GCGCCGGCGC
TTCCTGCGCC ATGTCAGGCC GTACTGGGAC GTGCATCGCC ACCGCTGCGC GCCCGAGCCG
GCCGCACGGC TGCAAGCGGA ATTCGAGCGA GGCGGCGTCG CGGCCGTCGC GGGGCGGGTG
ACGGGCTACA GCGAGCATCC GAACGGCGTC GGCGTGACGG TGCGCCGGCG CGGCGCGGCC
GTCGACGAGC GTCTCGAGGT GGGCGCGGTC GTCAACTGCA CGGGGCCGGC ACCGGACTTC
AGCGCGCGGG CGGGATCGCT GCTCGGCAAC CTGTATGCGG ACGGGCTGAT CGTGCCGGAT
GCGATCGGCA TGGGGTTCGA GATCGCCGAC GACGGCGCGG TGCTCGATCG CGACGGCTCG
CCGTCGGCGT GGCTGCGTTA TGTCGGACCG TTGCTGCAGG CGCGCGATTG GGAGGCGACG
GCGGTGCCGG AACTGCGGCA GTACGTGCAG CGGCTCGCCG ATACGCTGCT CGCGCCGCGC
GACGAACGGG CGCTGACCTA G
 
Protein sequence
MSTTTVAIIG AGFCGATLAT HLLRRPPVRP MRVLLINRSG AMARGVAYGT RALGHLLNVP 
AGRMSAVAGD DDDFYRYASG RDPRVARGSF VPRRIYGDYL EARLTEAIEQ AHAGIEFRSV
VGSAVRIAPV DGGARGAITM DGGAVIEADR VVLSSGNEMR RDPFIAESQR KFYDSHAYVR
DPWRPGALRG IAPDTPVLLV GSGLTMMDVV LDLRARGHAA PIHVVSRHGL MPLAHREMDA
PPSYDDRLAA RMLARADVRH YVRAVRDAIR RGGDWRDVIG SLRAATPALW RQLPSDERRR
FLRHVRPYWD VHRHRCAPEP AARLQAEFER GGVAAVAGRV TGYSEHPNGV GVTVRRRGAA
VDERLEVGAV VNCTGPAPDF SARAGSLLGN LYADGLIVPD AIGMGFEIAD DGAVLDRDGS
PSAWLRYVGP LLQARDWEAT AVPELRQYVQ RLADTLLAPR DERALT