Gene BMA10229_A2433 details

Gene Information

Locus tag: BMA10229_A2433
Symbol:
ID: 4793760
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia mallei NCTC 10229
Kingdom: Bacteria
Replicon accession: NC_008836
Strand:
Start bp: 2471944
End bp: 2473083
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: hypothetical protein
Protein accession: YP_001028392
Protein GI: 124385157
COG category:
COG ID:
TIGRFAM ID:
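
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information fields above.
start_bp = 2471944
end_bp = 2473083
gene_length_bp = 1140
protein_length_aa = 379


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1140
print(expected_protein_length(gene_length_bp))  # 379
```

Both results agree with the Gene Length and Protein Length fields of this record.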


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCCGC TGCAACGACA ACAACAACCC GCCGGCGCCG CGCGCCGCCG CTTCTGGCGC 
GGCGCGCAGG TCGCGCTCGC GAGCGCCGCG TTCGCGCTGC TCGCCGCCTG CGGCGGCGGC
GACGACAACG GCTCGTCGCA GCCGAGCGCC GGCGTGAACA TGCAGGTCGT GTCCTTCGGC
GACAGCCTGT CGGACGTCGG CACGTATTCG CCGCAGATCC TGATCGGCTT CGGCGGCGGG
CGCTTCACGA CGAATCCGGG CCAGGTATGG ACGCAGGACG TCGCCGCCTA CTACGGCGAC
ACGCTCACCC CGGCGTTCGA AGGCGGCTTC GGCGTGCCGC TGCAGGCGGC GGGCGGCCTG
GGCTACGCGC AGGGCGGCTC GCGCGTCACG CAGCAGCCGG GCATCGGCCA CGCGGACGCG
AGCGTCGCGA ACGCCGACTA CGCGCAGGCG ACGACCGTGC CCGTCGCGAC GCAGGTGCAG
CAATACCTGC AGCAGCACGG CAGCTTCAAC GCGAATCAGA TCGTGCTCGT CAACGGCGGC
GCGAACGACA TCTTCTATCA GGTGCAGGTC GCGCAGGCTC AGGGCAATAC GCCCGCCGCG
CAGCTCGCCG CCGCGCAGCA GATCGGCCTC GCCGCGCAGC AGCTCGCGGG CGTCGTCCAG
CAGATCGTCG CGGCGGGCGC GACGCACGTG TTCGTATCGA ACGTGCCGGA CATCGGCGGC
ACGCCGCTCG CGGCGTCGAC GGGCCAGCAG GCCGCGCTCA CGCAGTTGTC GACGATCTTC
AACAGCACGC TCGTCGCGGC GCTGAAGGCG CTGAACGTCG ATCCCGCGAA GGCCGTGCTG
ATCGACGCAT TCACGTGGCA GGACGGCATC GCCGCGAACT ACCAGGGCAA CGGCTTCTCG
GTGGCGAACA CGGGCACCGC GTGCAACCTG CAATCGATGA TCGCCGCCGC GACGAAGGCG
GGGGTCGCGA ACCCGACCGC GTTCGGCTCG TCGCTGTTCT GCTCGCCGCA GATGTACACG
GTCGCGAACG CGGACCAGAC GTACATGTTC GCCGACACGG TCCACCCGAC GACGCGCCTG
CACGCGCTCT TCGCGCAATA CGTCGAGCAG CAGATCGCGA AAACGGGCGT CGGCAAGTAA
 
Protein sequence
MNPLQRQQQP AGAARRRFWR GAQVALASAA FALLAACGGG DDNGSSQPSA GVNMQVVSFG 
DSLSDVGTYS PQILIGFGGG RFTTNPGQVW TQDVAAYYGD TLTPAFEGGF GVPLQAAGGL
GYAQGGSRVT QQPGIGHADA SVANADYAQA TTVPVATQVQ QYLQQHGSFN ANQIVLVNGG
ANDIFYQVQV AQAQGNTPAA QLAAAQQIGL AAQQLAGVVQ QIVAAGATHV FVSNVPDIGG
TPLAASTGQQ AALTQLSTIF NSTLVAALKA LNVDPAKAVL IDAFTWQDGI AANYQGNGFS
VANTGTACNL QSMIAAATKA GVANPTAFGS SLFCSPQMYT VANADQTYMF ADTVHPTTRL
HALFAQYVEQ QIAKTGVGK
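
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the tool used to produce this record. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts. Alternative start codons are not handled here, which does not matter for this gene because it begins with ATG. The helper names `clean`, `translate` and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display; upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate codon by codon from position 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of bases that are G or C."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence shown above.
first_line = "ATGAATCCGC TGCAACGACA ACAACAACCC GCCGGCGCCG CGCGCCGCCG CTTCTGGCGC"
print(translate(first_line))  # MNPLQRQQQPAGAARRRFWR
```

The printed residues match the first 20 residues of the protein sequence. Passing the complete gene sequence to `translate` and `gc_percent` would allow the Protein Length and GC content fields to be checked in the same way.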