Gene BMA10229_A2601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMA10229_A2601 
SymbolhemL 
ID4792201 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei NCTC 10229 
KingdomBacteria 
Replicon accessionNC_008836 
Strand
Start bp2641034 
End bp2642317 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content71% 
IMG OID 
Productglutamate-1-semialdehyde aminotransferase 
Protein accessionYP_001028558 
Protein GI124384030 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGAACA ATCAAACTCT CTTCGAACGC GCCCAGCGAA CCATCCCGGG CGGCGTCAAT 
TCGCCGGTGC GGGCGTTCCG TTCGGTCGGC GGCACGCCGC GCTTCGTCGC GCGTGCGCAG
GGCGCGTACT TCTGGGACGC GGACGGCAAG CGCTACATCG ACTACATCGG CTCGTGGGGG
CCGATGATCG TCGGCCACGT GCACCCGGAC GTGCTCGCGG CCGTGCAGCG CGTGCTCGCC
GACGGCTTCT CGTTCGGCGC GCCCACCGAA GCCGAAATCG AGATCGCCGA GGAGATCTGC
AAGCTCGTGC CGTCGATCGA GCAGGTGCGG ATGGTGTCGA GCGGCACCGA AGCGACGATG
AGCGCGCTGC GCCTCGCGCG CGGCTTCACC GGCCGCAGCC GGATCGTCAA GTTCGAGGGC
TGCTATCACG GCCATGCGGA CAGCCTGCTC GTGAAGGCGG GCTCGGGCCT GCTCACGTTC
GGCAATCCGA CCTCGGCGGG CGTGCCGGCC GACGTCGCGA AGCACACGAC CGTGCTCGAG
TACAACAACG TCGCGGCGCT CGAGGAAGCA TTCGCCGCGT TCGGCGGCGA GATCGCCGCG
GTGATCGTCG AGCCCGTCGC GGGCAACATG AACCTCGTGC GCGGCACGCC GGAGTTCCTG
AACGCGCTGC GCGCGCTCAC CGCGAAGCAC GGCGCCGTGC TGATCTTCGA CGAAGTGATG
TGCGGCTTTC GCGTCGCGCT CGGCGGCGCG CAGCAGCACT ACGGGATCAC GCCGGATCTG
ACCTGCCTCG GCAAGGTGAT CGGCGGCGGC ATGCCGGCCG CCGCGTTCGG CGGCCGCGGC
GACATCATGT CGCACCTCGC GCCGCTCGGC GACGTCTATC AGGCGGGCAC CCTGTCGGGC
AACCCGGTCG CGGTCGCGGC GGGCCTCGCG ACGCTGCGGC TGATCCAGGC GCCGGGCTTT
CACGATGCGC TCGCCGACAA GACCCGGCGG CTCGCCGACG GCCTCGCGGC CGAGGCGCGC
GCGGCGGGCG TGCCGTTCTC GGCCGACGCG ATCGGCGGGA TGTTCGGCCT CTACTTCACC
GAGCAGGTGC CCGCGAGCTT CGCCGACGTG ACGAAGAGCG ACATCGAGCG CTTCAACCGC
TTCTTCCATC TGATGCTCGA CGCCGGCGTG TACTTCGCGC CCTCCGCGTA CGAAGCGGGC
TTCGTGTCGA GCGCGCACGA CGACGCGACG CTCGACGCGA CGCTCGACGC CGCCCGCCGC
GCGTTCGCCG CGCTGCGTGC CTGA
 
Protein sequence
MSNNQTLFER AQRTIPGGVN SPVRAFRSVG GTPRFVARAQ GAYFWDADGK RYIDYIGSWG 
PMIVGHVHPD VLAAVQRVLA DGFSFGAPTE AEIEIAEEIC KLVPSIEQVR MVSSGTEATM
SALRLARGFT GRSRIVKFEG CYHGHADSLL VKAGSGLLTF GNPTSAGVPA DVAKHTTVLE
YNNVAALEEA FAAFGGEIAA VIVEPVAGNM NLVRGTPEFL NALRALTAKH GAVLIFDEVM
CGFRVALGGA QQHYGITPDL TCLGKVIGGG MPAAAFGGRG DIMSHLAPLG DVYQAGTLSG
NPVAVAAGLA TLRLIQAPGF HDALADKTRR LADGLAAEAR AAGVPFSADA IGGMFGLYFT
EQVPASFADV TKSDIERFNR FFHLMLDAGV YFAPSAYEAG FVSSAHDDAT LDATLDAARR
AFAALRA