Gene BURPS668_3011 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_3011 
SymbolhemL 
ID4884563 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp2956366 
End bp2957808 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content71% 
IMG OID640128939 
Productglutamate-1-semialdehyde aminotransferase 
Protein accessionYP_001060024 
Protein GI126441157 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0001] Glutamate-1-semialdehyde aminotransferase 
TIGRFAM ID[TIGR00713] glutamate-1-semialdehyde-2,1-aminomutase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGACAT CCCGCCGCCG CCCCGCCCCC GCGCGCCGAA CCGCCGCAGC CGCCCCGAAA 
CCCGTCGCGC GCGTCTCCGG TACAATCAGT CGGATCGATA CGCGCGACGC GAGCCGCGGC
GCTTCGCCGC TCGCGCGCAC TCTCTCGCAG GTCCAATTCA TGTCGAACAA TCAAACTCTC
TTCGAACGCG CCCAGCGAAC CATCCCGGGC GGCGTCAATT CGCCGGTGCG GGCGTTCCGC
TCGGTCGGCG GCACGCCGCG CTTCGTCGCG CGCGCGCAGG GCGCGTACTT CTGGGACGCG
GACGGCAAGC GCTACATCGA CTACATCGGC TCGTGGGGGC CGATGATCGT CGGCCACGTG
CACCCGGACG TGCTCGCGGC CGTGCAGCAC GTGCTCGCCG ACGGCTTCTC GTTCGGCGCG
CCCACCGAAG CCGAAATCGA GATCGCCGAG GAGATCTGCA AGCTCGTGCC GTCGATCGAG
CAGGTGCGGA TGGTGTCGAG CGGCACCGAA GCGACGATGA GCGCGCTGCG CCTCGCGCGC
GGCTTCACCG GCCGCAGCCG GATCGTCAAG TTCGAGGGCT GCTATCACGG CCATGCGGAC
AGCCTGCTCG TGAAGGCGGG CTCGGGCCTG CTCACGTTCG GCAATCCGAC CTCGGCGGGC
GTGCCGGCCG ACGTCGCGAA GCACACGACC GTGCTCGAGT ACAACAACGT CGCGGCGCTC
GAGGAAGCGT TCGCCGCGTT CGGCGGCGAG ATCGCCGCGG TGATCGTCGA GCCCGTCGCG
GGCAACATGA ACCTCGTGCG CGGCACGCCG GAGTTCCTGA ACGCGCTGCG CGCGCTCACC
GCGAAGCACG GCGCCGTGCT GATCTTCGAC GAAGTGATGT GCGGCTTTCG CGTCGCGCTC
GGCGGCGCGC AGCAGCACTA CGGGATCACG CCGGATCTGA CCTGCCTCGG CAAGGTGATC
GGCGGCGGCA TGCCGGCCGC CGCGTTCGGC GGCCGCGGCG ACATCATGTC GCACCTCGCG
CCGCTCGGCG GCGTCTATCA GGCGGGCACG CTGTCGGGCA ACCCGGTCGC GGTCGCGGCG
GGCCTCGCGA CGCTGCGGCT GATCCAGGCG CCGGGCTTTC ACGATGCGCT CGCCGACAAG
ACCCGGCGGC TCGCCGACAG CCTCGCCGCC GAGGCGCGCG CGGCGGGCGT GCCGTTCTCG
GCCGACGCGA TCGGCGGGAT GTTCGGCCTC TACTTCACCG AGCAGGTGCC CGCGAGCTTC
GCCGACGTGA CGAAGAGCGA CATCGAGCGC TTCAACCGCT TCTTCCATCT GATGCTCGAC
GCCGGCGTGT ACTTCGCGCC CTCCGCGTAC GAAGCGGGCT TCGTGTCGAG CGCGCACGAC
GACGCGACGC TCGACGCGAC GCTCGACGCC GCCCGCCGCG CGTTCGCCGC GCTGCGTGCC
TGA
 
Protein sequence
MPTSRRRPAP ARRTAAAAPK PVARVSGTIS RIDTRDASRG ASPLARTLSQ VQFMSNNQTL 
FERAQRTIPG GVNSPVRAFR SVGGTPRFVA RAQGAYFWDA DGKRYIDYIG SWGPMIVGHV
HPDVLAAVQH VLADGFSFGA PTEAEIEIAE EICKLVPSIE QVRMVSSGTE ATMSALRLAR
GFTGRSRIVK FEGCYHGHAD SLLVKAGSGL LTFGNPTSAG VPADVAKHTT VLEYNNVAAL
EEAFAAFGGE IAAVIVEPVA GNMNLVRGTP EFLNALRALT AKHGAVLIFD EVMCGFRVAL
GGAQQHYGIT PDLTCLGKVI GGGMPAAAFG GRGDIMSHLA PLGGVYQAGT LSGNPVAVAA
GLATLRLIQA PGFHDALADK TRRLADSLAA EARAAGVPFS ADAIGGMFGL YFTEQVPASF
ADVTKSDIER FNRFFHLMLD AGVYFAPSAY EAGFVSSAHD DATLDATLDA ARRAFAALRA