Gene BURPS1106A_2023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_2023 
Symbol 
ID4901860 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp1988294 
End bp1989454 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content71% 
IMG OID640135253 
ProductL-allo-threonine aldolase 
Protein accessionYP_001066288 
Protein GI126454958 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2008] Threonine aldolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.181196 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACGCA CGACGGCGGC CGCGAAGGCG GCCGTCTCGA CGACGAACCG GCGCAGCGAG 
GCCGGACGAC CCAACCAATG GATCGATGCG ATGATTGATT TCCTGAGCGA CACGGTGACG
CTGCCGACCG CGGAGATGCG GCACGCGATG TTTACCGCGA ATGTGGGCGA CGATTGCTAT
GGCGAGGATC CGACGGTCAA CGAGCTCGAA TCGGTGGCGG CCGGACTGAC CGGCAAGGAA
GCGGCGGCGT TCGTCACGAG CGGCACGCTC GGCAACCTGA GCGCGCTGCT CGCGCAATGC
CCGCGCGGGC ACGAGGTGAT CCTCGGCGAT CGCTCCGACC TGTACAACTA CGAGGCGGGC
GGCGTGTCGC TCGTCGGCGG CGCGGTGTTG CACCCCGTCG AGACCGCCGA CGACGGCAGC
CTGCCGCTCG AGCGGCTGCG CGCGGCGATC CGCGACAAGC GCGACCCCCA GTGCGCGCCC
GCCGCGGTGA TCGCGCTCGA GAATCCGCAT TGCCTCGCCG GCGGCCGCGT GCTGTCGCTC
GACTACCTGC GGCGCGTGCG CGCGCTCGCC GACGAGCACG GGCTCGCCGT GCACATGGAC
GGCGCGCGTC TGTTCAACGC GCAGGCGAGC CTCGGCACGC CGGCGGCCGA GATCGTCGCG
CACGTCGATT CGGTCCAGTT CTGCCTGTCG AAGAGCCTCG CCGCGCCGTA CGGCTCGATG
GTGTGCGGCA GCGCCGCCCT GATCGATCGC GTGAAGCGCT ATCGGAAGCT GCTCGGCGGC
GGCACGCGGC AAGCCGGCAT CATGGCGGCC GCCGGGCTCG TCGCGCTGCG CACGATGGTC
GCGCGGCTCG CGGACGATCA CCGCCGCGCG GCGCGCCTCG CCGCGGAGCT GGCGCGGATT
CCGGGCGTCG CGCTGCGCTC GGCGGTGATC GAGACGAACA TGGTGTTCTT CGACGTCGCC
GAGCCGGGCA ACGAGGCGTT TCTCGCCGCG CTGCGCGACG CGGGCATCCG GATGGGCGTG
CTCGGCGACG GCGTGATCCG GGCCGTCGTG CACTACATGA TCGACGACGA CGCGATCAGC
CGCACCGTCG ACGCCGTCCG CGCGATTGTT CTTCCGTTCG CCCCGGCGTT AGCGCCGGCC
GCCGCATCGC AGGCGCAATG A
 
Protein sequence
MTRTTAAAKA AVSTTNRRSE AGRPNQWIDA MIDFLSDTVT LPTAEMRHAM FTANVGDDCY 
GEDPTVNELE SVAAGLTGKE AAAFVTSGTL GNLSALLAQC PRGHEVILGD RSDLYNYEAG
GVSLVGGAVL HPVETADDGS LPLERLRAAI RDKRDPQCAP AAVIALENPH CLAGGRVLSL
DYLRRVRALA DEHGLAVHMD GARLFNAQAS LGTPAAEIVA HVDSVQFCLS KSLAAPYGSM
VCGSAALIDR VKRYRKLLGG GTRQAGIMAA AGLVALRTMV ARLADDHRRA ARLAAELARI
PGVALRSAVI ETNMVFFDVA EPGNEAFLAA LRDAGIRMGV LGDGVIRAVV HYMIDDDAIS
RTVDAVRAIV LPFAPALAPA AASQAQ