Gene BURPS668_2003 details

Gene Information

Locus tag: BURPS668_2003
Symbol:
ID: 4884272
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 1982929
End bp: 1983999
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 72%
IMG OID: 640127931
Product: L-allo-threonine aldolase
Protein accession: YP_001059038
Protein GI: 284159935
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2008] Threonine aldolase
TIGRFAM ID:
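
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record; it assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1982929
end_bp = 1983999
gene_length_bp = 1071
protein_length_aa = 356

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue to the protein.
codons = span // 3
expected_protein_length = codons - 1

print(span, span == gene_length_bp)                                          # 1071 True
print(expected_protein_length, expected_protein_length == protein_length_aa)  # 356 True
```

Under those assumptions the reported gene length and protein length agree with the coordinates.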


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTGATT TCCTGAGCGA CACGGTGACG CTGCCGACCG CGGAGATGCG GCACGCGATG 
TTTACCGCGA ATGTGGGCGA CGATTGCTAC GGCGAGGATC CGACGGTCAA CGAGCTCGAA
TCGGTGGCGG CCGGACTGAC CGGCAAGGAA GCGGCGGCGT TCGTCACGAG CGGCACGCTC
GGCAACCTGA GCGCGCTGCT CGCGCAATGC CCGCGCGGGC ACGAGGTGAT CCTCGGCGAT
CGCTCCGACC TGTACAACTA CGAGGCGGGC GGCGTGTCGC TCGTCGGCGG CGCGGTGTTG
CACCCCGTCG AGACCGCCGA CGACGGCAGC CTGCCGCTCG AGCGGCTGCG CGCGGCGATC
CGCGACAAGC GCGACCCCCA GTGCGCGCCC GCCGCGGTGA TCGCGCTCGA GAATCCGCAT
TGCCTCGCCG GCGGCCGCGT GCTGTCGCTC GACTACCTGC GGCGCGTGCG CGCGCTCGCC
GACGAGCACG GGCTCGCCGT GCACATGGAC GGCGCGCGTC TGTTCAACGC GCAGGCGAGC
CTCGGCACGC CGGCGGCCGA GATCGTCGCG CACGTCGATT CGGTCCAGTT CTGCCTGTCG
AAGAGCCTCG CCGCGCCGTA CGGCTCGATG GTGTGCGGCA GCGCCGCCCT GATCGATCGC
GTGAAGCGCT ATCGGAAGCT GCTCGGCGGC GGCACGCGGC AAGCCGGCAT CATGGCGGCC
GCCGGGCTCG TCGCGCTGCG CACGATGGTC GCGCGGCTCG CGGACGATCA CCGCCGCGCG
GCGCGCCTCG CCGCGGAGCT GGCGCGGATT CCGGGCGTCG CGCTGCGCTC GGCGGTGATC
GAGACGAACA TGGTGTTCTT CGACGTCGCC GAGCCGGGCA ACGAGGCGTT TCTCGCCGCG
CTGCGCGACG CGGGCATCCG GATGGGCGTG CTCGGCGACG GCGTGATCCG GGCCGTCGTG
CACTACATGA TCGACGACGA CGCGATCAGC CGCACCGTCG ACGCCGTCCG CGCGATTGTT
CTTCCGTTCG CCCCGGCGTT AGCGCCGGCC GCCGCATCGC AGGCGCAATG A
 
Protein sequence
MIDFLSDTVT LPTAEMRHAM FTANVGDDCY GEDPTVNELE SVAAGLTGKE AAAFVTSGTL 
GNLSALLAQC PRGHEVILGD RSDLYNYEAG GVSLVGGAVL HPVETADDGS LPLERLRAAI
RDKRDPQCAP AAVIALENPH CLAGGRVLSL DYLRRVRALA DEHGLAVHMD GARLFNAQAS
LGTPAAEIVA HVDSVQFCLS KSLAAPYGSM VCGSAALIDR VKRYRKLLGG GTRQAGIMAA
AGLVALRTMV ARLADDHRRA ARLAAELARI PGVALRSAVI ETNMVFFDVA EPGNEAFLAA
LRDAGIRMGV LGDGVIRAVV HYMIDDDAIS RTVDAVRAIV LPFAPALAPA AASQAQ
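
The relationship between the gene sequence and the protein sequence can be illustrated by translating a few codons. This is a hedged sketch rather than the tool that produced the record: it uses the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which this sketch does not handle). The helper names `clean` and `translate` are hypothetical, and only the first three blocks and the last line of the gene sequence are used.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons from the first base; trailing partial codons are ignored."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence (bases 1-30).
first_blocks = "ATGATTGATT TCCTGAGCGA CACGGTGACG"
# Last line of the gene sequence; it starts at base 1021, so it is in frame.
last_line = "CTTCCGTTCG CCCCGGCGTT AGCGCCGGCC GCCGCATCGC AGGCGCAATG A"

print(translate(first_blocks))  # MIDFLSDTVT
print(translate(last_line))     # LPFAPALAPAAASQAQ*
```

The two outputs match the first ten and last sixteen residues of the protein sequence above, with the trailing `*` corresponding to the TGA stop codon.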