Gene BURPS668_2111 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_2111 
Symbol 
ID4885102 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp2101332 
End bp2102390 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content69% 
IMG OID640128039 
ProductL-allo-threonine aldolase 
Protein accessionYP_001059146 
Protein GI126440420 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2008] Threonine aldolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.486975 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGGCA AGCCCATCCA ATTCGACTTC CTGAGCGACA CCGTCACCCT GCCCACGACA 
GCCATGCGCC AGGCGATGTA CGAGGCCGAC GTCGGCGACG ACGTCTATGG CGAAGACCCC
AGCACCAACC GGCTCGAATC GTATGCGGCC GACCTGCTCG GCAAGGAGGC CGCCTGCTGG
CTGCCGTCGG GCACGATGGC CAATCTGAGC GCGATTCTCG CGCAGTGCGA GCGCGGCAGG
GAACTGTTCG TCGGCGACGA TTCCGACCTT TACAACTACG AGGCGGGGGG AGTGTCCGTG
GTCGGCGGCA TCGTCCTGCA CCCGCTCGCG ACGAACGCGC GCGGCGAAAT TCCGCTCGAC
GCGCTGCAAG ACGCGCTGCG CGACGCCGAC GACACGCAGT GCGCGCCGCC CGGCATCGTG
GCGATCGAAA CGCCTCACGT GCGCACGGGG GGCACCCCGC TGTCGCTCGA CTATCTGCAC
GCGCTGCGCG CGTTCTGCGA CGCGCACTGC CTTGCGCTCC ATATCGACGG CGCGCGCGTG
TTCAATGCGG CGATCGCGCT CCGCGTCGAC GCGAAGCGCA TCGCCGCATA CGGCGACACG
CTGCAATTCT GCCTATCGAA GAGTCTCGCC GCGCCCGCGG GCTCGATCGT CGTGTCCGAT
CGCGACACGA TCGCGCGCGT GCGCCGCTGG CGCAAGCTGC TCGGCGGCGG CATGCGGCAG
ATCGGCGTCG TCACGGCCGC CGGCGAAGTC GCGCTGCGCA CGATGGTCGA GCGCCTCGCC
GACGATCACG CGCACGCGCG GCGCCTCGCC GACGGCCTGG CCGCGATCGA CGGCATCGAG
CTCGCGCACG AGCGCGTGCA GACCAACATG GTGTTCTTCA AGGTCCGGCA CGCGTCGCTC
GACCAGCGCG GCTTTCTCGA CGCGCTCGCG GCGCGCGGCG TTCGAATGGC GGAGCTCGGC
CACGGCAACA TCCGCGCCGT GACGCACTAC CACCATGCGG CGCACGACAT CGACCGCACG
CTCGCGATCG TGCGTGAAAT CCTGTCTGGC GAAAGCTGA
 
Protein sequence
MSGKPIQFDF LSDTVTLPTT AMRQAMYEAD VGDDVYGEDP STNRLESYAA DLLGKEAACW 
LPSGTMANLS AILAQCERGR ELFVGDDSDL YNYEAGGVSV VGGIVLHPLA TNARGEIPLD
ALQDALRDAD DTQCAPPGIV AIETPHVRTG GTPLSLDYLH ALRAFCDAHC LALHIDGARV
FNAAIALRVD AKRIAAYGDT LQFCLSKSLA APAGSIVVSD RDTIARVRRW RKLLGGGMRQ
IGVVTAAGEV ALRTMVERLA DDHAHARRLA DGLAAIDGIE LAHERVQTNM VFFKVRHASL
DQRGFLDALA ARGVRMAELG HGNIRAVTHY HHAAHDIDRT LAIVREILSG ES