Gene Information |
Locus tag | BURPS668_2111 |
Symbol | |
ID | 4885102 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 2101332 |
End bp | 2102390 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640128039 |
Product | L-allo-threonine aldolase |
Protein accession | YP_001059146 |
Protein GI | 126440420 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2008] Threonine aldolase |
TIGRFAM ID | |
|
|
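The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above (BURPS668_2111).
length_bp = gene_length(2101332, 2102390)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1059 352
```

Under these assumptions the result agrees with the Gene Length (1059 bp) and Protein Length (352 aa) fields.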
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.486975 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGGCA AGCCCATCCA ATTCGACTTC CTGAGCGACA CCGTCACCCT GCCCACGACA GCCATGCGCC AGGCGATGTA CGAGGCCGAC GTCGGCGACG ACGTCTATGG CGAAGACCCC AGCACCAACC GGCTCGAATC GTATGCGGCC GACCTGCTCG GCAAGGAGGC CGCCTGCTGG CTGCCGTCGG GCACGATGGC CAATCTGAGC GCGATTCTCG CGCAGTGCGA GCGCGGCAGG GAACTGTTCG TCGGCGACGA TTCCGACCTT TACAACTACG AGGCGGGGGG AGTGTCCGTG GTCGGCGGCA TCGTCCTGCA CCCGCTCGCG ACGAACGCGC GCGGCGAAAT TCCGCTCGAC GCGCTGCAAG ACGCGCTGCG CGACGCCGAC GACACGCAGT GCGCGCCGCC CGGCATCGTG GCGATCGAAA CGCCTCACGT GCGCACGGGG GGCACCCCGC TGTCGCTCGA CTATCTGCAC GCGCTGCGCG CGTTCTGCGA CGCGCACTGC CTTGCGCTCC ATATCGACGG CGCGCGCGTG TTCAATGCGG CGATCGCGCT CCGCGTCGAC GCGAAGCGCA TCGCCGCATA CGGCGACACG CTGCAATTCT GCCTATCGAA GAGTCTCGCC GCGCCCGCGG GCTCGATCGT CGTGTCCGAT CGCGACACGA TCGCGCGCGT GCGCCGCTGG CGCAAGCTGC TCGGCGGCGG CATGCGGCAG ATCGGCGTCG TCACGGCCGC CGGCGAAGTC GCGCTGCGCA CGATGGTCGA GCGCCTCGCC GACGATCACG CGCACGCGCG GCGCCTCGCC GACGGCCTGG CCGCGATCGA CGGCATCGAG CTCGCGCACG AGCGCGTGCA GACCAACATG GTGTTCTTCA AGGTCCGGCA CGCGTCGCTC GACCAGCGCG GCTTTCTCGA CGCGCTCGCG GCGCGCGGCG TTCGAATGGC GGAGCTCGGC CACGGCAACA TCCGCGCCGT GACGCACTAC CACCATGCGG CGCACGACAT CGACCGCACG CTCGCGATCG TGCGTGAAAT CCTGTCTGGC GAAAGCTGA
|
Protein sequence | MSGKPIQFDF LSDTVTLPTT AMRQAMYEAD VGDDVYGEDP STNRLESYAA DLLGKEAACW LPSGTMANLS AILAQCERGR ELFVGDDSDL YNYEAGGVSV VGGIVLHPLA TNARGEIPLD ALQDALRDAD DTQCAPPGIV AIETPHVRTG GTPLSLDYLH ALRAFCDAHC LALHIDGARV FNAAIALRVD AKRIAAYGDT LQFCLSKSLA APAGSIVVSD RDTIARVRRW RKLLGGGMRQ IGVVTAAGEV ALRTMVERLA DDHAHARRLA DGLAAIDGIE LAHERVQTNM VFFKVRHASL DQRGFLDALA ARGVRMAELG HGNIRAVTHY HHAAHDIDRT LAIVREILSG ES
|
| |
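The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch, not the method used by the database, of how the GC content and the translation could be recomputed from the gene sequence. It assumes the amino acid assignments of translation table 11 (which match the standard code; the tables differ in permitted start codons), and the helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalize to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence shown above.
prefix = "ATGAGCGGCA AGCCCATCCA ATTCGACTTC"
print(translate(prefix))  # MSGKPIQFDF
```

Passing the full Gene sequence field to `gc_content` and `translate` should allow the GC content and Protein sequence fields to be compared against the record.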