Gene BURPS1106A_3325 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_3325 
SymbolaroG 
ID4902989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp3248127 
End bp3249392 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content66% 
IMG OID640136551 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_001067562 
Protein GI126453129 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGATGGCGC GCGAAGCGCA ACCGAATTCC CGAAAAACCG CCGGCGAACC CGGCGGTTTT 
TTTTCGCCCC GCCGGTTCGC AAGCAGGGAC GACGGGCCGA TCCACCGCTT TACCGAATCA
CCGATTTGTC GAATCGAACC GCCAGCCGCA CCCGCGCACG CCGGGCGGCG CACCGAACCG
GAGAACTCAA GCATGCCCCC GCACAATACC GACGACGTCC GCATCCGTGA ACTGAAGGAG
CTGACTCCGC CCGCCCACCT GATCCGCGAA TTCGCGCTCG GCGAGGCGGT GTCGGAGCTC
ATCTACAACG CGCGCCAGGC GATGCACCGG ATCCTGCACG GGATGGACGA TCGCCTGATC
GTCATCATCG GGCCGTGCTC GATCCACGAC ACGAAGGCGG CGCTCGAATA CGCGGGCCGG
CTCGTCCAGG AGCGCGAGCG CTTCGCAAGC GAACTCGAGA TCGTAATGCG CGTGTACTTC
GAGAAGCCGC GCACGACGGT CGGCTGGAAG GGGCTCATCA ACGATCCGCA CCTGGATAAC
AGCTTCAAGA TCAACGACGG CCTGCGCACC GCGCGCGAGC TGCTGCTGCA GATCAACGAG
ATGGGGCTGC CCGCCGGCAC CGAATACCTC GACATGATCA GCCCGCAGTA CATCGCGGAC
CTGATCTCGT GGGGCGCGAT CGGCGCGCGC ACGACCGAAT CGCAGGTGCA CCGCGAGCTC
GCGTCGGGGC TGTCGTGCCC GGTCGGCTTC AAGAACGGCA CCGACGGCAA CGTGAAGATC
GCGGTCGACG CGATCAAGGC CGCATCGCAG CCGCACCATT TCCTGTCGGT GACGAAGGGC
GGCCATTCGG CGATCGTGTC GACGGCCGGC AACGAGGACT GCCACGTGAT CCTGCGCGGC
GGCAAGGCGC CGAACTACGA TGCCGACAGC GTGAACGCCG CGTGCGCGGA CATCGGCAAG
GCCGGCCTCG CCGCGCGCCT GATGATCGAC GCGAGCCATG CGAACAGCTC GAAGAAGCAC
GAGAACCAGA TTCCGGTATG CGCGGACATC GGCCGCCAGA TCGCCGCGGG CGACGAGCGC
ATCGTCGGCG TGATGGTCGA GTCGCACCTC GTCGAAGGCC GCCAGGACCT GAAGGAAGGC
TGCCCGCTCA CGTACGGCCA GAGCATCACC GATGCATGCA TCAACTGGGA CGACAGCGTG
AAGGTGCTCG AAGGGCTCGC CGAAGCGGTG AAGGCGCGGC GCGTCGCGCG CGGCAGCGGC
AACTGA
 
Protein sequence
MMAREAQPNS RKTAGEPGGF FSPRRFASRD DGPIHRFTES PICRIEPPAA PAHAGRRTEP 
ENSSMPPHNT DDVRIRELKE LTPPAHLIRE FALGEAVSEL IYNARQAMHR ILHGMDDRLI
VIIGPCSIHD TKAALEYAGR LVQERERFAS ELEIVMRVYF EKPRTTVGWK GLINDPHLDN
SFKINDGLRT ARELLLQINE MGLPAGTEYL DMISPQYIAD LISWGAIGAR TTESQVHREL
ASGLSCPVGF KNGTDGNVKI AVDAIKAASQ PHHFLSVTKG GHSAIVSTAG NEDCHVILRG
GKAPNYDADS VNAACADIGK AGLAARLMID ASHANSSKKH ENQIPVCADI GRQIAAGDER
IVGVMVESHL VEGRQDLKEG CPLTYGQSIT DACINWDDSV KVLEGLAEAV KARRVARGSG
N