Gene BURPS1710b_2997 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_2997 
SymbolpheA 
ID3690076 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp3302183 
End bp3303265 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content68% 
IMG OID637729453 
Productchorismate mutase/prephenate dehydratase 
Protein accessionYP_334376 
Protein GI76809940 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0077] Prephenate dehydratase
[COG1605] Chorismate mutase 
TIGRFAM ID[TIGR01807] chorismate mutase domain of proteobacterial P-protein, clade 2 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.379619 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGACG AACTCAATTC CCGCCTGAAG CCTCTGCGCG AGCGCATCGA CGCGATCGAC 
ACGCAACTGA TCGCGCTGCT GAATCAGCGC GCGGCGGTCG CGCTCGAGGT CGGCGAGGTC
AAGAAGCACT TCAACGCGCC CGTGTTCCGG CCGGAGCGCG AGCAGCAGGT GATCGCGCGC
TTGCAGGACA TGAGCGCCGG GCCGCTCGCG AGCGAGCACA TCAGCGCGAT CTGGCGCGAG
ATCATGGCGG CGAGCCGCGA TCTCGAGCAG ACGATACACG TCGCGTTCCT CGGGCCCGTC
GGCACCTATA GCGAACAGGC GATGTTCGAC TACTTCGGCC AATCGATCGA GGGGCTGCCT
TGCCCGTCGA TCGACGAGGT GTTCCGCTCG GTCGAGGCGG GCGCCGCGAC GTTCGGCGTC
GTGCCGGTCG AGAATTCGTC GGAAGGCGCG GTGTCGCGCA CGCTCGATCT GCTGCTGCAT
ACGCAGCTTC TGATCGGCGG CGAGCTGTCG CTGCCGATTC ATCACAATCT GCTCACGCAA
ACAGGCAAGC TCGACGGCGT GAAGCGCGTG TGCGCGCATG CGCAGGCGCT CGCGCAGTGC
CAGCAATGGC TCGCGTCGAA CGCGCCGCAT CTCGAGCGGC AGGCGGTCGC GAGCAACGCG
GAAGCCGCGC GGCTCGCGGC CGACGACGCG ACGGTCGCCG CGATCGCGGG CGACCGCGCG
GCGACGCACT ACGGGCTGCA GATCGCCTAT GCGCTGATCC AGGACGATCC GCACAACCGC
ACGCGCTTCG CGGTGATCGG CCAGGAGCCG GCGGGGCCGA GCGGGCATGA CCAGACCTCG
CTCATCGTGT CGGTGAAGAA CGAGCCGGGC GCGGTGTTCA AGCTGCTCGA GCCGCTTGCG
CGGCACGGCG TGTCGATGAC GCGCTTCGAG TCGCGCCCGG CGCGGGTCGG CACGTGGGAG
TATTACTTCT ACATCGACAT CGAAGGGCAT CGCGACGACG CCGCTGTCCA GGGTGCGCTC
GCGGAGCTTG GCAGGAAGGC GGCTTTTCTG AAGATTCTCG GTTCGTATCC GCGCGCGCGG
TGA
 
Protein sequence
MDDELNSRLK PLRERIDAID TQLIALLNQR AAVALEVGEV KKHFNAPVFR PEREQQVIAR 
LQDMSAGPLA SEHISAIWRE IMAASRDLEQ TIHVAFLGPV GTYSEQAMFD YFGQSIEGLP
CPSIDEVFRS VEAGAATFGV VPVENSSEGA VSRTLDLLLH TQLLIGGELS LPIHHNLLTQ
TGKLDGVKRV CAHAQALAQC QQWLASNAPH LERQAVASNA EAARLAADDA TVAAIAGDRA
ATHYGLQIAY ALIQDDPHNR TRFAVIGQEP AGPSGHDQTS LIVSVKNEPG AVFKLLEPLA
RHGVSMTRFE SRPARVGTWE YYFYIDIEGH RDDAAVQGAL AELGRKAAFL KILGSYPRAR