Gene BURPS1106A_0734 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_0734 
SymbolaroA 
ID4899309 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp713213 
End bp714568 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content67% 
IMG OID640133964 
Product3-phosphoshikimate 1-carboxyvinyltransferase 
Protein accessionYP_001065016 
Protein GI126453471 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0128] 5-enolpyruvylshikimate-3-phosphate synthase 
TIGRFAM ID[TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCACCA GCGATCGCCT CCAGCCGTCG TTCGTCGAAG TGAAGAACAC CTCGACGCTC 
AGCGGCACCA TCGATCTTCC CGCGTCGAAG AGCTCGTCGA CGCGCGCGCT CCTCACCGCG
GCGCTCACGC CCGGCATCAG CACGATCCGC AACGTCGCGA CGGGCTTCAA CTCGAACGCG
ATGAAGCACA ACTGCGAGCG GCTCGGCGCG TCGTTCTCGA GCGAGGGCGA CACGACGGTC
GTCAAGGGCG TCGACGTGAT GCACGTCGAT CGCGAGATCG TCTTCGACCC GGGCAACTCC
GGCGTCGTGC TGCGCCTGCT GATGGGCGTC GCCGGCTACC TGCCGGACAC CCGGTTCGTC
ACGCAATACC GCTATTCGCT CGGCGTGCGC TCGCAGGCGG AGATGGTCGC CGCGCTGCGC
CGGCTGAACG TCGAATGCGA AGCGGTCGGC CCCGAGGCGC GGCTGCCGAT CAGCATGCGC
TCGACGCGCG CGCTCGGCAA GCACACCGAA GTGTCGTGCA AGAAGAGCTC GCAGTTCCTG
AGCGGCCTGC TCTATCTCGG CGCGATCGGC GAGCGCGATC TCGAGATCGA CGTGGTCGAT
CACATCACCG CGCCGTCGAT GGTGCATACG ACGATCAACA ATCTCGCGCA CGCGGGCGTC
GCCGTCGAAT ACGACGCGGC CTTTCGCCGC TTCTTCGTGC CGGGGCGCGA TCGCTTCAAG
CCGTCCGAGT TCACGGTCGG CGCCGATCCG GCGAGCACGG CCGCGATCCT CGCGCTGTGC
GGCTCGCTCG CGTCGGACGT CACGCTCAAC GGCTTCTTCG AGGAAGAGCT CGGCAGCGGC
GCGGTGATCC GCTATCTGAC CGATACCGGC ACGCTGATCG ACGAGTTGCC CGGCAACCGC
ATCCGCATCC GGGGCGGCGC ATCGATCCGC GCGCAGGATT TCGACGGCTC GCTCGCGCCG
GACGCGGTGC CCGCGCTCGC CGGCCGCGCG GCGTTCGCCG AAGGCACGAG CACGTTCTAC
AACATCGAGC ATATCCGCTA CAAGGAATCG GACCGCATCT CGGATTTCCG CCGCGAACTC
GACAAGCTCG GCGTGCGCTC CGAAGAAAAG CTCGATCAGT TGATCATCCA CGGCAATCCG
CGCGGCTACC GCGGCGGCGC GGTCGTCGAC GGCCACTACG ATCACGGGCT CATCATGGCG
CTCACGACGA TCGGCCTGCA CTGCGAGCAT CCGGTGCTGA TCAAGGAGCC GCATCACGTC
GGGCAGACGT ATCCCGATTA CTTCGCCGAC ATCGGCTCGA TCGGCGCGAA CGTCGACGAG
CTGATCTACC CGAACGTCGC CGCGGCGCGC GCATGA
 
Protein sequence
MTTSDRLQPS FVEVKNTSTL SGTIDLPASK SSSTRALLTA ALTPGISTIR NVATGFNSNA 
MKHNCERLGA SFSSEGDTTV VKGVDVMHVD REIVFDPGNS GVVLRLLMGV AGYLPDTRFV
TQYRYSLGVR SQAEMVAALR RLNVECEAVG PEARLPISMR STRALGKHTE VSCKKSSQFL
SGLLYLGAIG ERDLEIDVVD HITAPSMVHT TINNLAHAGV AVEYDAAFRR FFVPGRDRFK
PSEFTVGADP ASTAAILALC GSLASDVTLN GFFEEELGSG AVIRYLTDTG TLIDELPGNR
IRIRGGASIR AQDFDGSLAP DAVPALAGRA AFAEGTSTFY NIEHIRYKES DRISDFRREL
DKLGVRSEEK LDQLIIHGNP RGYRGGAVVD GHYDHGLIMA LTTIGLHCEH PVLIKEPHHV
GQTYPDYFAD IGSIGANVDE LIYPNVAAAR A