Gene BURPS668_0720 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_0720 
SymbolaroA 
ID4882388 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp698155 
End bp699510 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content67% 
IMG OID640126648 
Product3-phosphoshikimate 1-carboxyvinyltransferase 
Protein accessionYP_001057772 
Protein GI126440301 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0128] 5-enolpyruvylshikimate-3-phosphate synthase 
TIGRFAM ID[TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.105957 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCACCA GCGATCGCCT CCAGCCGTCG TTCGTCGAAG TGAAGAACAC CTCGACGCTC 
AGCGGCACCA TCGATCTTCC CGCGTCGAAG AGCTCGTCGA CGCGCGCGCT CCTCACCGCG
GCGCTCACGC CCGGCATCAG CACGATCCGC AACGTCGCGA CGGGCTTCAA CTCGAACGCG
ATGAAGCACA ACTGCGAGCG GCTCGGCGCG TCGTTCTCGA GCGAGGGCGA CACGACGGTC
GTCAAGGGCG TCGACGTGAT GCACGTCGAT CGCGAGATCG TCTTCGACCC GGGCAACTCC
GGCGTCGTGC TGCGCCTGCT GATGGGCGTC GCCGGCTACC TGCCGGACAC CCGGTTCGTC
ACGCAATACC GCTATTCGCT CGGCGTGCGC TCGCAGGCGG AGATGGTCGC CGCGCTGCGC
CGGCTGAACG TCGAATGCGA AGCGGTCGGC CCCGAGGCGC GGCTGCCGAT CAGCATGCGC
TCGACGCGCG CGCTCGGCAA GCACACCGAA GTGTCGTGCA AGAAGAGCTC GCAGTTCCTG
AGCGGCCTGC TCTATCTCGG CGCGATCGGC GAGCGCGATC TCGAGATCGA CGTGGTCGAT
CACATCACCG CGCCGTCGAT GGTGCATACG ACGATCAACA ATCTCGCGCA CGCGGGCGTC
GCCGTCGAAT ACGACGCGGC CTTTCGCCGC TTCTTCGTGC CGGGGCGCGA TCGCTTCAAG
CCGTCCGAGT TCACGGTCGG CGCCGATCCG GCGAGCACGG CCGCGATCCT CGCGCTGTGC
GGCTCGCTCG CGTCGGACGT CACGCTCAAC GGCTTCTTCG AGGAAGAGCT CGGCAGCGGC
GCGGTGATCC GCTATCTGAC CGATACCGGC ACGCTGATCG ACGAGTTGCC CGGCAACCGC
ATCCGCATCC GGGGCGGCGC GTCGATCCGC GCGCAGGATT TCGACGGCTC GCTCGCGCCG
GACGCGGTGC CCGCGCTCGC CGGCCGCGCG GCGTTCGCCG AAGGCACGAG CACGTTCTAC
AACATCGAGC ATATCCGCTA CAAGGAATCG GACCGCATCT CGGATTTCCG CCGCGAACTC
GACAAGCTCG GCGTGCGCTC CGAGGAAAAG CTCGATCAGT TGATCATCCA CGGCAATCCG
CGCGGCTACC GCGGCGGCGC GGTCGTCGAC GGCCACTACG ATCACGGGCT CATCATGGCG
CTCACGACGA TCGGCCTGCA CTGCGAGCAT CCGGTGCTGA TCAAGGAGCC GCATCACGTC
GGGCAGACGT ATCCCGATTA CTTCGCCGAC ATCGGCTCGA TCGGCGCGAA CGTCGACGAG
CTGATCTACC CGAACGTCGC CGCGGCGCGC GCGTGA
 
Protein sequence
MTTSDRLQPS FVEVKNTSTL SGTIDLPASK SSSTRALLTA ALTPGISTIR NVATGFNSNA 
MKHNCERLGA SFSSEGDTTV VKGVDVMHVD REIVFDPGNS GVVLRLLMGV AGYLPDTRFV
TQYRYSLGVR SQAEMVAALR RLNVECEAVG PEARLPISMR STRALGKHTE VSCKKSSQFL
SGLLYLGAIG ERDLEIDVVD HITAPSMVHT TINNLAHAGV AVEYDAAFRR FFVPGRDRFK
PSEFTVGADP ASTAAILALC GSLASDVTLN GFFEEELGSG AVIRYLTDTG TLIDELPGNR
IRIRGGASIR AQDFDGSLAP DAVPALAGRA AFAEGTSTFY NIEHIRYKES DRISDFRREL
DKLGVRSEEK LDQLIIHGNP RGYRGGAVVD GHYDHGLIMA LTTIGLHCEH PVLIKEPHHV
GQTYPDYFAD IGSIGANVDE LIYPNVAAAR A