Gene BMASAVP1_A0338 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMASAVP1_A0338 
SymbolpyrC 
ID4680301 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei SAVP1 
KingdomBacteria 
Replicon accessionNC_008785 
Strand
Start bp340001 
End bp341059 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content70% 
IMG OID639844616 
Productdihydroorotase 
Protein accessionYP_991689 
Protein GI121600998 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0418] Dihydroorotase 
TIGRFAM ID[TIGR00856] dihydroorotase, homodimeric type 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGCTT CCGTTCCCGC CTCGCTGACC CTCGCCCGCC CCGACGACTG GCACCTGCAC 
GTGCGCGACG GCGCGATGCT CGCGGCCGTC CTGCCGCACA CCGCTCGCCA GTTCGGCCGC
GCGATCATCA TGCCGAACCT GAAGCCGCCC GTCACGACGA CCGCGCAGGC GCAGGCCTAC
CGCGAGCGCA TCCTGGCCGC GGTGCCGGCC GGCATGACGT TCGAGCCGTT GATGACGCTG
TACCTGACCG ACAACACGCC CGCCGACGAA ATCCGCCGCG CACGCGAAAG CGGCTGCGTG
CACGGCGTGA AGCTCTATCC GGCGGGCGCG ACGACGAACT CGGACGCCGG CGTGACCGAC
CTGCTCGGCA AGTGCGCGAA GACGCTCGAG GCGATGCAGG AAGTCGGGAT GCCGCTGCTC
GTGCACGGCG AGGTGACGGA TCCGTCGATC GACCTGTTCG ACCGCGAGAA GGTGTTCATC
GATCGCGTGA TGGAGCCGCT GCGCCGCGCG CTGCCGGGGC TCAAGGTGGT GTTCGAGCAT
ATTACGACGA AGGATGCGGC CGACTACGTG CGCGATGCCG ACGCGGCGTC AGGCCGGATC
GGCGCGACGA TCACCGCGCA CCATCTGCTG TACAACCGCA ATGCGATGTT TTTCGGCGGC
ATCCGTCCAC ATTACTACTG CCTGCCGGTG CTCAAGCGCG AGACGCATCG GATCGCGCTC
GTCGAGGCGG CGACGTCGGG CAATCCGCGC TTCTTCCTCG GCACCGACAG CGCGCCGCAC
GCGAAAGGCG CGAAGGAAGC CGCGTGCGGC TGCGCGGGCT GCTACACCGC GCTGCACGCG
CTCGAGCTGT ACGCGGAGGC ATTCGACCAG GCGGGCGCGC TCGACAAGCT CGAAGGCTTC
GCGAGCTTCT TCGGCGCGGA CTTCTACGGT TTGCCGCGCA GCGCCGAGAC GGTGACGCTG
CGCCGCGAGA CGTGGGAGCT GCCGCGCGAG ATCGACGCGG GCGCCGGCCC GGTCGTGCCG
CTGCGCGGCG GCGAGGCGAT CGGCTGGCGG CTCGTCTGA
 
Protein sequence
MNASVPASLT LARPDDWHLH VRDGAMLAAV LPHTARQFGR AIIMPNLKPP VTTTAQAQAY 
RERILAAVPA GMTFEPLMTL YLTDNTPADE IRRARESGCV HGVKLYPAGA TTNSDAGVTD
LLGKCAKTLE AMQEVGMPLL VHGEVTDPSI DLFDREKVFI DRVMEPLRRA LPGLKVVFEH
ITTKDAADYV RDADAASGRI GATITAHHLL YNRNAMFFGG IRPHYYCLPV LKRETHRIAL
VEAATSGNPR FFLGTDSAPH AKGAKEAACG CAGCYTALHA LELYAEAFDQ AGALDKLEGF
ASFFGADFYG LPRSAETVTL RRETWELPRE IDAGAGPVVP LRGGEAIGWR LV