Gene BURPS668_3422 details


Gene Information

Locus tag: BURPS668_3422
Symbol:
ID: 4883382
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 3347853
End bp: 3348926
Gene Length: 1074 bp
Protein Length: 357 aa
Translation table: 11
GC content: 69%
IMG OID: 640129350
Product: allantoicase
Protein accession: YP_001060433
Protein GI: 284159940
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG4266] Allantoicase
TIGRFAM ID: [TIGR02961] allantoicase
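
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 3347853
end_bp = 3348926
gene_length_bp = 1074
protein_length_aa = 357

# Assumption: coordinates are 1-based and inclusive at both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinate span matches Gene Length
print(span_bp % 3 == 0)                          # CDS length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # matches Protein Length
```

Under these assumptions all three checks print `True`: the span is 1074 bp, which is 358 codons, or 357 amino acids plus one stop codon.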


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.0252136
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCGCCG ATTTTTCCGC CGGTTTTCCG CTTTCGGACA CACATTCAAT AACAGGAGAC 
GACATGGCCG CCCCGATTCT CGATCCGAAC GCACCCGCGT TCACGCGCCG CTACATGAAT
CTCGCCGACC CGCGCCTCGG TGCGAAGGCG CTCTTCGCGA GCGACGAATT CTTCGCGCCG
AAGGAGCGGA TGCTCGATCC CGAGCCCGCC GTGTTCATTC CCGGCAAGTA CGACGACCAC
GGCAAATGGA TGGACGGCTG GGAGACGCGC CGCAAGCGCA CGACGGGGCA CGACTTCTGC
GTCGTGCGGC TCGCGCGGCC GGGCGTGGTG TACGGCGTCG ATCTCGACAC GAGCCACTTC
ACCGGCAATT TCCCGCCCGC CGCGTCGATC GACGCATGCG TGTCGGACGC CGACACGCCG
CCCGACGACG CCGTCTGGGA AACGCTCGTG CCGGCGACGA CGCTCGCCGG CAATCAGCAT
CACTACGTCG ACGTGAGCAA TCCTCGCACC TATACGCACC TGCGCGTGAA CCTGTATCCG
GACGGCGGGC TCGCGCGGCT GCGCGTGTAC GGCCAGCCGC AGCGCGACTG GAGCCGCGCG
GCGCGCGGCG AGCTCGTCGA TCTGGCCGCG ATCGAGAACG GCGCGTATCT CGTCGCCGCG
AACAACGAGC ACTTCGGCCC CGCGTCGCGG ATGCTGATGC CCGGGCGCGG CGCGAACATG
GGCGACGGCT GGGAGACGCG GCGCCGCCGC GAGCCCGGCA ACGACTGGGC GATCGTCGCG
CTCGCGCGGC CCGGCGTGAT TCGTAGGGTC GAAGTCGATA CCGCGCACTT CAAGGGCAAT
TTCCCGGACC GCTGCTCGCT GCAGGCGGCG CGCGTCGCGG GCGGCACGGA CGCGTCGCTC
GTCACGCAGG CGATGTTCTG GCCGATGCTG CTCGGCGAGC AGCCGCTCGG GATGGATAGC
GTGCATACGT TCGAGACGCA GCTCGCGGCG CTCGGCCCCG TCTCGCACGT GCGGCTGAAC
ATCCATCCGG ACGGCGGCGT GTCGCGCCTG CGCCTCTGGG GCGAGCTCGC ATAA
 
Protein sequence
MAADFSAGFP LSDTHSITGD DMAAPILDPN APAFTRRYMN LADPRLGAKA LFASDEFFAP 
KERMLDPEPA VFIPGKYDDH GKWMDGWETR RKRTTGHDFC VVRLARPGVV YGVDLDTSHF
TGNFPPAASI DACVSDADTP PDDAVWETLV PATTLAGNQH HYVDVSNPRT YTHLRVNLYP
DGGLARLRVY GQPQRDWSRA ARGELVDLAA IENGAYLVAA NNEHFGPASR MLMPGRGANM
GDGWETRRRR EPGNDWAIVA LARPGVIRRV EVDTAHFKGN FPDRCSLQAA RVAGGTDASL
VTQAMFWPML LGEQPLGMDS VHTFETQLAA LGPVSHVRLN IHPDGGVSRL RLWGELA
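
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines. The example below is a minimal illustration, not the tool that produced this record. It assumes the sequence is read in frame from its first base, and it uses the codon-to-amino-acid assignments that translation table 11 shares with the standard genetic code. The helper names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
# Codon table in NCBI base order (T, C, A, G); "*" marks a stop codon.
# Translation table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate complete codons from the first base; stop codons become '*'."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        a, b, c = (BASES.index(base) for base in seq[i:i + 3])
        protein.append(AMINO_ACIDS[16 * a + 4 * b + c])
    return "".join(protein)


# First and last display lines of the gene sequence above.
first_line = "ATGGCCGCCG ATTTTTCCGC CGGTTTTCCG CTTTCGGACA CACATTCAAT AACAGGAGAC"
last_line = "ATCCATCCGG ACGGCGGCGT GTCGCGCCTG CGCCTCTGGG GCGAGCTCGC ATAA"

print(translate(first_line))  # MAADFSAGFPLSDTHSITGD
print(translate(last_line))   # IHPDGGVSRLRLWGELA*
```

The two outputs match the first 20 and last 17 residues of the protein sequence, with the trailing `*` corresponding to the TAA stop codon. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.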