Gene BURPS668_2386 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_2386 
Symbol 
ID4883071 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp2354923 
End bp2356077 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content68% 
IMG OID640128314 
Productallantoicase 
Protein accessionYP_001059418 
Protein GI284159964 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG4266] Allantoicase 
TIGRFAM ID[TIGR02961] allantoicase 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCGGCG CGGCGGGTTC GCCGCGACGG CATCCGCTGC GGCGCGCCAG ACGCGCCGCC 
TCGGTCGCCC GAGCCGCGCG AGACGGCGCG CGGCGGCGTT CCACTCATCA GGCATCACGC
ACTTACGACA AGGACAAGAC GATGGCTCTT CCGCTTTCGG ATCCGAACGC TCCCGAATTC
ACGCGGCGTT ACGTGAATCT CGCCGATCCG CGTCTCGGCG CGCAGGCGCT TGAGGCGAGC
GACGATTTCT TCGCGCCGAA GGAGCGCATG CTGAATCCGG AGCCCGCGGT GTTCATCCCG
GGCAAATACG ACGATCACGG CAAATGGATG GACGGCTGGG AGACGCGCCG CAAGCGCACG
ACGGGTTATG ACTGGTGCGT CGTGAAGCTC GCGCGTCCGG GCGTGATCAA GGGTGTCGAC
ATCGATACGA GCCATTTCAC CGGCAATTTC CCGCCCGCCG CGTCGATCGA GGCCGCGCAC
GTGCCCGACG GCGCGCCGAA CGAGGCGACG AAGTGGGTGG AGATCGTGCC GGCGACGACG
CTGCAGGGCA ACAGCCATCA CTACGTCGAA GCACGCGACG CGAACGCATA CACGCATCTG
CGCGTGAACC TCTACCCGGA CGGCGGCATC GCGCGGCTGC GCGTCTACGG CCAGCCGCAG
CTCGATTGGG CGGGCGCGAG CCGATCGGCG CTGTTCGATC TCGCGGCGAT GGAGAACGGC
GGCTACGTCG TCGCGGCGAA CAACCAGCAC TTCGGCCTCG CGTCGAACGT GCTGCTGCCG
GGCCGCGGCG TGAACATGGG CGACGGCTGG GAGACGCGCC GCCGCCGCGA GCCGGGCAAC
GACTGGGCGA TCGTCGCGCT CGCGCAGCCG GGCGTGATCC GCAAGGTCGA AATCGACACC
GCGCATTTCA AGGGCAACTA TCCGGACCGC TGTTCGATCC AGGCCGCCTA TGTGCAGGGC
GGCACCGACA GCTCGCTCGT CACGCAGGCG ATGTTCTGGC CGGTGCTGCT CGGCGAGCAG
AAGCTGCAGA TGGACAAGCA GCACGCTTTC GAAGCCGAGC TCGCCGCGCT CGGGCCCGTC
ACGCACGTGC GGCTGAACAT CATTCCGGAC GGCGGCGTAT CGCGTCTGCG CGTATGGGGC
ACGCTCGACA AATGA
 
Protein sequence
MRGAAGSPRR HPLRRARRAA SVARAARDGA RRRSTHQASR TYDKDKTMAL PLSDPNAPEF 
TRRYVNLADP RLGAQALEAS DDFFAPKERM LNPEPAVFIP GKYDDHGKWM DGWETRRKRT
TGYDWCVVKL ARPGVIKGVD IDTSHFTGNF PPAASIEAAH VPDGAPNEAT KWVEIVPATT
LQGNSHHYVE ARDANAYTHL RVNLYPDGGI ARLRVYGQPQ LDWAGASRSA LFDLAAMENG
GYVVAANNQH FGLASNVLLP GRGVNMGDGW ETRRRREPGN DWAIVALAQP GVIRKVEIDT
AHFKGNYPDR CSIQAAYVQG GTDSSLVTQA MFWPVLLGEQ KLQMDKQHAF EAELAALGPV
THVRLNIIPD GGVSRLRVWG TLDK