Gene BMA10229_A3251 details


Gene Information

Locus tag: BMA10229_A3251
Symbol:
ID: 4791936
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia mallei NCTC 10229
Kingdom: Bacteria
Replicon accession: NC_008836
Strand:
Start bp: 3303398
End bp: 3305089
Gene Length: 1692 bp
Protein Length: 563 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: RNA pseudouridine synthase family protein
Protein accession: YP_001029187
Protein GI: 124384572
COG category:
COG ID:
TIGRFAM ID:
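
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based inclusive positions, and that Gene Length includes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(3303398, 3305089)
print(length)                  # 1692
print(protein_length(length))  # 563
```

Under those assumptions, both results agree with the Gene Length and Protein Length fields of this record.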


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.9005
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCACAA AATTGACCGT CAAGAATCCG CGCCCGGCGA CGCCCGGCCG CGCCCCCGTC 
CGCTCCGGCA GCCTCACCGC GCGCAAGGTC GCGCGGCCCG ACCCGAAAGC GGCGGGCGCA
AAACCCGCCG CGGCGAAGCC TGCTGCGAAG TCCGCATCGG CTGCCAAGCC GGCGGCGCCG
CGCAGCGCGG CGAACGCTGC GCCGAAGCGC GCGCCGGGGC CGTCGCGCCC GGCCGCGGCA
TCGGAAGGCA AGCGCGTCGC GAAGCCGCGC ACCGCGCACG ACGCCGGCCG CACGGGCGGC
GAGCGTGCGC CGGCCAAGCG CGCCACCACG CCCGGCGCGG CGTCCGCGCC GCGCACGCGC
CGCACCGACG CGAAGCCGGC GCGCCGCACC AACGAACGCC CTGCCGGCCG CGACGAGCGT
GCACCGCGCG ACTCGGATGC GCGCGCGTTC GATGCGGGCA CGCGCGGTAA GGACCGCGCG
CCCCGCGAGG GCGCAAGGCC CGGCGCACGG GGCGCGACGG GCGCGAAGTT CGGCGGCGCG
GCGCGTCGAT CGGACGACGC CGACCGTCGA ACGCCCCGCG CGACGCGTGC GGACAGCCGC
GCGCGCGATG CCGCGCCGTC GTCGTTCGCG GGCAAGACCG CGACAGCCGG CAAGCGTGCG
CCGCAGCGCG CCGACGATCG CTACGGCGCA GCCGGGAAGC GCACATCGCC GCGCACCGAG
CGAACCGAGC GAACCGAGCG CCCCGCCCGC TTCGGCGAAC GGCCGGCCAC CCGCGCGAGC
GCATCCGGCG AGCGCCGCCC CACGGCCCGC GCGGCGACGG GTTCGCGCCT CAAGCTCGCG
CAGCCGATCA AGCGCGGCAG CGGCGAACTG GGCGAATCCG CTCGCGGCGG TGAGCACGGC
GAACGCGGCA AGCGTATCGA GCGCGGCGAC GAAACCGGCC TCGTGCGCCT GTCGAAGCGC
ATGTCGGAGC TGGGTCTCTG CTCGCGCCGC GAAGCAGACG AATGGATCGA GAAAGGCTGG
GTGCTCGTCG ACGGCGAGCG CATCGACACG CTCGGCACGA AGGTGCGCGC CGACCAGCGC
ATCGAGATCG ATTCGAACGC GCGCGCCGCG CAGGCCGCGC AAGTGACGAT CCTGCTGCAC
AAGCCGGTGG GCTACGTGTC GGGCCAGGCG GAGGACGGCT ACGCCCCCGC CGCGACGCTC
GTCACGCGCG AGAACCACTG GAGCGGCGAC CGCTCGCCGC TGCGCTTCTC GCCGCAGCAC
CTGCGCGCGC TCGCGCCCGC GGGCCGGCTC GACATCGATT CGACGGGCCT TCTCGTGCTG
ACGCAGAACG GGCGCGTCGC GAAACAGCTG ATCGGCGAAC AATCGGACAT CGACAAGGAA
TACCTGGTGC GCGTGCGCTT CGGCGAGCGC ACGGCCGACA TCGAACGCCA CTTCCCCGCC
GAGTCGCTCG CGAAGCTGCG CCACGGCCTC GAGCTCGACG GCGTGCCGCT CAAGCCCGCG
ATGGTCAGTT GGCAGAACGG CGAGCAACTG CGCTTCGTGC TGCGCGAAGG CAAGAAGCGC
CAGATTCGCC GGATGTGCGA ACTCGTCGGC CTCGAGGTGA TCGGCCTGAA GCGCGTGCGG
ATGGGCCGCG TGATGCTGGG CGCGCTGCCG CAAGGCGAGT GGCGCTATCT CGGGCCGGAC
GAATCGTTCT GA
 
Protein sequence
MRTKLTVKNP RPATPGRAPV RSGSLTARKV ARPDPKAAGA KPAAAKPAAK SASAAKPAAP 
RSAANAAPKR APGPSRPAAA SEGKRVAKPR TAHDAGRTGG ERAPAKRATT PGAASAPRTR
RTDAKPARRT NERPAGRDER APRDSDARAF DAGTRGKDRA PREGARPGAR GATGAKFGGA
ARRSDDADRR TPRATRADSR ARDAAPSSFA GKTATAGKRA PQRADDRYGA AGKRTSPRTE
RTERTERPAR FGERPATRAS ASGERRPTAR AATGSRLKLA QPIKRGSGEL GESARGGEHG
ERGKRIERGD ETGLVRLSKR MSELGLCSRR EADEWIEKGW VLVDGERIDT LGTKVRADQR
IEIDSNARAA QAAQVTILLH KPVGYVSGQA EDGYAPAATL VTRENHWSGD RSPLRFSPQH
LRALAPAGRL DIDSTGLLVL TQNGRVAKQL IGEQSDIDKE YLVRVRFGER TADIERHFPA
ESLAKLRHGL ELDGVPLKPA MVSWQNGEQL RFVLREGKKR QIRRMCELVG LEVIGLKRVR
MGRVMLGALP QGEWRYLGPD ESF
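
The gene and protein sequences are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The sketch below is one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the record does not say how its GC content value was derived, so this calculation may not match it exactly.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Usage on the first and last lines of the gene sequence only;
# paste the full block to evaluate the whole gene.
fragment = clean_sequence("""
ATGCGCACAA AATTGACCGT CAAGAATCCG CGCCCGGCGA CGCCCGGCCG CGCCCCCGTC
GAATCGTTCT GA
""")
print(len(fragment), round(gc_fraction(fragment), 3))
```

Applying `len(clean_sequence(...))` to the complete gene and protein blocks is a quick way to compare them against the Gene Length and Protein Length fields.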