Gene BURPS668_A3042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A3042 
Symbol 
ID4886585 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp2891401 
End bp2892876 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content72% 
IMG OID640132978 
ProductMmgE/PrpD family protein 
Protein accessionYP_001064033 
Protein GI126442889 
COG category[R] General function prediction only 
COG ID[COG2079] Uncharacterized protein involved in propionate catabolism 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCGAC GCCATTTCCT CGCCGCCGCC GTCGCGGCGG GCCTGCCGCT CGCCGCGACG 
CTCGCGCCGC GCGGCGTGCG CGCGCAGCCG AACGCAACGC AGGCCGCCGC GCCCACGCTC
GCGCGCCAAC TCGCCGAATA CGCGGCGGGG CTGCGCTACG AGGATCTCGA CTCGGCGACG
ATCGACATCG TCAAATCGCA TCTGATCGAC GCGCTCGGTT GCGCGCTCGC CGCGCTCGAC
GAGCCGCCCG TGCGCATCGC GCGCGACGCG GCGCGCAGCG CCGGCGGCGG CGGACCGTCG
ACGATCATCG GAACGGCCGA GCGAACGAGC CCCGATCTCG CGACGTTCGC CACGGGCACC
GCGCTGCGCT ACTTCGATTT CAACGACGCA TACGCGGGCC GCGAAATCGG CCATCCGAGC
GACAACATCG CCGCGTGCGT CGCGGTGGCC GAGGCGCAGC ATGCGAGCGG CCGCGAGCTG
ATCCTGTCGA TCGCGCTCGC CTACGAGATC GCATGCCGGC TGATGGACGC GGCCGCGATC
AGCCCGCGCG GCTGGGACCA CACGTGCTAC TCGCTGCCCG CCGCCGCGCT CGCGGCGGGC
AAGCTGATGC GCATGCCGGT CGAAGCGCTC ACGCAGGCCG TGAACCTGTC GCTCAACAGC
CATCTCGCGC TGAACCAGAC GCGCGTGCAG CAGCTATCGA ACTGGAAGGC GCTCGCCGAC
GCGGACGCCG CGCGCAACGC GGTGTTCTCG ACGCAGCTCG CCCGTGCGGG GCTCACCGGG
CCGTCGCCGA TCTTCGAGGG CGAGGCGGGC TTCTTCCGCC AGGTGTCGGG GCCGTTCGAG
CTCGAGACGA GCCGTTTCGG CGGGCGCGGC GAGCCGTTCA GGATCGCGCG CTGCTTCGTC
AAGTACTACC CGGCGCAAGG CTTCACGCAG ACCGCGATTC CGGCCGCGCT CGACGTCGCG
TCGCAAGCGG GCGACCTGAG CCGCATCCGC CGCATCGATG TGCACACGAC GCGCGTCGGC
TACGTGACGG CCGGCAGCGA GCCGGAGAAA TGGAAGCCGT CGACGCGCGA AACGGCCGAC
CACAGCCTGC CGTACGTCGT CGCGCGCGCG ATGCTCGACG GCGACATCCG GACCACGAGC
TTCTCGGACG CCGCGCTGCG CGATCCGGCG CTGCACGCGC TGATCGCGAA GATTCGCGTC
GAAGAGGATC CGGCGCTGAC GGCGGGCTAC CCCGCGCGCG CGGCAAATCG CGTGACCGCG
CATTGCCGCG ACGGCGCGGT GTATGCGAAG CAGGTCGACG ATCTGCCCGG CTCGCCGACG
CGGCCGATGC GCCGTGAGGA TTTCGAAGCG AAATTCGTGA AGAACGGCGG CGCGCGCCTG
AGCGAGCAGC GCATGCGCGC GGCGCTCGAC AGGTTGTGGC GGCTCGACGA GCTGCAGGAT
GTCGCCGCGC TGCCGCCGCT GTTCGTCGCG GGCTGA
 
Protein sequence
MNRRHFLAAA VAAGLPLAAT LAPRGVRAQP NATQAAAPTL ARQLAEYAAG LRYEDLDSAT 
IDIVKSHLID ALGCALAALD EPPVRIARDA ARSAGGGGPS TIIGTAERTS PDLATFATGT
ALRYFDFNDA YAGREIGHPS DNIAACVAVA EAQHASGREL ILSIALAYEI ACRLMDAAAI
SPRGWDHTCY SLPAAALAAG KLMRMPVEAL TQAVNLSLNS HLALNQTRVQ QLSNWKALAD
ADAARNAVFS TQLARAGLTG PSPIFEGEAG FFRQVSGPFE LETSRFGGRG EPFRIARCFV
KYYPAQGFTQ TAIPAALDVA SQAGDLSRIR RIDVHTTRVG YVTAGSEPEK WKPSTRETAD
HSLPYVVARA MLDGDIRTTS FSDAALRDPA LHALIAKIRV EEDPALTAGY PARAANRVTA
HCRDGAVYAK QVDDLPGSPT RPMRREDFEA KFVKNGGARL SEQRMRAALD RLWRLDELQD
VAALPPLFVA G