Gene BURPS668_1072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_1072 
Symbol 
ID4884951 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp1050687 
End bp1052666 
Gene Length1980 bp 
Protein Length659 aa 
Translation table11 
GC content71% 
IMG OID640127000 
Productbifunctional uroporphyrinogen-III synthetase/uroporphyrin-III C-methyltransferase 
Protein accessionYP_001058122 
Protein GI126440099 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1587] Uroporphyrinogen-III synthase
[COG2959] Uncharacterized enzyme of heme biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGGGCG CGCCGCGCAC GTTCACCGCG GTGCTCACGC GCCCCGACGG ACAGTCGGCG 
GCGCTCGCGG CGCAGCTCGC GGCGGCGGGC ATCGACGTGC TCGACTTTCC GCTCATCGAC
ATCGCGCCGC TCGCCGACGA CGCGCCGCTC GCCGAAGCGT TCGCGCGGCT CGACGCGTAT
GCGCTCGTCG TGTTCGTGTC GCCGAACGCG GTCGATCACG CGCTCGCGCG GCTCGGCGCG
ATCTGGCCGC ATCCGCTGCC GATCGGCGTC GTCGGGCCGG GCAGCGTCGC CGCGCTCGCG
CGGCACGGCA TCGCCGCGCC CGCGCATCGC GTGATCGCGC CGAGCGCGCC CGACGACGGC
GGCGAGCCGC ACTACGATTC CGAGAGCCTG TTCGCCGAGA TCGCGCGTGC GTTCGACGGC
GAGGCGAAGC TCGCCGGCAA GCGCGTGCTG ATCGTGCGCG GCGACGGCGG CCGCGAATGG
CTTGCCGAGC GCCTGCGCGA GGCGGGCGCC GAAGTCGAGC TCGTCGCCGC GTATCGGCGC
GTGAGCCCGG AGCCTTCGAT CGGCGCGTGG GAGCGCGTGC ACGCGCTGCT GTCGGGCGCG
CCGCACGCGT GGCTCGTGAC GAGCTCGGAG GGCGTGCGCA ATCTGCAGGA GCTCGCGCAC
GAGCATTTGA ACGAAACGGA GATCGACGCG CTCAAGCACG CCCGGTTCGT CGCGCCGCAT
TCGAGGATCG TCGAGACCGC GCGCGCACTC GGTTTTGATA GGATTACGCT GACCGGCGCG
GGCGATGAGC GCATCGTCCG CGCGTTTCGT ACGTTGGCCG ACCAGGCCGA TCAACCGGCG
ACAGCCGCAC CGATGCCTTC TCGCATGACA GACACCAACG ATACCAAAGA CGTCTCTTCC
AAACCGGCCG CGGCTCCCGT TGCGCCGCCG AATCAACCGT TTACGCCGTT TGAAACGAAG
GAGCGGCGCG GCGCGGCGAG CGCGGCGCTC TGGTTCGTGG TCGTCGTGAT CGCGGCCGCG
GCGGGCGTCG GCGGCTATGC GCTGAACCGC AAGGTCGACC GCCTCGATCA GCACGCGACC
GAGCGGCAAA AGGCGCTCGA CGCGCAAACG GCCGAGCTCC GCACGAAGAC CGAACAGGCG
CTCGCGAGCG TGCGCCAGGC CGATTCGCAA CTGTCGCAGC TCGAAGGCAA GCTCGCCGAC
GCGCAGACCG CGCAGACCGC GCTGCAGCAG CAATATCAGG ATCTGTCGCG CAACCGCGAC
GCGTGGATGA TCGAGGAAGT CGGTCAGATG CTGTCGAGCG CGAGCCAGCA ACTGCTGCTC
ACGGGCAACA CGCAGCTCGC GCTGATCGCG CTGCAGAACG CCGATGCGCG GCTCGCGTCG
TCGCAGAGCG CGCAGGCGGT CGTCGTGCGC AAGGCGATCG CGCAGGATAT CGAACGGTTG
AAGGCCGCGC CTTCGGCGGA TCTCACGGGG CTTGCGATCA AGCTCGACGA CGCGATCGCG
AAGGTCGACA CGCTGCCGCT CGCGGGCGAA GTGCTCGCGC CGCACGCGCA GGCGAAGCCC
GACGCCGCCG CGAGCGCCCG GCAGGCGGCC GCGGCGGCGG GCGAGCCACG CTGGAAGGCC
TGGTGGCGCG GCTTCTCGGC GGGCATCGGC GAGCAGTTGA AGTCGCTCGT CGAGGTGCGC
CGCATCGATC ACGCGGACGC GATGCTCGCG TCGCCCGAAC AGGGCTACTT CGTGCGCGAG
AACGTGAAGC TGCGTCTGCT GAGCGCGCGG CTGTCGCTGC TCGCGCGCGA CGACGGCGCG
ATGAAGTCCG ATCTGCATGC CGCGCAGGCG GCCGTGGCGC GCTACTTCGA CGGCGCATCG
AAGGACACCC GGGTCGTTCA GGATCTGCTC AAGCAGGTCG ACGCCGCGTC GCTGACGGTC
GCGGTGCCGA ACCTCAACAC GAGCCTGAAC GCGGTTCAAC AGTTCAAGAG CCGGGGTTGA
 
Protein sequence
MAGAPRTFTA VLTRPDGQSA ALAAQLAAAG IDVLDFPLID IAPLADDAPL AEAFARLDAY 
ALVVFVSPNA VDHALARLGA IWPHPLPIGV VGPGSVAALA RHGIAAPAHR VIAPSAPDDG
GEPHYDSESL FAEIARAFDG EAKLAGKRVL IVRGDGGREW LAERLREAGA EVELVAAYRR
VSPEPSIGAW ERVHALLSGA PHAWLVTSSE GVRNLQELAH EHLNETEIDA LKHARFVAPH
SRIVETARAL GFDRITLTGA GDERIVRAFR TLADQADQPA TAAPMPSRMT DTNDTKDVSS
KPAAAPVAPP NQPFTPFETK ERRGAASAAL WFVVVVIAAA AGVGGYALNR KVDRLDQHAT
ERQKALDAQT AELRTKTEQA LASVRQADSQ LSQLEGKLAD AQTAQTALQQ QYQDLSRNRD
AWMIEEVGQM LSSASQQLLL TGNTQLALIA LQNADARLAS SQSAQAVVVR KAIAQDIERL
KAAPSADLTG LAIKLDDAIA KVDTLPLAGE VLAPHAQAKP DAAASARQAA AAAGEPRWKA
WWRGFSAGIG EQLKSLVEVR RIDHADAMLA SPEQGYFVRE NVKLRLLSAR LSLLARDDGA
MKSDLHAAQA AVARYFDGAS KDTRVVQDLL KQVDAASLTV AVPNLNTSLN AVQQFKSRG