Gene BURPS668_A3028 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A3028 
Symbol 
ID4885707 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp2876547 
End bp2878058 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content74% 
IMG OID640132964 
ProductMmgE/PrpD family protein 
Protein accessionYP_001064019 
Protein GI126443089 
COG category[R] General function prediction only 
COG ID[COG2079] Uncharacterized protein involved in propionate catabolism 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGAAG CCGATCTGAC CATCGTACGG GCCGCCGCGC GCCGCGCCGC GCCCGTCGCC 
ACGCCGCCCG CGGACGGCAT CGTCGGCGCG CTCGGACGCT TCGCGGCCGC GGTGCGCACC
GACGGGCTCG AACGGCAACT GCGCGTGGAG GCCGCCGCGC GCGTGCTCGA CCTGCTCGGC
AATAGCCTGA TCGCGCATCG CGAGTCCGTC GCACATGCAG TGCTGCGCGT CGCGCGCGGC
TGGGCGGCGC GAGGCCCGGC GGGCGTCGTC GGCGCGCGCG ACCGGTTGCC CGCCGCGCTC
GCCGCGCTCG TGAACGGTAC GCTCGCGCAC GCGATGGACT TCGACGATTC GCACATGCTG
TCGGTGCTGC ATCCGAGCGC GTCGGTGATT CCCGCGGCGC TCGCCGTGGC CGAGGCGACG
AATGCGTCGG GCGCCGCGCT GCTCGATGCG ATCACGGTCG GCACCGAGAT CTGCATCCGG
CTCGGCGTCG CCGCATACAG CGAGCGGCTC GGCAACTCGG TGTTCTTCGA TCGCGGCCAG
CACGCGACGT CGATCTGCGG CACGCTCGGC GCGGCGGCGG CGGCCGCGAT GCTGTACGGG
CTCGACGCGG CGGGGATCGC GTCGGCGCTC GGCATCGCGG CGAGCATGGG CGCGGGCCTG
CTCGAGGCGA ACCGCACGGG CGGCTCGGTC AAGCGCGTGC ACTGCGGCTG GGCCGCGCAC
GCGGGCGTGA GCGCGGCGGA ATTCGCGGCG GCGGGCGTCA CCGCGCCGCC GACCGCGCTC
GAAGGCCGGT TCGGCTTCTT CCATGCGTGG TGCGGCGATC TCGCCGATCC GAACGCGGTG
CTGAGCCATC TCGGCGACGA ATGGGAGACG AGCCAGATCA TCTTCAAGCC GTATCCGTGC
AACCATTTCA CGCACCCGGG CATCGACGCC GCGCTGCAAC TGAAGGCGCA GGGCCTGCAT
GCGGATGAGG TGGCGTCGAT CGAGCTGAGC GTCGCGAGCC CGACGCTGCG CACGATCGGT
GAGCCCGCCG AAATCAAGAT GCGTCCGCCG AACGGCTATG CGGCCGCCTT TTCGGGGCCG
TACACGGTCG CGGCCGCGCT GCTCGGCGGC GGCGGGCTCG GCGTGTGGTT CGACGATTTC
GACGATGCGC ATGTGCACGA TCCCGCGCGG CGCGCGCTCG CCGCGAAGGT GCGCTGCGTC
GCCGAGCCGT GGTGCGACGC GCGCTTTCCG GCGGGGCTGC CGGCGGTGAT GCGCGTGACG
ACCGTCGGCG GGCACGCGCT CGAGGCGCGC ATCGAATCGA GCAAGGGCAC CAACGCGCGG
CCGCTGACCG AGCAGGAGCT GGCGGCGAAG TTCATGCTGG CCGCGGGCGC GACGCTCGGC
ATGCCGGCGG CGCTCGCGCT GCGCGATGCG GTGTCGGCGC TCGTCGCGGA CGGGCCGCTC
GCGCCGCTCA TGGAACTGAC GTCCGGCACG GCCGGCGCGT CGCCGAACGG CACGGGAGGC
TCGCTTGACT GA
 
Protein sequence
MSEADLTIVR AAARRAAPVA TPPADGIVGA LGRFAAAVRT DGLERQLRVE AAARVLDLLG 
NSLIAHRESV AHAVLRVARG WAARGPAGVV GARDRLPAAL AALVNGTLAH AMDFDDSHML
SVLHPSASVI PAALAVAEAT NASGAALLDA ITVGTEICIR LGVAAYSERL GNSVFFDRGQ
HATSICGTLG AAAAAAMLYG LDAAGIASAL GIAASMGAGL LEANRTGGSV KRVHCGWAAH
AGVSAAEFAA AGVTAPPTAL EGRFGFFHAW CGDLADPNAV LSHLGDEWET SQIIFKPYPC
NHFTHPGIDA ALQLKAQGLH ADEVASIELS VASPTLRTIG EPAEIKMRPP NGYAAAFSGP
YTVAAALLGG GGLGVWFDDF DDAHVHDPAR RALAAKVRCV AEPWCDARFP AGLPAVMRVT
TVGGHALEAR IESSKGTNAR PLTEQELAAK FMLAAGATLG MPAALALRDA VSALVADGPL
APLMELTSGT AGASPNGTGG SLD