Gene BURPS668_1428 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_1428 
Symbol 
ID4883663 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp1395775 
End bp1397106 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content73% 
IMG OID640127356 
Productputative carotenoid 9,10-9',10' cleavage dioxygenase 
Protein accessionYP_001058471 
Protein GI126439831 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGACCA TCGACCTGAA CGCGGGCGCG CTGGCGGCCG TGGCCGGCGA AATCGACGCC 
GTCGACCTGC GCGTGACGGG CGCGCTTGCG CGCGAGCTGA ACGGCGTGCT CGTGCGCAAC
GGGCCCAATC CGCTGCGCGG CCGCTTCGAC GGCGGCGACG TGCTGTCGTG GTGGCCGCAG
GACGCGATGC TGCACGCGAT ATCGTTCGAC GACGGCCGCG CGACCCGCTA CCGCAACCGC
TGGGCGCGCA CGCGGCGCTG GGCGCGCGTG CACGATCCGG CGCGCGAGCC GTCGCTCGTC
GACACGAATC CGAACGTCAA CGTGCTCGCT CACGCGGGCG AGATTCTCGC GCTCGCCGAG
GGCGGCGCGC CGCTCGCGAT CACGGCCGGG CTCGACAGCA TCGGCGCGGC GCGCCGCCAC
CCGGGGCTCG CGCACGGGAT GGCCGCGCAT CCGAAGGTCG ATCCGCACAC GGGCGAGCTG
ATCGCGTTTC GCGCCGACTG GAACCGGCCG TGGCTGCGCT ACCTCGTCGC GGACGCGGCC
GGCGCGCAGA CCGTCGACAC GGAGATCGCG CTGCCCGCGC CGTCGATGAT GCACGACATC
GCGATCACCG CGACGCACAG CATCGTGTTC GACCTGAACG TCGCGTATGA CTTCTCGATG
CTGTCGCGCG GCCATCGGAT GCCGCTGCGC TGGCACGACG CGCGCGGCGC GCGCATCGGC
GTCGTGCCGC GCCGCGGCGG CGATGCGCGC TGGTTCGACA TCGCGCCGTG CTTCATTCAG
CACGTCGTCA ACGCATACGA TCTCGACCGC CCGGCGATCG TGCTGGACGT GGTCCGCTAT
CCGTGGTTCC TGCGCGTCGC CCGCGACGGG CGCGGCTTCG ACGACAACCC GCCCGGCGTG
CTGTGGCGCT ACGTGATCGA TCTCGTGACG GGCACCGTCG CCGAGCAGCC GCTCGACGAC
GCCGGCATCG AGCTGCCCCG GATCGACGCG CGCCGCACCG GGCGCCGCCA CCGGTTCCTG
TACGCGGCCG AGCAGCCGAC CCCCGTGGAG CTGCGCGGCA TCGTGCGCTA CGTCCTCGAC
GGCGGCTCGA CGCAGCGCTA CCGGGTGCCG CCCGGCGACC AGAACAGCGA GCCCGTGTTC
GTCCCGCGTC CGGGCGCGGC GGGCGAAGAC GACGGCTGGC TGCTCGTCTG CGTGTATCGC
CATGCAACGG ATACGAGCGA CGTCGTGATC CTCGACGGCC GGTCGATCGG CGACGGGCCG
ATCGCGACCG TGCACCTGCC GCGCCGCGTC CCGGCGGGTT TTCATGGCGC GTGGCTGCCG
GCCGGCGCAT GA
 
Protein sequence
MTTIDLNAGA LAAVAGEIDA VDLRVTGALA RELNGVLVRN GPNPLRGRFD GGDVLSWWPQ 
DAMLHAISFD DGRATRYRNR WARTRRWARV HDPAREPSLV DTNPNVNVLA HAGEILALAE
GGAPLAITAG LDSIGAARRH PGLAHGMAAH PKVDPHTGEL IAFRADWNRP WLRYLVADAA
GAQTVDTEIA LPAPSMMHDI AITATHSIVF DLNVAYDFSM LSRGHRMPLR WHDARGARIG
VVPRRGGDAR WFDIAPCFIQ HVVNAYDLDR PAIVLDVVRY PWFLRVARDG RGFDDNPPGV
LWRYVIDLVT GTVAEQPLDD AGIELPRIDA RRTGRRHRFL YAAEQPTPVE LRGIVRYVLD
GGSTQRYRVP PGDQNSEPVF VPRPGAAGED DGWLLVCVYR HATDTSDVVI LDGRSIGDGP
IATVHLPRRV PAGFHGAWLP AGA