Gene BURPS1106A_A0029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A0029 
Symbol 
ID4906276 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp21879 
End bp23285 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content65% 
IMG OID640143136 
Productcytochrome P450 family protein 
Protein accessionYP_001074072 
Protein GI126458038 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGGCC TCCCTCTGTC GCTTCCGCGA GTCGACAGCA CCGATTCGCT CTTTGCGGAT 
CCGCTCGCTT TTCTCGCACG AGCGCGCGCA CAATGCGGCG ACGTCTTCGT GCTGCGGGAG
CACGGTCCGA TTTTTTCTCG CGCGAGCGAT TGCAGCGGCG TGATCGCCGT GTTCGGCGAA
CATCGGCTTC GACAAGTCTT GACCGAGACG GATACCTTCG CGCTGCCGAT GTCAGCGGCG
GCGAAGATGG CGTTGCCGAA GAATCTGGTC AACCTCAACC GCGGCTTGCA CAGCATGCGC
GAGCCGGAGC ACGGCCGGCA CAAGCGCCTC CTGACGGGAA CGGCCAACCG CGCGCTGTTC
GACGCGCATC GATCCGGAAT GCAAACGGAT TTGAGCCGCT TTTGCGAAAT GCTGAACGGG
GACGGCCGGA TTTCCGTCGT GAGCCGAATG CGCGAGCTGA CAGCCAAGTT GGCGTCCCGT
CTTTTTCTTG GGCCGCAGTG TGAGGAGGAT GCCGAACTGA CGTTTCTGCT AAGCGCGTAT
TTCACGTTGC GGCGCGAGGC GTCTTCGCCC AACGCGCACG ATCCGTTGCA GTACCGGGAC
GCGCTGATCG GCGCCGGGCG GCAACTCGAT CGCGCGTTGC GGGCGCGGAT CCGGCGGTAT
CGAAAGGCGC CCGCCGGCGA GTGCGCGGGG CTGCTCCAGC GGCTGGCGAC GGCCGGCCAG
CCGGGTTCGC CCGCGCTTTC CGAGGATGAG ATCGTCGGCC ACGCCAACGT GATGTTCGTG
TCGAGCACGG AGCCGGTGGC GATATCGCTG GCGTGGCTGC TGCTCGTTCT GTCGCAGTTG
CCCGATCTGC GGCGTGCGCT TCGCGCGGAA AGCGCCGCCC GCGCATCGTC GCCGGCTTCG
CCGTACGACG CGCCATTGCT CGAGCGCGTC ATCAACGAGA CGCTTCGGCT CCTGACGCCC
AACGCGCTGA TGGTCCGCGC CACGACGCAG GCGGTTTCGT TGCAAGGCGT CGCGCTTCCC
GCGCGCTGCG AGATTGTCGT GTGCCCGTTC CTCGTGCACC GCGAGGCCAA CGCGTTTGCG
CGCCCGCATG CATTCTTGCC GTCCCGATGG GAGACGGCGA GGCCGTCGCC GTACGAGTAT
TTTCCATTCG GCGCCGGCGG CCATTTCTGC GCGGGGCGGA ATCTCGCGCT GTCGCTGATT
GGCGAAGTGC TGTCGACGCT GCTATCGCGA TTCGATTTCG TTTTGGATAT CGAACAGTTT
ATCGACTGGC GCATTCATAT CATGCTGATG CCGAAAGGCG ACCCGACGCT CACGGCGCAC
CCCGTCGACG AACGCCGCGA CACGCCGTCG CCGAAATGGC GCGGGCCTGT CGCCGAGCTG
TTCCATTTCG CGCCGGGGCT TTCTTGA
 
Protein sequence
MNGLPLSLPR VDSTDSLFAD PLAFLARARA QCGDVFVLRE HGPIFSRASD CSGVIAVFGE 
HRLRQVLTET DTFALPMSAA AKMALPKNLV NLNRGLHSMR EPEHGRHKRL LTGTANRALF
DAHRSGMQTD LSRFCEMLNG DGRISVVSRM RELTAKLASR LFLGPQCEED AELTFLLSAY
FTLRREASSP NAHDPLQYRD ALIGAGRQLD RALRARIRRY RKAPAGECAG LLQRLATAGQ
PGSPALSEDE IVGHANVMFV SSTEPVAISL AWLLLVLSQL PDLRRALRAE SAARASSPAS
PYDAPLLERV INETLRLLTP NALMVRATTQ AVSLQGVALP ARCEIVVCPF LVHREANAFA
RPHAFLPSRW ETARPSPYEY FPFGAGGHFC AGRNLALSLI GEVLSTLLSR FDFVLDIEQF
IDWRIHIMLM PKGDPTLTAH PVDERRDTPS PKWRGPVAEL FHFAPGLS