Gene BURPS1106A_A2733 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2733 
Symbol 
ID4905969 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2665466 
End bp2667082 
Gene Length1617 bp 
Protein Length538 aa 
Translation table11 
GC content64% 
IMG OID640145836 
Producttryptophan halogenase PrnA 
Protein accessionYP_001076763 
Protein GI126455699 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0665] Glycine/D-amino acid oxidases (deaminating) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAACC CGATCCAGAA TATCGTCATC GTGGGCGGCG GCACCGCGGG CTGGATGGCC 
GCCTCGTACC TTGTCCGGGC GCTCCAACAG CAGGCGAACA TTACGCTCAT CGAGTCCGCG
GCGATCCCCC GGATCGGCGT GGGCGAGGCG ACCATCCCGA GTCTGCAGAA GGTGTTCTTC
GACTTCCTCG GGATTCCGGA GCGGGAGTGG ATGCCCCAGG TGAACGGCGC GTTCAAGGCC
GCCATCAAGT TCGTGAACTG GAGGAAGTCT CCCGACCGCT CGCGCGACGA TTACTTCTAC
CATTTGTTCG GCAGCGTGCC GAACTGCGAC GGCGTGCCGC TTACCCACTA CTGGCTGCGC
AAGCGCGAAC AGGGCTTCCG GCAACCGATG GAGTACGCCT GCTACCCGCA GCCCGAGGCG
CTCGACGGCA AGCTGGCACC GTGCCTGCCC GACGGCACCC GCCAGATGTC CCACGCGTGG
CACTTCGACG CGCACCTGGT GGCCGACTTC CTGAAGCGCT GGGCCATCGA ACGCGGGGTG
AACCGCGTGG TCGACGAGGT CGTGCAGGTT CACCTGAACG AGCGCGGCTA CATCTCCAGC
CTGTCCACCC AGGAGGGGCG AACGCTGGAG GCGGACCTGT TCATCGACTG CTCCGGCATG
CGAGGGCTCT TGATCAACCA GGCCCTGAAG GAGCCCTTCA TTGACATGTC CGACTACCTG
CTGTGCGACA GCGCGGTCGC GAGCGCCGTG CCCAACGACG ACGCGCGCGT GGGGGTCGAG
CCGTACACCT CCGCGATCGC CATGAACTCG GGATGGACCT GGAAGATTCC GATGCTGGGC
CGGTTCGGCA GCGGCTACGT CTTCTCGAGC AAGTTCACCT CGCGCGACGA GGCCACCGCC
GACTTCCTCA ATCTCTGGGG TCTCTCGGAC AAGCAGCCGC TCAACCAGAT CAAGTTCCGG
GTCGGGCGCA ACAGGCGGGC GTGGGTCAAC AACTGCGTCG CCATCGGGCT GTCGTCGTGC
TTTTTGGAGC CGCTGGAATC GACGGGAATC TATTTCATCT ACGCGGCGCT TTACCAGCTC
GTGAAGCACT TCCCCGACAC CTCGTTCGAT CCTCGGTTGA CCGACGCGTT CAACGCCGAG
ATCGTCTACA TGTTCGACGA CTGCCGAGAT TTCGTCCAGG CGCACTATTT CACCACGTCG
CGCGAAGACA CACCGTTCTG GCGCGCGAAC CGGCACGACC TGCGGCTCTC GGACGCCATC
AAAGAGAAGG TTCAGCGCTA CAAGGCGGGG CTGCCGCTGA CCACCACGTC GTTCGACGAT
TCCACTTACT ACGAGACGTT CGACTACGAA TTCAAGAACT TCTGGTTGAA CGGAAACTAC
TACTGCATCT TTGCCGGCTT GGGGCTACTG CCCGACCGAT CGCTGCCGCT CTTGCGGCAC
CGATCGGAGT CGATCGACAA GGCCGAGACG ATGTTCGCCC GCATCCGGCG CGAGGCCGAG
CGCCTGCGAA CGAGCCTGCC GACGAACTAC GACTACCTGC GCTCGCTGCG TGAGGGCGAC
GTGGGGCTGT CTCGCAGCCG GCCCGGGCCG ACGCTCGCGG CACCGGAAAT CCTGTAG
 
Protein sequence
MSNPIQNIVI VGGGTAGWMA ASYLVRALQQ QANITLIESA AIPRIGVGEA TIPSLQKVFF 
DFLGIPEREW MPQVNGAFKA AIKFVNWRKS PDRSRDDYFY HLFGSVPNCD GVPLTHYWLR
KREQGFRQPM EYACYPQPEA LDGKLAPCLP DGTRQMSHAW HFDAHLVADF LKRWAIERGV
NRVVDEVVQV HLNERGYISS LSTQEGRTLE ADLFIDCSGM RGLLINQALK EPFIDMSDYL
LCDSAVASAV PNDDARVGVE PYTSAIAMNS GWTWKIPMLG RFGSGYVFSS KFTSRDEATA
DFLNLWGLSD KQPLNQIKFR VGRNRRAWVN NCVAIGLSSC FLEPLESTGI YFIYAALYQL
VKHFPDTSFD PRLTDAFNAE IVYMFDDCRD FVQAHYFTTS REDTPFWRAN RHDLRLSDAI
KEKVQRYKAG LPLTTTSFDD STYYETFDYE FKNFWLNGNY YCIFAGLGLL PDRSLPLLRH
RSESIDKAET MFARIRREAE RLRTSLPTNY DYLRSLREGD VGLSRSRPGP TLAAPEIL