Gene BURPS668_A2885 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A2885 
Symbol 
ID4888107 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp2738772 
End bp2740388 
Gene Length1617 bp 
Protein Length538 aa 
Translation table11 
GC content64% 
IMG OID640132821 
Producttryptophan halogenase PrnA 
Protein accessionYP_001063877 
Protein GI126444889 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0665] Glycine/D-amino acid oxidases (deaminating) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAACC CGATCCAGAA TATCGTCATC GTGGGCGGCG GCACCGCGGG CTGGATGGCC 
GCCTCGTACC TTGTCCGGGC GCTCCAACAG CAGGCGAACA TTACGCTCAT CGAGTCCGCG
GCGATCCCCC GGATCGGCGT GGGCGAGGCG ACCATCCCGA GTTTGCAGAA GGTGTTCTTC
GACTTCCTCG GGATTCCGGA GCGGGAGTGG ATGCCCCAGG TGAACGGCGC GTTCAAGGCC
GCCATCAAGT TCGTGAACTG GAGGAAGTCT CCCGACCGCT CGCGCGACGA TTACTTCTAC
CATTTGTTCG GCAGCGTGCC GAACTGCGAC GGCGTGCCGC TTACCCACTA CTGGCTGCGC
AAGCGCGAAC AGGGCTTCCG GCAACCGATG GAGTACGCCT GCTACCCGCA GCCCGAGGCG
CTCGACGGCA AGCTTGCACC GTGCCTGCCC GACGGCACCC GCCAGATGTC CCACGCGTGG
CACTTCGACG CGCACCTGGT GGCCGACTTC CTGAAGCGCT GGGCCATCGA ACGCGGGGTG
AACCGCGTGG TCGACGAGGT CGTGCAGGTT CACCTGAACG AGCGCGGCCA CATCTCCAGC
CTGTCCACCC AGGAGGGGCG AACGCTGGAG GCGGACCTGT TCATCGACTG CTCCGGCATG
CGAGGGCTCT TGATCAACCA GGCCCTGAAG GAGCCCTTCA TTGACATGTC CGACTACCTG
CTGTGCGACA GCGCGGTCGC GAGCGCCGTG CCCAACGACG ACGCGCGCGT GGGGGTCGAG
CCGTACACCT CCGCGATCGC CATGAACTCG GGATGGACCT GGAAGATTCC GATGCTGGGC
CGGTTCGGCA GCGGCTACGT CTTCTCGAGC AAGTTCACCT CGCGCGACGA GGCCACCGCC
GACTTCCTCA ATCTCTGGGG TCTCTCGGAC AAGCAGCCGC TCAACCAGAT CAGGTTCCGG
GTCGGGCGCA ACAGGCGGGC GTGGGTCAAC AACTGCGTCG CCATCGGGCT GTCGTCGTGC
TTTTTGGAGC CGCTGGAATC GACGGGAATC TATTTCATCT ACGCGGCGCT TTACCAGCTC
GTGAAGCACT TCCCCGACAC CTCGTTCGAT CCTCGGTTGA CCGACGCGTT CAACGCCGAG
ATCGTCTACA TGTTCGACGA CTGCCGAGAT TTCGTCCAGG CGCACTATTT CACCACGTCG
CGCGAAGACA CACCGTTCTG GCGCGCGAAC CGGCACGACC TGCGGCTCTC GGACGCCATC
AAAGAGAAGG TTCAGCGCTA CAAGGCGGGG CTGCCGCTGA CCACCACGTC GTTCGACGAT
TCCACGTACT ACGAGACGTT CGACTACGAA TTCAAGAACT TCTGGTTGAA CGGAAACTAC
TACTGCATCT TTGCCGGCTT GGGGCTACTG CCCGACCGAT CGCTGCCGCT CTTGCGGCAC
CGATCGGAGT CGATCGATAA GGCCGAGACG ATGTTCGCCC GCATCCGGCG CGAGGCAGAG
CGCCTGCGAA CGAGCCTGCC GACGAACTAC GACTACCTGC GCTCGCTGCG TGAGGGCGAC
GTGGGGCTGT CTCGCAGCCG GCCCGGGCCG ACGCTCGCGG CACCGGAAAT CCTGTAG
 
Protein sequence
MSNPIQNIVI VGGGTAGWMA ASYLVRALQQ QANITLIESA AIPRIGVGEA TIPSLQKVFF 
DFLGIPEREW MPQVNGAFKA AIKFVNWRKS PDRSRDDYFY HLFGSVPNCD GVPLTHYWLR
KREQGFRQPM EYACYPQPEA LDGKLAPCLP DGTRQMSHAW HFDAHLVADF LKRWAIERGV
NRVVDEVVQV HLNERGHISS LSTQEGRTLE ADLFIDCSGM RGLLINQALK EPFIDMSDYL
LCDSAVASAV PNDDARVGVE PYTSAIAMNS GWTWKIPMLG RFGSGYVFSS KFTSRDEATA
DFLNLWGLSD KQPLNQIRFR VGRNRRAWVN NCVAIGLSSC FLEPLESTGI YFIYAALYQL
VKHFPDTSFD PRLTDAFNAE IVYMFDDCRD FVQAHYFTTS REDTPFWRAN RHDLRLSDAI
KEKVQRYKAG LPLTTTSFDD STYYETFDYE FKNFWLNGNY YCIFAGLGLL PDRSLPLLRH
RSESIDKAET MFARIRREAE RLRTSLPTNY DYLRSLREGD VGLSRSRPGP TLAAPEIL