Gene BURPS1106A_A0378 details


Gene Information

Locus tag: BURPS1106A_A0378
Symbol:
ID: 4904024
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 359807
End bp: 360994
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 71%
IMG OID: 640143485
Product: amidohydrolase family protein
Protein accession: YP_001074421
Protein GI: 126455778
COG category: [R] General function prediction only
COG ID: [COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase
TIGRFAM ID: [TIGR01891] amidohydrolase
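
The coordinate and length fields in this record are mutually consistent, which a little arithmetic confirms. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the terminal stop codon is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(359807, 360994)
print(length, protein_length(length))  # 1188 395
```

Both values match the Gene Length and Protein Length fields listed in the record.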


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGACG CACGTTTTAC CGAGGTCGAC GACCTCGCCC CGCTCGCCGA AGCACTGCGC 
GAGATCCGCC ATCGCATCCA CCGCCATCCG GAACTCGCGT ACGAGGAGGT CGAGACGGCC
GCGCTCGTCG CGGACAAGCT CGAAGCCTGG GGCTGGCGGG TGACGCGCGG CGTGGGCGGC
ACGGGCGTGG TCGGCACGCT GCGCGTGGGC GACGGCGCGC GCAGCGTCGG CGTGCGCGCG
GACATGGACG CGCTGCCGAT CGCCGAGGCG ACCGGGCTGC CTTATGCGAG CGCGGTGCCC
GGCAAGATGC ACGCGTGCGG CCACGACGGC CACACTGCGA TGCTGCTCGG CGCCGCATGG
CGGCTCGCGC AGGCGCGCCA CTTCTCCGGC ACCGTTCATC TGTATTTTCA GCCGGCCGAG
GAGCACGGCG TCGACAGCGG CGCGAAGCGC ATGATCGACG ACGGCCTTTT CGAGCGCTTT
CCGTGCGACG CGGTGTTCGG GATGCACAAC CATCCGGGCG TCGAGCCGGG CGTGTTCCTC
ACGCGGCGGG GGGCGTTCAT GTCGGCGGGC GACAAGGCGG TGATCGACAT CCACGGCGTG
GGCGGCCATG CGGCGCGGCC GCATCTGGCG GTCGATCCGG TCGTCGTCGC GGCGAGCGTC
GTGATGGCGC TGCAGACGAT CGTCGCGCGC AACGTCGATC CCGCGCAGCC CGCCGTCGTG
ACGGTCGGCT CGCTGCACGC CGGCACCGCG AACAACGTCA TTCCGAGCCG CGCGCGGCTC
GAGCTCTCCG TGCGCTCGTT CGATCCCGAG GTGCGCGCGC TGCTCAGGCG CCGGATCACC
GAGCTCGCCC AGGCGCAGGC GGCCAGCTAC GGCGCGAGCG CGAACGTCGA GTACATCGAG
GGCTACCCGG TCGTCGTCAA TTCGGACGCC GAAACCGACT TCGCCGCGCA GGTCGCGAAG
GAGCTGGTGG GCGAGCGCAA CGTCGTCGAG CAGGCCGACA TCCTGATGGG CAGCGAGGAT
TTCGCGTTCA TGCTGCAGCG GCGGCCGGGC TCGTTCGTGC GGCTCGGCAA CGGCGCGGGC
GAGGAAGGCT GCATGGTGCA CAACCCGAAA TACGACTTCA ACGATCGCAA CCTCGTGACG
GGCGCGGCGT TCTGGGCGCG GCTCGTCGAG CGGTATCTGG CGCGGTAG
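
The gene sequence is displayed in space-separated blocks of 10 bases, so the spaces and line breaks have to be stripped before any computation. The sketch below shows one way to do that and to compute a GC fraction using the usual definition, (G + C) / total bases. Whether the database derives its GC content field in exactly this way is an assumption, and the helper names are hypothetical.

```python
def clean_sequence(blocked: str) -> str:
    """Remove display spaces and line breaks, and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGAACGACG CACGTTTTAC CGAGGTCGAC GACCTCGCCC CGCTCGCCGA AGCACTGCGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full 1188 bp sequence through the same two functions gives a value that can be compared with the GC content field.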
 
Protein sequence
MNDARFTEVD DLAPLAEALR EIRHRIHRHP ELAYEEVETA ALVADKLEAW GWRVTRGVGG 
TGVVGTLRVG DGARSVGVRA DMDALPIAEA TGLPYASAVP GKMHACGHDG HTAMLLGAAW
RLAQARHFSG TVHLYFQPAE EHGVDSGAKR MIDDGLFERF PCDAVFGMHN HPGVEPGVFL
TRRGAFMSAG DKAVIDIHGV GGHAARPHLA VDPVVVAASV VMALQTIVAR NVDPAQPAVV
TVGSLHAGTA NNVIPSRARL ELSVRSFDPE VRALLRRRIT ELAQAQAASY GASANVEYIE
GYPVVVNSDA ETDFAAQVAK ELVGERNVVE QADILMGSED FAFMLQRRPG SFVRLGNGAG
EEGCMVHNPK YDFNDRNLVT GAAFWARLVE RYLAR
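
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch rather than the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code, and it does not model the alternative start codons that table 11 permits. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
# Table 11 shares these assignments with the standard code; "*" marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop display spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAACGACG CACGTTTTAC CGAGGTCGAC"))  # MNDARFTEVD
```

The output matches the first block of the protein sequence. The sketch raises `KeyError` on ambiguous bases such as N, which do not occur in this record.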