Gene BURPS1106A_2039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_2039 
SymbolamaB 
ID4900083 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp2020858 
End bp2022138 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content69% 
IMG OID640135269 
Productallantoate amidohydrolase 
Protein accessionYP_001066304 
Protein GI126451874 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 
TIGRFAM ID[TIGR01879] amidase, hydantoinase/carbamoylase family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCCGA CGGTCTTTCC GCCCCTGAAT GCCGAGCGCC TGAATGCGCG CGTCGAGCAA 
CTCGCGCGTT TCACGCGAGC CGACGTTCCG TGGACGCGCC GCGCGTTCTC GCCGCTGTTT
ACGGAAGCGC GCGCCTGGCT CGCCGCGCAG TTCGCCGCAG CCGGGCTCGC GGTATCGATG
GACGCCGCCG GCAACCTGAT CGGCCGCCGC GAGGGCAGCG GCCGCTGCAC GAAGCCGTTG
ATTACGGGCT CGCACTGCGA CACGGTCGTC GGCGGCGGGC GCTTCGATGG GATCATCGGC
GTGCTCGCCG GCATCGAGGT CGCGCATACG CTGAACGAGC AGGGGATCGT GCTCGACCAT
CCGCTCGAAG TGATCGACTT CCTGTCCGAG GAGCCGAGCG ACTACGGCAT CTCGTGCGTC
GGCAGCCGCG CGCTGTCGGG CGTGCTCGAT GCGGGCATGC TACGCGCGAC GAACGGGGAA
GGCGAAACGC TCGCGCAGGC GCTGACGCGC ATCGGTGGCA AGCCGGAGGC GCTGAACATG
CCGTTGCGCG CGCCAGGCAG CACGGCTGCA TTCGTCGAAC TGCATATCGA GCAGGGCCCG
GTGCTGGAAG CGCGCGGCCT GCCGATCGGC GTCGTGACCA ATATCGTCGG CATCCGGCGC
GTGCTGATCA CCGTGATCGG GCAGCCCGAC CATGCGGGGA CGACGCCGAT GGACATTCGT
CGAGACGCGC TTGTCGGTGC CGCACACCTG ATTGAGGCCG CGCATGCGCG CGCGTTGTCG
CTGTCGGGAA ATCCACACTA CGTGGTCGCG ACGATCGGGC GGATCGCGAT GACGCCGAAC
GTGCCGAACG CGGTGCCGGG GCAGGTCGAG CTCATGCTGG AAGTGCGGAG CGACAGCGAC
GCGGTGCTCG ACGCGTTTCC CGAGACGCTG CTGGCCGGTG CGGCCGCGCA GCTCGACGCG
TTGCGGTTGA GCGCGCACGC GGAGCATGTG AGCCGCGCGC GGCCGACCGA CTGCCAGCCG
CTCGTAATGG ACGCGGTCGA GCAGGCGGCA GCCCAGCTCG GCTACCCGAG CATGCGTTTG
CCGAGCGGCG CGGGGCACGA TGCCGTGTAT GTCGCGCCGA CCGGGCCGAT CGGGATGATC
TTCATTCCGT GCCTGGGTGG GCGCAGCCAT TGCTCGGAGG AATGGATCGA GCCGCAGCAG
TTGCTCGACG GCACGCGCGT GCTGTACCGG ACGCTCGTCG TGCTCGATCG CACGCTGGCA
GCGCATGAAA CCGGCCGCTG A
 
Protein sequence
MNPTVFPPLN AERLNARVEQ LARFTRADVP WTRRAFSPLF TEARAWLAAQ FAAAGLAVSM 
DAAGNLIGRR EGSGRCTKPL ITGSHCDTVV GGGRFDGIIG VLAGIEVAHT LNEQGIVLDH
PLEVIDFLSE EPSDYGISCV GSRALSGVLD AGMLRATNGE GETLAQALTR IGGKPEALNM
PLRAPGSTAA FVELHIEQGP VLEARGLPIG VVTNIVGIRR VLITVIGQPD HAGTTPMDIR
RDALVGAAHL IEAAHARALS LSGNPHYVVA TIGRIAMTPN VPNAVPGQVE LMLEVRSDSD
AVLDAFPETL LAGAAAQLDA LRLSAHAEHV SRARPTDCQP LVMDAVEQAA AQLGYPSMRL
PSGAGHDAVY VAPTGPIGMI FIPCLGGRSH CSEEWIEPQQ LLDGTRVLYR TLVVLDRTLA
AHETGR