Gene BTH_I0229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_I0229 
Symbol 
ID3846923 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007651 
Strand
Start bp262965 
End bp264044 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content74% 
IMG OID637839902 
Producturea amidolyase-related protein 
Protein accessionYP_440787 
Protein GI83719147 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1984] Allophanate hydrolase subunit 2 
TIGRFAM ID[TIGR00724] biotin-dependent carboxylase uncharacterized domain 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTTCAC GCCAAGCAAC AGGCGGCATC GAGGTGCTGC GCGCCGGGCC GCTGTCGACG 
GTCCAGGATC TCGGACGCCG CGGCTATCGC CATTTGGGCG TCGCGCAAAG CGGCGCGCTC
GACTCGCTCG CGCTCGAAGT CGGCAATCGC CTCGTCGGCA ACCGGCCGGA CGCCGCCGCA
ATCGAAATCA CGCTCGGCCC CGCGTCGTTC CGGTTTCCGC GCGCGACCCG CATCGCGATC
ACCGGCACCG AGTTCGGCGC GACGCTCGAC GGCGCGCCGA TCTACTCGTG GTGGAGCGTG
CCCGTCGCGG CCGGCCAGAC GCTCGCGCTG CCCGTCGCGA AGCGCGGAAT GCGCGGCTAC
CTGTGCGTCG CGGGCGGCAT CGACGTGCTG CCGATGCTCG GCTCGCGCAG CACCGATCTC
GCGTCGCGCT TCGGCGGCCT CGGCGGCCGC GTGCTGCGCG ACGGCGACAG GCTGCCCGTC
GGCGCGCCGC CCGCCGCCGC GCCCGCGTGC GTCGGGCCCG ACGCGCCCGA GTTCGGCGTG
AAGGCTCCCG CGTGGTGCGC GTTCGCGCGC GTCGACAAGG AGCCGCGCCG CCCCAAGCAT
GCGCACGCCG CATGGGCGAT GCCCGTGCGC GTGCTGCCCG GCCCGCAATA CGCGAGCTTC
ACGCCGGCAT CGCAGCAGAC TTTCTGGGAT GAGGAATGGG TCGTCACGCC GAACAGCAAC
CGGATGGGCT ACCGGCTCGC GGGCGCGAAG CTCGAGCGCG CCGAGACGGG CGACCTGCTG
TCGCACGCGG TGCTGCCGGG CACGATCCAG GTGCCGGGCA ACGGCCAGCC GATCGTGCTG
ATGAACGACG CGCAAACGAC GGGCGGCTAT CCGCGGATCG GCGCCGTGAT CCGCGCGGAC
CTCTGGAAGC TCGCGCAGGC GCGCCTGAAC CTGCCGGTGC GCTTCGTCCG CGTGACGGCG
GCGGCCGCGC GCGACGCGCT CGCCGCCGAG CGCGCGTACC TGCGGCAGAT CGACATCGCG
ATCGAGATGC GCGAGGAGGC GCTGCAGCGC GCGCTCGCCG CGCGCGCAGC GGCCGCATGA
 
Protein sequence
MTSRQATGGI EVLRAGPLST VQDLGRRGYR HLGVAQSGAL DSLALEVGNR LVGNRPDAAA 
IEITLGPASF RFPRATRIAI TGTEFGATLD GAPIYSWWSV PVAAGQTLAL PVAKRGMRGY
LCVAGGIDVL PMLGSRSTDL ASRFGGLGGR VLRDGDRLPV GAPPAAAPAC VGPDAPEFGV
KAPAWCAFAR VDKEPRRPKH AHAAWAMPVR VLPGPQYASF TPASQQTFWD EEWVVTPNSN
RMGYRLAGAK LERAETGDLL SHAVLPGTIQ VPGNGQPIVL MNDAQTTGGY PRIGAVIRAD
LWKLAQARLN LPVRFVRVTA AAARDALAAE RAYLRQIDIA IEMREEALQR ALAARAAAA