Gene Bcep18194_A6385 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_A6385 
Symbol 
ID3751618 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007510 
Strand
Start bp3552115 
End bp3553182 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content73% 
IMG OID637764706 
Productallophanate hydrolase subunit 2 
Protein accessionYP_370623 
Protein GI78067854 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1984] Allophanate hydrolase subunit 2 
TIGRFAM ID[TIGR00724] biotin-dependent carboxylase uncharacterized domain 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCAGA GCAACGTACC GGCCACCATC GAGGTGGTGC GGGCCGGCCC GCTGACCACC 
GTGCAGGATC TCGGGCGCCG CGGCACGCGC CATCTCGGCG TCGCGCAGGG TGGCGCGCTC
GACGGCCTCG CGCTCGAAGT CGGCAACCGG CTGGTCGGCA ACCGCCCCGA TGCGGCGGCC
GTCGAGATCA CGATCGGCCC GGCCGCTTTC CGCTTCCCGC GCGCGACCCG CATCGCGATC
ACCGGCACCG AGTTCGGTGC AACGCTCGAC GGCCAGCGCG TATATTCATG GTGGAGCCTG
CCGGTCGAGG CCGGGCAAAC GCTCGTGCTG CCCGCCGCGA AACGCGGGAT GCGCGGCTAC
CTGTGCATCG CCGGCGGCAT CGACGTGCTG CCGATGCTCG GCTCGCGCAG CACCGATCTC
GCATCGCGCT TCGGGGGCCT CGGCGGCCGC GCGCTGCGCG ACGGCGATCG CCTCCCGGTC
GGCGTGCTTC CGGCCGGGAT GGGTTGCCTC GCGGCCGACG CGCCCGAATT CGGTGTCAAG
GCGCCCGCGT GGTGCGCATT CGTGCGCGTC GACGAGCCGC CGCGCCGTCA CCGCCCCGCG
CACGCGCCGT GGGCGATGCC CGTGCGCGTG CTGCCGGGCC CCGACTACGC GTCGTTCGCC
GCCGATTCGC AGCAAGCGTT CTGGGATGAA GAATGGCTGG TCACGGCAAA CAGCAACCGC
ATGGGCTACC GGCTCGCCGG CGTCGAGCTC GCGCGCGAGC GGCCGGCCGA ACTGCTGTCG
CACGCGGTGC TGCCCGGCAC GATCCAGGTG CCGCCGAACG GCCAGCCGAT CGTGCTGATG
CACGACGCGC AGACCACCGG CGGCTACCCG AAGATCGGCA CGGTGATCCG CGCGGATCTG
TGGAAACTCG CGCAGGCGCG GCTCAACCTG CCGATCCGCT TCGTGCGCAC GACGCCCGAT
GCCGCGCGCG CCGCGCTGGC CGCGGAACGT GCGTATCTGC GGCAGATCGA CGTGGCGATC
GACATGCGCG AGGAAGCGCG CCGCCGCGCC CAGTCGCGTG CGGCATGA
 
Protein sequence
MTQSNVPATI EVVRAGPLTT VQDLGRRGTR HLGVAQGGAL DGLALEVGNR LVGNRPDAAA 
VEITIGPAAF RFPRATRIAI TGTEFGATLD GQRVYSWWSL PVEAGQTLVL PAAKRGMRGY
LCIAGGIDVL PMLGSRSTDL ASRFGGLGGR ALRDGDRLPV GVLPAGMGCL AADAPEFGVK
APAWCAFVRV DEPPRRHRPA HAPWAMPVRV LPGPDYASFA ADSQQAFWDE EWLVTANSNR
MGYRLAGVEL ARERPAELLS HAVLPGTIQV PPNGQPIVLM HDAQTTGGYP KIGTVIRADL
WKLAQARLNL PIRFVRTTPD AARAALAAER AYLRQIDVAI DMREEARRRA QSRAA