Gene Bpro_1043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBpro_1043 
Symbol 
ID4012261 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas sp. JS666 
KingdomBacteria 
Replicon accessionNC_007948 
Strand
Start bp1069620 
End bp1071164 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content70% 
IMG OID637940721 
Producthistidine ammonia-lyase 
Protein accessionYP_547894 
Protein GI91786942 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2986] Histidine ammonia-lyase 
TIGRFAM ID[TIGR01225] histidine ammonia-lyase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.693412 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.100833 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAACAT CCACCCACAT CCTGCACCCC GGCCGCGTCA GCCTGGCCCT GTTGCGCAGC 
ATCCATGCCG GCGGCGTGCA GCTGGCCCTG GCGCCCGAGG CCCGTGCCGG CCTGCTGGCA
GCCCAGGCCA CCGTGCAGCG CATCGTCGAT GAAGACCAGG TCGTCTACGG CATCAACACC
GGCTTCGGCA AACTGGCCAG CACCAAGATT GCGCCCAACC GCCTGGTCGA ATTACAGCGC
AACCTGGTGC TGTCGCACAG CGTGGGCACC GGCGAGCCGC TACCGGCCGC CGTGGTGCGC
GTGATCCTCG CTACCAAGGC GGTGAGCCTG GCGCGCGGCC ACTCCGGCGT GCGGCCCGAA
CTGGTGGACG CCTTGCTGGC GCTGGCCAAC GCGGGCGTGA TGCCGCGCAT CCCGGCCAAG
GGCTCCGTGG GTGCCTCCGG CGACCTGGCG CCTCTGGCGC ACCTGGCCTG CGTGCTGATC
GGCGAAGGCC AGGCCACCAC CGCCGACGGC GAAGTCATTT CCGGCGCCCA GGCCATGCGC
CGCATCGGCG TGGCGCCGTT TGTGCTCGGC CCCAAGGAAG GCCTGGCGCT GCTCAACGGC
ACCCAGGTGT CTACCGCCCT GGCGCTGGCC GGCCTGTTCG GCGCCGAACA GGTATTTGCC
GCGGGCCTGG TGTCCGGCGC CTTGTCGCTG GAAGCCGTGC AGGGCTCCAT CAAGCCGTTT
GACGCGCGGG TGCATGCCGC GCGTGGCCAG CCCGGCCAGA TGGCCGTGGC GGCGGCGGTG
CGCACCCTGC TCGAAGGCAG CGAGATCGTG CCGTCGCACC CCGACTGCGG CCGCGTGCAG
GACCCGTATT CGATCCGCTG CGTGCCCCAA GTCATGGGCG CCTGCCTGGA CAACCTGCAG
CACGCCGCCC GGGTGCTGCA GATCGAGGCC AATGCCGCCT CCGACAACCC GCTGGTCTTC
GACAACGGCG ACGTGATCTC CGGCGGCAAC TTCCACGCCG AGCCGGTGGC CTTTGCGGCC
GACATCATCG CGCTGGCCGT GGCCGAGATC GGCGCGATTG CCGAGCGCCG CCTGGCACTG
CTGCTCGACA CCGGCCTCTC CGGCCTGCCG CCTTTCCTGG TGCGCGATGG CGGCGTCAAC
TCCGGCTTCA TGATTGCCCA GGTCACCGCC GCCGCGCTGG CCTCGGAAAA CAAATCCCTG
GCGCACCCGG CCAGCGTGGA CAGCCTGCCC ACCTCGGCCA ACCAGGAGGA CCATGTGTCG
ATGGCGACCT TCGCCGCGCG CCGCCTGGGC GAGATGGTGA ACAACACCGC CGTCGTCGTC
GGCATCGAAG CCATGGGCGC TGCTCAAGGC ATAGAACTCA AGCGCGGGCT CAAAAGCTCC
CCGCTGATCG AAGCCGAGTT CGCCCGCATC CGCGGGAAGG TGGCCTTCCT CGACCAGGAC
CGCTTCCTCG CGACCGACGT CGAAGCCATG CGCCAGTGGG CCTTGCACAG CGACTGGCCC
GCCGCCCTGC AGCAGATCCT GCCCAGCCAC GCCACCGCGG CCTGA
 
Protein sequence
MTTSTHILHP GRVSLALLRS IHAGGVQLAL APEARAGLLA AQATVQRIVD EDQVVYGINT 
GFGKLASTKI APNRLVELQR NLVLSHSVGT GEPLPAAVVR VILATKAVSL ARGHSGVRPE
LVDALLALAN AGVMPRIPAK GSVGASGDLA PLAHLACVLI GEGQATTADG EVISGAQAMR
RIGVAPFVLG PKEGLALLNG TQVSTALALA GLFGAEQVFA AGLVSGALSL EAVQGSIKPF
DARVHAARGQ PGQMAVAAAV RTLLEGSEIV PSHPDCGRVQ DPYSIRCVPQ VMGACLDNLQ
HAARVLQIEA NAASDNPLVF DNGDVISGGN FHAEPVAFAA DIIALAVAEI GAIAERRLAL
LLDTGLSGLP PFLVRDGGVN SGFMIAQVTA AALASENKSL AHPASVDSLP TSANQEDHVS
MATFAARRLG EMVNNTAVVV GIEAMGAAQG IELKRGLKSS PLIEAEFARI RGKVAFLDQD
RFLATDVEAM RQWALHSDWP AALQQILPSH ATAA