Gene Bpro_1035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBpro_1035 
Symbol 
ID4012156 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas sp. JS666 
KingdomBacteria 
Replicon accessionNC_007948 
Strand
Start bp1061850 
End bp1063127 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content71% 
IMG OID637940713 
Productimidazolonepropionase 
Protein accessionYP_547886 
Protein GI91786934 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID[TIGR01224] imidazolonepropionase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.281228 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACAG CCCCCCACCC CGCTGCCGAC GGCATCTGGG AGCATTTGCG CCTGATGCCT 
GGCGCGCTGG CCGACGACAG CCCCGTGGCG ACGAACACCG AGGCCGCCAT CGTCGTCACC
GAAGGCCGGA TCCGCTGGAT AGGCGCCAGC GCCGCGCTGC CCGCCGGGTT CAGCGCGCTG
CCGCGTTTTG ACGGCGGCGG CGCGCTGGTG ACGCCCGGCC TGGTCGATTG CCATACCCAT
CTGGTGTACG GTGGCCAGCG CGCCAACGAG TTCGCCATGC GGCTGGCCGG TGCCAGCTAT
GAAGAGGTGG CGAAGGCCGG CGGCGGCATC GTCTCCAGCG TGCGCGCCAC CCGTGCGGCC
GGTGAAGACG AGCTTTTTGC GCAGGCCGCG CCTCGCCTGG AACAACTGCT GGCCGATGGC
GTGTGCGCCA TCGAAATCAA GTCCGGCTAT GGTCTCGCCC TCGAACACGA GCGCAAGCAG
CTGCGCGTGG CGCGCCGGCT CGGTGAGGCC TACGGCGTCA CCGTGCGCAC CACCTTCCTC
GGCGCGCACG CACTGCCGCC CGAGTACGCG GGCCGCAGCC AGGACTACAT CGACCTGGTC
TGCCGCGAGA TGCTGCCCGC ACTCGCCGCC GAAGGCCTGG TCGATGCGGT GGACGTATTT
TGCGAACGCA TCGCGTTTTC GCTGAGCGAG ACCGAGCAGG TGTTCCAGGC CGCGCAGCGT
TTAGGCCTGC CGGTCAAGCT GCATGCCGAG CAGCTCTCCG ACATGGGCGG CGCCGCGCTC
GCCGCGCGTT ATGGCGCGCT GTCGTGCGAC CACATCGAAC ACCTGTCGCA GGCCGGCATC
GACGCCATGC GCGCGGCCGG CACGGTGGCC GTGCTGCTGC CCGGCGCCTA CTACACGCTG
CGCGACACCC ACCTGCCACC GATCGCCGCG CTGCGCGAAG CCGGCGTGCC CATGGCCGTC
TCGACCGACC ACAACCCCGG CACGTCGCCC GCGCTCAGCC TGCTGCTCAT GGCCAACATG
GCCTGCACAC TGTTTCGCCT GACCGTGCCG GAAGCGCTGG CCGGCATCAC GCGCCACGCA
GCCCGTGCGC TCGGACTGCA GGACACGCAC GGCGCACTCG GCGTGGGCCG GCCCGCCAAT
TTCGTGCTGT GGCAGCTGAA TGACAGCGCC GAGCTGGCCT ACTGGCTGGG CCAGCAGGCG
CCGCGCACCA TCGTGCGGCA GGGGCGCGTT GCGCTCGACG GGCTCCAGAT CGCCCCCAAC
GCCAGGATCA CCCCATGA
 
Protein sequence
MTTAPHPAAD GIWEHLRLMP GALADDSPVA TNTEAAIVVT EGRIRWIGAS AALPAGFSAL 
PRFDGGGALV TPGLVDCHTH LVYGGQRANE FAMRLAGASY EEVAKAGGGI VSSVRATRAA
GEDELFAQAA PRLEQLLADG VCAIEIKSGY GLALEHERKQ LRVARRLGEA YGVTVRTTFL
GAHALPPEYA GRSQDYIDLV CREMLPALAA EGLVDAVDVF CERIAFSLSE TEQVFQAAQR
LGLPVKLHAE QLSDMGGAAL AARYGALSCD HIEHLSQAGI DAMRAAGTVA VLLPGAYYTL
RDTHLPPIAA LREAGVPMAV STDHNPGTSP ALSLLLMANM ACTLFRLTVP EALAGITRHA
ARALGLQDTH GALGVGRPAN FVLWQLNDSA ELAYWLGQQA PRTIVRQGRV ALDGLQIAPN
ARITP