Gene Bpro_1042 details

Gene Information

Locus tag: Bpro_1042
Symbol:
ID: 4012260
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas sp. JS666
Kingdom: Bacteria
Replicon accession: NC_007948
Strand:
Start bp: 1067822
End bp: 1069561
Gene Length: 1740 bp
Protein Length: 579 aa
Translation table: 11
GC content: 66%
IMG OID: 637940720
Product: urocanate hydratase
Protein accession: YP_547893
Protein GI: 91786941
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2987] Urocanate hydratase
TIGRFAM ID: [TIGR01228] urocanate hydratase
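
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon while the protein length does not. Neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1067822
end_bp = 1069561
gene_length_bp = 1740
protein_length_aa = 579

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = span_bp // 3

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", span_bp % 3 == 0)
print("codons minus stop matches protein length:", codons - 1 == protein_length_aa)
```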


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0991609
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGCCA AGGATTCCTT CATCGCCCCC ACCGACGCCC AAAGCGGCAC CCTCACCGAC 
CCGCGCGTCG ACCCCACCCG TGTGATCCGC GCGCCGCGCG GCAGCCAGCT CCATTGCAAA
AGCTGGCTCA CCGAAGCCGC CTTTCGCATG ATCCAGAACA ACCTCGACCC CGAGGTGGCC
GAGAACCCGC AAAGCCTGGT GGTCTACGGC GGCATCGGCC GCGCCGCACG CGACTGGGCC
TGCTTCGACC AGATTCTGGC TTCGCTGAAA GAACTAAACG ATGACGAATC GCTGTTGGTC
CAGTCAGGCA AGCCCGTGGG CGTATTCAAG ACCCACGCCA ACGCGCCTCG CGTGCTGATC
GCCAACTCCA ACCTGGTGCC CAAGTGGGCC AACTGGGAAC ACTTCAACGA ACTCGACCGC
AAGGGCCTCT TCATGTATGG CCAGATGACG GCCGGCAGCT GGATCTACAT CGGCAGCCAG
GGCATCGTGC AGGGCACCTT CGAGACCTTC GTCGAAGCCG GCCGCCAACA CTACAACAAC
AGCCTCGCTG GCAAATGGAT CCTGACCGCG GGCCTGGGCG GCATGGGGGG CGCGCAGCCG
CTGGCAGCAA CCTTGGCCGG CGCGGTGTCG CTGAACATCG AGTGCCAGCA AAGCAGCATC
GACTTCCGCC TGCGCACCCG CTACGTGGAC AAGCAGGCCA GGGACATCGA CGACGCGCTG
GCGCTGATCA AGTACCACAC CGACCGCAAG GAGGCCGTGT CGATCGCGCT GCTGGGCAAC
GCCGCCGACA TCCTGCCCGA GCTGGTGAAG CGCGCCAAAG CCGGCGCCAT CAAGCCCGAC
CTGGTGACCG ACCAGACCTC GGCCCACGAC CTGATCAACG GCTACCTGCC CCGCGGCTGG
AACGTGGCGC AGTGGAAGGC CGCGCAGCAG GATCCGGCGC AGCACAAGCG CCTGACGGAC
GAAGCCGCCC AAAGCTGCGC CGTGCATGTG CAGGCCATGC TGGACTTCCA GGCCATGGGC
ATCCCGACCG TGGACTACGG CAACAACATC CGCCAGGTGG CCTTCGACCA GGGCGTGAAA
AACGCCTTTG ATTTCCCCGG CTTTGTGCCC GCCTACATCC GCCCGCTGTT CTGCGAAGGC
AAGGGCCCGT TCCGCTGGGT CGCGCTCTCC GGCGATCCGG AAGACATCTA CAAGACCGAC
GCCAAGATCA AGGAGCTGTT CCCCGAGAAC AAGCACACGC ACCGCTGGCT CGACATGGCG
CGCGAGCGCA TCGCCTTCCA GGGCCTGCCC GCACGCATCT GCTGGCTGGG CCTGGGTGAA
CGCCACATTG CGGGTCTGGC CTTCAACGAG ATGGTGAAGA AGGGCGAGCT CAAGGCCCCG
ATCGTGATCG GCCGCGACCA CCTGGACACC GGCTCGGTGG CCAGCCCGAA CCGCGAGACC
GAGGCCATGA AGGACGGCAC CGATGCCGTG TCCGACTGGC CGCTGCTCAA CGCGCTGCTC
AATGCCTCGG GCGGCGCCAC CTGGGTCAGC CTGCACCACG GCGGCGGCGT GGGCATGGGT
TACTCGCAGC ATGCGGGCAT GGTCATCGTG GCCGACGGCA CCGATGCCGC CGCCGAGCGG
CTGGCACGCG TGCTGGTCAA CGACTGCGGC TCGGGCGTGA TGCGCCATGC CGACGCGGGT
TACGAGATCG CCATTGCCAC CGCGAAGAAG CAGGGCCTCA AGCTGCCGAT GGTCCGGTGA
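
A GC fraction like the one listed in the record can be recomputed from a nucleotide string. This is a minimal sketch: the helper name `gc_fraction` is hypothetical, and the sample input is only the first row of the gene sequence, so its value will not necessarily equal the whole-gene figure. Paste the full sequence to compare against the record.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)

# First row of the gene sequence, with the page's 10-base spacing kept.
first_row = ("ATGAACGCCA AGGATTCCTT CATCGCCCCC "
             "ACCGACGCCC AAAGCGGCAC CCTCACCGAC")

print(f"GC fraction of first 60 bases: {gc_fraction(first_row):.3f}")
```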
 
Protein sequence
MNAKDSFIAP TDAQSGTLTD PRVDPTRVIR APRGSQLHCK SWLTEAAFRM IQNNLDPEVA 
ENPQSLVVYG GIGRAARDWA CFDQILASLK ELNDDESLLV QSGKPVGVFK THANAPRVLI
ANSNLVPKWA NWEHFNELDR KGLFMYGQMT AGSWIYIGSQ GIVQGTFETF VEAGRQHYNN
SLAGKWILTA GLGGMGGAQP LAATLAGAVS LNIECQQSSI DFRLRTRYVD KQARDIDDAL
ALIKYHTDRK EAVSIALLGN AADILPELVK RAKAGAIKPD LVTDQTSAHD LINGYLPRGW
NVAQWKAAQQ DPAQHKRLTD EAAQSCAVHV QAMLDFQAMG IPTVDYGNNI RQVAFDQGVK
NAFDFPGFVP AYIRPLFCEG KGPFRWVALS GDPEDIYKTD AKIKELFPEN KHTHRWLDMA
RERIAFQGLP ARICWLGLGE RHIAGLAFNE MVKKGELKAP IVIGRDHLDT GSVASPNRET
EAMKDGTDAV SDWPLLNALL NASGGATWVS LHHGGGVGMG YSQHAGMVIV ADGTDAAAER
LARVLVNDCG SGVMRHADAG YEIAIATAKK QGLKLPMVR
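
The gene and protein sequences can be checked against each other by translating the nucleotide sequence. The sketch below builds a codon table in the usual NCBI ordering (bases T, C, A, G); translation table 11 assigns the same amino acids to codons as the standard table and differs only in its permitted start codons. The function name `translate` is hypothetical, and only the first 60 bases are used here to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First row of the gene sequence and the first 20 residues of the protein.
gene_prefix = ("ATGAACGCCA AGGATTCCTT CATCGCCCCC "
               "ACCGACGCCC AAAGCGGCAC CCTCACCGAC")
protein_prefix = "MNAKDSFIAP TDAQSGTLTD".replace(" ", "")

print(translate(gene_prefix))
print(translate(gene_prefix) == protein_prefix)
```

Pasting the full gene sequence into `gene_prefix` and the full protein into `protein_prefix` extends the same check to the whole record.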