Gene Caul_1295 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1295 
Symbol 
ID5898750 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1366640 
End bp1368304 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content70% 
IMG OID641561780 
Producturocanate hydratase 
Protein accessionYP_001682923 
Protein GI167645260 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2987] Urocanate hydratase 
TIGRFAM ID[TIGR01228] urocanate hydratase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.0784383 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0497778 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCGCC TCGACAACAC CCGCGTCATC CGCCCCGCCA CCGGAACGGA ACTCACCGCC 
AAGAGCTGGC TGACCGAAGC CCCGCTGCGG ATGCTGATGA ACAACCTGCA CCCCGACGTC
GCCGAGCGGC CCGAGGAGCT GGTGGTCTAT GGCGGCATCG GCCGGGCGGC GCGGGACTGG
GAAAGCTACG ACAAGATCGT CGAGACCCTG CGCCGGCTGG AGGACGACGA GACCCTGCTG
GTCCAGTCGG GCAAGCCGGT CGGGGTGTTC AAGACCCACC CCGACGCGCC GCGCGTGCTG
ATCGCCAACT CCAACCTCGT GCCGCGCTGG GCGACCTGGG AGCATTTCAA CGAGCTCGAT
CGCAAGGGCC TGGCCATGTA CGGCCAGATG ACCGCCGGCT CGTGGATCTA TATCGGCGCC
CAGGGCATCG TGCAGGGCAC CTACGAGACC TTCGTCGAGA TGGGCCGCCA GCATCACGGC
GGCGACCTGG CGGGCAAATG GCTGCTGACC GCGGGCCTGG GCGGCATGGG CGGGGCCCAG
CCGCTGGCGG CGGTGATGGC CGGCGCCTCG TGCCTGGCCA TCGAGTGCCA GCCGTCGCGG
ATCGAGATGC GCCTGCGCAC CGGCTATCTG GACAAGGCCA CCGAGCGCCT CGACGAGGCC
CTGGCCTGGA TCGCCGAGGC CAACGCGGCC AAGGCCCCGG TCTCGGTCGG CCTGCTGGGC
AACGCCGCCG AATTGCTGCC GGCCATGTTC GCGGCCGGCG TCCGCCCCGA CCTGCTGACC
GACCAGACCA GCGCCCACGA CCCGATCAAC GGCTACCTGC CGGCCGGCTG GACCCTGGAT
CAGTGGGCGA CCGCCAAGGA GCGCGAGCCG GAAACCGTCA ACCGCGCCGC CCGCGCCTCG
ATGGCCGTGC ACGTCCAGGC GATGCTCGAC TTCCAGGCCG CCGGCGTACC CACGGTCGAC
TACGGCAACA ACATCCGCCA GATGGCGCTG GAGGAAGGCG TCAAGAACGC CTTCGACTTC
CCCGGCTTCG TGCCGGCCTA TATCCGGCCG CTGTTCTGCC GGGGGATCGG GCCGTTCCGC
TGGGCGGCGC TGTCGGGCGA TCCGGAAGAC ATCGCCAAGA CCGACGCCAA GGTCAAGGAA
CTGATCCCCG ACAATCCCCA CCTGCACCAC TGGCTGGACA TGGCGGCCGA GAAGATCAAG
TTCCAAGGTC TTCCCGCTCG CATCTGCTGG GTCGGCCTGG GCGATCGCCA CAGGCTGGGC
CTGGCCTTCA ACGCGATGGT CGCCAGCGGC GAGTTAAAGG CCCCGGTGGT GATCGGCCGC
GACCACCTGG ACAGCGGCTC GGTCGCCTCG CCCAACCGCG AGACGGAAGC GATGATGGAC
GGCTCGGACG CGGTGTCGGA CTGGCCGCTG CTGAACGCCC TGCTCAATAC GGCGTCCGGC
GCCACCTGGG TGTCGCTACA CCATGGCGGC GGGGTCGGTA TGGGCTTCTC ACAGCACGCG
GGCATGGTCA TCGTCGCCGA CGGCACCGAA GCCGCCGCCA AGCGGCTGGC GCGGGTGCTG
TGGAACGACC CGGCCTCCGG CGTCATGCGC CACGCCGACG CCGGCTACGA GATCGCCAAG
GCCTGCGCCC GGGAACACGG GCTGGATTTG CCTGGCATAC TGTAG
 
Protein sequence
MTRLDNTRVI RPATGTELTA KSWLTEAPLR MLMNNLHPDV AERPEELVVY GGIGRAARDW 
ESYDKIVETL RRLEDDETLL VQSGKPVGVF KTHPDAPRVL IANSNLVPRW ATWEHFNELD
RKGLAMYGQM TAGSWIYIGA QGIVQGTYET FVEMGRQHHG GDLAGKWLLT AGLGGMGGAQ
PLAAVMAGAS CLAIECQPSR IEMRLRTGYL DKATERLDEA LAWIAEANAA KAPVSVGLLG
NAAELLPAMF AAGVRPDLLT DQTSAHDPIN GYLPAGWTLD QWATAKEREP ETVNRAARAS
MAVHVQAMLD FQAAGVPTVD YGNNIRQMAL EEGVKNAFDF PGFVPAYIRP LFCRGIGPFR
WAALSGDPED IAKTDAKVKE LIPDNPHLHH WLDMAAEKIK FQGLPARICW VGLGDRHRLG
LAFNAMVASG ELKAPVVIGR DHLDSGSVAS PNRETEAMMD GSDAVSDWPL LNALLNTASG
ATWVSLHHGG GVGMGFSQHA GMVIVADGTE AAAKRLARVL WNDPASGVMR HADAGYEIAK
ACAREHGLDL PGIL