Gene Caci_0253 details

Gene Information

Locus tag: Caci_0253
Symbol:
ID: 8331580
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 284332
End bp: 286002
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 70%
IMG OID: 644953420
Product: urocanate hydratase
Protein accession: YP_003111047
Protein GI: 256389483
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2987] Urocanate hydratase
TIGRFAM ID: [TIGR01228] urocanate hydratase
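
The coordinate and length fields above are mutually consistent, which a short check can confirm. The following is a minimal sketch in Python, assuming the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the database record.

```python
# Values copied from the gene information fields above.
start_bp = 284332
end_bp = 286002

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of three bases each.
codon_count, leftover_bases = divmod(gene_length_bp, 3)

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length_aa = codon_count - 1

print(f"Gene length: {gene_length_bp} bp")
print(f"Codons: {codon_count} (leftover bases: {leftover_bases})")
print(f"Protein length: {protein_length_aa} aa")
```

Under these assumptions the computed values match the listed gene length of 1671 bp and protein length of 556 aa.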


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.242309
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGCGA CCAGTGGTCC GCGCCCGGTC CGCGCCGCCC GCGGCACGAG CATCACCGCG 
CAGGGCTGGC AGCAGGAAGC CGCTCTGCGG ATGCTGATGA ACAACCTCGA CCCGGAGGTC
GCCGAGCACC CCGACGAACT GGTCGTCTAC GGCGGCACCG GCAAGGCGGC GCGCAACTGG
CCGTCCTTCG ACGCGATGGT GCGCACCCTG CAGACGCTGA AGAACGACGA GACGATGCTG
GTGCAGTCCG GCAAGCCGGT CGGCGTCATG CAGACCCACG AGTGGGCGCC GCGCGTTCTG
CTCGCCAACT CCAACCTGGT CGGCGACTGG GCGAACTGGG AGGAGTTCCG GCGCCTGGAC
GCCTTGGGCC TGACCATGTA CGGGCAGATG ACCGCCGGCT CCTGGATCTA CATCGGCACG
CAGGGCATCT TGCAGGGTAC GTACGAGACG TTCGCCGCAG TCGCCGCCAA GAAGTTCAAC
GACACCCTGG CCGGGACCAT CACCCTGACC GCGGGTCTGG GCGGCATGGG CGGCGCGCAG
CCGCTGGCCG TCACCATGAA CGGCGGCGTG GCGATCTGCG TCGACTGCGA CGAGCGCTCC
ATCGACCGCC GCGTCGAGCA CCGCTACCTG GATGTGAAGG CGAACTCGCT GGACCACGCG
CTGCAGTTGG CCACTGAGGC CCGCGACAAG CGCGAGGCGC TGTCGATCGG CGTCCTGGGC
AACGCCGCCG AGCTGGTCCC GCGGCTGCTG GCGATGGACG CGCCGATCGA CATCGTCACC
GACCAGACCT CGGCGCACGA CCCGCTGGCG TACCTGCCGC TGGGCATGGA CTTCCACGAC
ATGAAGCAGT TCGCGAAGGA CAAGCCGGCC GAGTTCACGC AGCGGGCGCG CGAATCGATG
GCCAAGCACG TCGAGGCGAT GGTCGGCTTC CAGGACAAGG GCGCCGAGGT CTTCGACTAC
GGCAACTCCA TCCGCGGCGA GGCGCAGCTG GCCGGATACA CGCGCGCCTT CGACTTCCCC
GGCTTCGTGC CGGCGTATAT CCGTCCCCTG TTCTGCGAGG GCAAGGGCCC GTTCCGCTGG
GCCGCGCTGT CGGGTGAGGC ATCCGACATC GCGAAGACGG ACAAGGCGAT TCTGGAGCTG
TTCCCGGAGA ACGAGTCGCT GGCGCGCTGG ATCAAGATGG CCGGCGAGCG CGTGCACTTC
CAGGGTCTGC CGGCGCGCAT CTGCTGGCTC GGCTACGGCG AGCGCGACAA GGCCGGCGCG
CGTTTCAACG ACATGGTCGC CGACGGCACG CTGGCCGCGC CGATCGTCAT CGGGCGCGAC
CACCTGGACG CCGGGTCCGT GGCCTCGCCG TACCGCGAGA CCGAGGCGAT GGCCGACGGC
TCCGACGCGA TCGCGGACTG GCCGCTGCTG AACGCGATGG TGAACGTCGC CTCCGGCGCC
TCGTGGGTGT CGATCCACCA CGGCGGCGGC GTCGGCATGG GCCGCTCGAT CCACGCCGGG
CAGGTGACCG TCGCCGACGG CACGAAGCTG GCCGGGGAGA AGGTCCGCAG GGTTCTGACC
AACGACCCGG GCATGGGGGT GATCCGGCAC GTGGACGCCG GGTACGACCG TGCCGACGAG
GTCGCCGACG AGCGTGGTGT ACGCGTGCCG ATGCGTGAAG GTGACGCGTA A
 
Protein sequence
MTATSGPRPV RAARGTSITA QGWQQEAALR MLMNNLDPEV AEHPDELVVY GGTGKAARNW 
PSFDAMVRTL QTLKNDETML VQSGKPVGVM QTHEWAPRVL LANSNLVGDW ANWEEFRRLD
ALGLTMYGQM TAGSWIYIGT QGILQGTYET FAAVAAKKFN DTLAGTITLT AGLGGMGGAQ
PLAVTMNGGV AICVDCDERS IDRRVEHRYL DVKANSLDHA LQLATEARDK REALSIGVLG
NAAELVPRLL AMDAPIDIVT DQTSAHDPLA YLPLGMDFHD MKQFAKDKPA EFTQRARESM
AKHVEAMVGF QDKGAEVFDY GNSIRGEAQL AGYTRAFDFP GFVPAYIRPL FCEGKGPFRW
AALSGEASDI AKTDKAILEL FPENESLARW IKMAGERVHF QGLPARICWL GYGERDKAGA
RFNDMVADGT LAAPIVIGRD HLDAGSVASP YRETEAMADG SDAIADWPLL NAMVNVASGA
SWVSIHHGGG VGMGRSIHAG QVTVADGTKL AGEKVRRVLT NDPGMGVIRH VDAGYDRADE
VADERGVRVP MREGDA
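
The GC content reported in the gene information can be recomputed from the gene sequence. The following is a minimal sketch in Python, assuming the sequence is pasted as plain text in the 10-base blocks used above. The names clean_sequence and gc_fraction are hypothetical helpers, not part of any database tool. The record does not state how its GC figure was rounded, so the example uses a short made-up fragment rather than asserting a value for this gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in ("G", "C"))
    return gc_count / len(seq)


# Illustrative fragment only; paste the full gene sequence here to
# recompute the GC content for Caci_0253.
example = clean_sequence("ATGC GGCC")
print(len(example), gc_fraction(example))
```

The same clean_sequence helper can be used to confirm that the pasted gene sequence has the expected length before computing the fraction.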