Gene Noca_4397 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4397 
Symbol 
ID4596915 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4648593 
End bp4650260 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content72% 
IMG OID639779007 
Producturocanate hydratase 
Protein accessionYP_925581 
Protein GI119718616 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2987] Urocanate hydratase 
TIGRFAM ID[TIGR01228] urocanate hydratase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.458241 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACCACTC CCGCCAACCC GCGCCTGCCC ATCCACGCCG CGCACGGCAC CGAGCTGACC 
GCCCGGTCCT GGCAGACCGA GGCCCCGCTG CGGATGCTGA TGAACAATCT CGACCCCGAG
AACGCCGAGC GCCCCGAGGA CCTCGTCGTG TACGGCGGCA CCGGTCGCGC GGCCCGGTCC
TGGGAGGCCT ACGACGCGCT GGTGCGCACG CTGACCACGC TCGGTGACGA CGAGACGATG
CTGGTGCAGT CCGGCAAGCC GGTTGGCGTG ATGAGGACGC ACGAGTGGGC GCCGCGGGTG
CTGATCGCGA ACTCCAACCT GGTCGGCGAC TGGGCCAACT GGGAGGAGTT CCGCCGGCTC
GAGGACCTCG GCCTGACCAT GTACGGCCAG ATGACCGCGG GCTCGTGGAT CTACATCGGC
ACCCAGGGCA TCCTCCAGGG CACCTTCGAG ACGTTCGCGG CCGTCGCCGA CAAGAGGTTC
GGCGGGACCC TCGCCGGGAC CATCACCGTG ACCGCCGGCC TGGGTGGCAT GGGCGGCGCG
CAGCCGCTGG CGGTCACGAT GAACGACGGC GTGGTGATCT GCGTCGAGTG CGACCCCGAG
CGGATCCGGC GGCGTATCGA CCACCGCTAC CTCGACGTCG AGGCGCCGTC GCTCGAGGCG
GCCGTGGCGC TCGCGGTCGA GGCGCGCGAC GAGCGGCGGC CGCTGTCGAT CGGGCTGCTC
GGCAACGCCG CCGAGGTGCT GCCGCGGATC CTCGAGACCG AGGTGCCGGT GGACATCGTC
ACCGACCAGA CCTCGGCGCA CGACCCGCTC TACTACCTGC CCGTCGGCGT CCCCTTCGAG
GAGTGGGCGG CCCGGCGGGA GGCCGACCCG GAGGGTTTCA CCAAGGAGGC GCGGGCCTCG
ATGGCGGCGC ACGTGCGGGC CATGGTCGAG CTCCAGGACC GCGGGGCCGA GGTCTTCGAC
TACGGCAACT CGATCCGCGA CGAGGCCCGC AAGGGTGGCT ACGACCGGGC CTTCGAGTTC
CCCGGCTTCG TGCCCGCCTA CATCCGGCCG CTGTTCTGCG AGGGGAAGGG GCCGTTCCGG
TGGGCCGCGC TGTCCGGCGA CCCGGCCGAC ATCGCCGCCA CCGACCGCGC CATCCTCGAG
CTGTTCCCCG CCAACGAGCG GCTGCGGAAG TGGATCACGA TGGCCGGTGA GCGGGTGCAC
TTCCAGGGGC TGCCGGCGCG GATCTGCTGG CTCGGGTACG GCGAGCGCCA CCTCGCCGGC
CTGCGGTTCA ACGAGATGGT GGCCTCCGGT GAGCTGAAGG CCCCGATCGT GATCGGCCGC
GACCACCTGG ACTGCGGCTC CGTGGCCTCG CCGTACCGCG AGACCGAGGG CATGCTCGAC
GGCTCGGACG CCATCGCGGA CTGGGCCGTT CTCAACGCCC TGGTCAACAC CGCGTCCGGC
GCCAGCTGGG TCTCCTTCCA CCACGGCGGC GGCGTCGGCA TCGGCCGGTC CCTGCACGCC
GGCCAGGTCT GCGTCGCCGA CGGCACCGAC CTCGCCGCGC AGAAGATCGA GCGGGTCCTC
ACCAACGACC CCGGCATGGG CGTGATCCGC CACGTCGACG CCGGCTACGA CCGCGCCGCC
GAGGTCGCCC GCGAGCGCGG CGTCCGGATC CCGATGTCCG AGGGCTGA
 
Protein sequence
MTTPANPRLP IHAAHGTELT ARSWQTEAPL RMLMNNLDPE NAERPEDLVV YGGTGRAARS 
WEAYDALVRT LTTLGDDETM LVQSGKPVGV MRTHEWAPRV LIANSNLVGD WANWEEFRRL
EDLGLTMYGQ MTAGSWIYIG TQGILQGTFE TFAAVADKRF GGTLAGTITV TAGLGGMGGA
QPLAVTMNDG VVICVECDPE RIRRRIDHRY LDVEAPSLEA AVALAVEARD ERRPLSIGLL
GNAAEVLPRI LETEVPVDIV TDQTSAHDPL YYLPVGVPFE EWAARREADP EGFTKEARAS
MAAHVRAMVE LQDRGAEVFD YGNSIRDEAR KGGYDRAFEF PGFVPAYIRP LFCEGKGPFR
WAALSGDPAD IAATDRAILE LFPANERLRK WITMAGERVH FQGLPARICW LGYGERHLAG
LRFNEMVASG ELKAPIVIGR DHLDCGSVAS PYRETEGMLD GSDAIADWAV LNALVNTASG
ASWVSFHHGG GVGIGRSLHA GQVCVADGTD LAAQKIERVL TNDPGMGVIR HVDAGYDRAA
EVARERGVRI PMSEG