Gene Arth_0374 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0374 
Symbol 
ID4447168 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp395948 
End bp397642 
Gene Length1695 bp 
Protein Length564 aa 
Translation table11 
GC content69% 
IMG OID639688170 
Producturocanate hydratase 
Protein accessionYP_829875 
Protein GI116668942 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2987] Urocanate hydratase 
TIGRFAM ID[TIGR01228] urocanate hydratase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCACCCG CCGATTTCAC CACCGGTGCC CGCCCGGTCA AAGCAGCCCG CGGCACCGAG 
CTCACCGCCA AGAGCTGGCA GACGGAAGCC CCGCTGCGCA TGCTCATGAA CAACCTGGAC
CCTGAGGTGG CCGAGCGCCC TGACGACCTG GTGGTCTACG GCGGCACCGG CCGCGCGGTC
CGCAGCTGGG CCGCGTTCGA CGCCATCACC CGCACCCTGG AAACCATGGA AAAGGACGAG
ACCCTGCTGG TTCAGTCCGG CAAGCCGGTA GGTGTCTTCC GCACCAACGA ATGGGCGCCG
CGCGTGCTCC TGGCCAACTC TAACCTCGTC GGCGACTGGG CCAACTGGCC CGAATTCCGC
CGCCTCGAGG CCGAGGGCCT CATGATGTAC GGCCAGATGA CCGCCGGCTC CTGGATCTAC
ATCGGCACCC AGGGCATCCT GCAGGGCACC TTTGAAACCT TCGCCGCGAT CGCCCGCAAA
CTCACAGGGG ACGAGAACGG CACCCTCGCC GGCACGCTGA CCCTCACGGG CGGCTGCGGC
GGCATGGGCG GCGCCCAGCC CCTCGCCGTC ACCCTGAACG AGGGCGCCTG CCTGATTGTC
GACGTCGATG AGACCCGCCT GCGCCGCCGC GCCGGCAAAC GCTACCTTGA CGAGGTGGAA
ACGGACCTCG ACGCCGCGAT CGCCAAGGTG CTCAAGGCCA AGGAAGAGCG CCGCGGCTGG
TCCGTGGGCT ACGTCGGCAA CGCGGCCGAG GTCTTCCCGG AGATCCTGCG CCGCCACAAC
GCCGGCGAGC TCACCGTGGA CATCGTCACG GACCAGACCT CCGCGCACGA TCCGCTGAGC
TACCTGCCCG AAGGCATCAC GGTGGAGGAA TGGCACCGCG AGGCCGCGGC CGACCCCGAA
GGCTTCACCA AGAAGGCCCA GGCCTCGATG GCCAAGCACG TCCAGGCCAT GGTGGAGTTC
CAGGATGCCG GCGCCGAGGT CTTCGACTAC GGCAACTCCA TCCGCGACGA GGCCCGCAAG
GGCGGCTACA ACCGCGCCTT CGAATTCCCC GGCTTCGTCC CGGCCTACAT CCGTCCGTTG
TTCTGCGAAG GACTTGGCCC GTTCCGCTGG GTGGCCCTCT CCGGTGACCC CGAAGATATC
CGCGTCACCG ATGAGGCCAT CAAGGAACTG TTCCCGGAGA ACAAGCACCT GCACCGCTGG
ATCGACGCCG CCCAGGAGCG GGTCGAGTTC GAAGGCCTGC CGGCCCGCAT CTGCTGGCTG
GGCTACGGTG AACGTGCCAA GGCCGGCCTG CTGTTCAACC AGCTCGTCAA GGAAGGCAAG
GTCAAGGCGC CCATCGTGAT CGGCCGTGAC CACCTCGACT CCGGCTCCGT CGCCTCCCCG
TACCGCGAGA CCGAGGCCAT GGCCGACGGC TCCGACGCGA TCGCCGACTG GCCGCTGCTC
AACGCCCTGC TCAACACCGC CTCCGGCGCC ACCTGGGTCT CCATCCACCA CGGCGGCGGC
GTCGGCATCG GCCGCTCCAT CCACGCCGGG CAGGTCTCCG TCGCGGACGG CACCGACCTC
GCCGCGGAAA AGCTCGAACG CCTGCTCACC AACGACCCCG GCATGGGCGT CATCCGCCAC
GCCGACGCCG GCTACGACCG CGCCGTCGAG GTCGCCAAGG AACGCGGCGT CCGCATCCCC
ATGAACGAAA AGTAG
 
Protein sequence
MAPADFTTGA RPVKAARGTE LTAKSWQTEA PLRMLMNNLD PEVAERPDDL VVYGGTGRAV 
RSWAAFDAIT RTLETMEKDE TLLVQSGKPV GVFRTNEWAP RVLLANSNLV GDWANWPEFR
RLEAEGLMMY GQMTAGSWIY IGTQGILQGT FETFAAIARK LTGDENGTLA GTLTLTGGCG
GMGGAQPLAV TLNEGACLIV DVDETRLRRR AGKRYLDEVE TDLDAAIAKV LKAKEERRGW
SVGYVGNAAE VFPEILRRHN AGELTVDIVT DQTSAHDPLS YLPEGITVEE WHREAAADPE
GFTKKAQASM AKHVQAMVEF QDAGAEVFDY GNSIRDEARK GGYNRAFEFP GFVPAYIRPL
FCEGLGPFRW VALSGDPEDI RVTDEAIKEL FPENKHLHRW IDAAQERVEF EGLPARICWL
GYGERAKAGL LFNQLVKEGK VKAPIVIGRD HLDSGSVASP YRETEAMADG SDAIADWPLL
NALLNTASGA TWVSIHHGGG VGIGRSIHAG QVSVADGTDL AAEKLERLLT NDPGMGVIRH
ADAGYDRAVE VAKERGVRIP MNEK