Gene Acry_1749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcry_1749 
Symbol 
ID5159966 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidiphilium cryptum JF-5 
KingdomBacteria 
Replicon accessionNC_009484 
Strand
Start bp1930184 
End bp1931854 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content70% 
IMG OID640553666 
Producturocanate hydratase 
Protein accessionYP_001234872 
Protein GI148260745 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2987] Urocanate hydratase 
TIGRFAM ID[TIGR01228] urocanate hydratase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.132868 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCGTC GTCTCGACAA TGCGCGGACG ATCCGCGCGC CGCATGGCAG CGATCTCAGC 
GCCCGGTCCT GGCTGACCGA GGCGCCGCTC AGGATGCTGA TGAACAACCT CGACCCCGAT
GTCGCGGAGC GGCCGGAGGA ACTCGTCGTC TATGGTGGGA TCGGCCGGGC GGCGCGGGAC
TGGGAGAGTT TCGACCGCAT TCTCGGCGCG CTGCGCGACC TCGGGCCGGA GGAGACGCTG
CTGGTGCAGT CGGGCAAGCC GGTCGGCGTG TTCCGCACCC ATGCGGACGC GCCGCGCGTG
CTGATCGCCA ATTCCAACCT GGTGCCGAAC TGGGCGAACT GGCAGCATTT CCACGAACTC
GATCGCAAGG GCCTGATGAT GTACGGCCAG ATGACGGCGG GGTCGTGGAT CTATATCGGC
TCGCAGGGCA TCGTGCAGGG CACGTATGAG ACCTTCGTCG AGGCCGGGCG GCAGCATTAC
GGCGGCGATC TCGCCGGGCG GTGGATCCTG ACCGGCGGGC TCGGCGGCAT GGGCGGGGCG
CAGCCGCTGG CGGCGACGAT GGCCGGGGCC TCGATGATCG CGGTGGAGTG CACGCCCTCG
CGCATCGAGA TGCGGCTGCG CACGCGCTAT CTCGACCGCC GGGCCGACAC GCTCGACGAG
GCGCTGGCGA TGCTGGAGGC GGCGAAGCGC GACGGAAAAC CGGTCTCGAT CGGGCTGCTC
GGCAATGCCG CGGAGGTGTT TCCCGAACTC GTCCGCCGCG GCATCCGCCC GGATATCGTG
ACCGACCAGA CCTCGGCGCA TGATCCGGTG AACGGCTACC TGCCGGCGGG ATGGACGCTG
GAGCACTGGG CGGCGATGCG CGAGCGCGAC CCCGAGGCGG TGGCGCTGGC GGCGAAGCGG
TCGATGGCCG GGCAGGTGCG GGCGATGCTC GATTTCTGGC GGATGGGGAT TCCGGTGCTC
GACTACGGCA ACAACATCCG CGCCATGGCG CAGGAGATGG GGGTGGCCGA CGCGTTCGAC
TTTCCGGGCT TCGTGCCGGC CTATATCCGG CCGCTGTTCT GCCGGGGGAT CGGGCCGTTC
CGCTGGGCGG CGCTGTCGGG CGACCCCGAG GACATCTACC GGACGGACGC GAAGGTGAAG
GAGCTGATCC CCGACGATCC GCATCTGCAT CACTGGCTCG ACATGGCGCG CGAGTGGATC
GCCTTCCAGG GCCTGCCGGC GCGGATCTGC TGGGTGGGGC TCGGGCAGCG GCACCGGCTC
GGCCTCGCGT TCAACGAGAT GGTGGCGCGG GGCGAGCTGT CGGCGCCGGT GGTGATCGGG
CGGGACCATC TCGATTCCGG CTCGGTCGCC AGCCCCAACC GCGAGACCGA GGCGATGCGC
GACGGGTCGG ACGCGGTGTC GGACTGGCCG CTGCTCAACG CGCTGCTCAA CTGCGCCTCG
GGGGCGACCT GGGTCTCGCT GCATCACGGC GGCGGGGTCG GGATGGGGTA TTCGCAGCAT
GCGGGGATGG TGATCGTCGC CGACGGCACC GAGGCGGCGG CGAAGCGGCT GGAGCGGGTG
CTGTGGAACG ACCCGGCGAC CGGGGTGATG CGCCATGCCG ATGCCGGCTA CGACATCGCC
ATCGACTGCG CGCGGGAGAA CGGGCTGAAC CTGCCGGGGA TCACCGGCTG A
 
Protein sequence
MNRRLDNART IRAPHGSDLS ARSWLTEAPL RMLMNNLDPD VAERPEELVV YGGIGRAARD 
WESFDRILGA LRDLGPEETL LVQSGKPVGV FRTHADAPRV LIANSNLVPN WANWQHFHEL
DRKGLMMYGQ MTAGSWIYIG SQGIVQGTYE TFVEAGRQHY GGDLAGRWIL TGGLGGMGGA
QPLAATMAGA SMIAVECTPS RIEMRLRTRY LDRRADTLDE ALAMLEAAKR DGKPVSIGLL
GNAAEVFPEL VRRGIRPDIV TDQTSAHDPV NGYLPAGWTL EHWAAMRERD PEAVALAAKR
SMAGQVRAML DFWRMGIPVL DYGNNIRAMA QEMGVADAFD FPGFVPAYIR PLFCRGIGPF
RWAALSGDPE DIYRTDAKVK ELIPDDPHLH HWLDMAREWI AFQGLPARIC WVGLGQRHRL
GLAFNEMVAR GELSAPVVIG RDHLDSGSVA SPNRETEAMR DGSDAVSDWP LLNALLNCAS
GATWVSLHHG GGVGMGYSQH AGMVIVADGT EAAAKRLERV LWNDPATGVM RHADAGYDIA
IDCARENGLN LPGITG