Gene Namu_0040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_0040 
Symbol 
ID8445619 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp46106 
End bp47761 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content71% 
IMG OID645039191 
Producturocanate hydratase 
Protein accessionYP_003199467 
Protein GI258650311 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2987] Urocanate hydratase 
TIGRFAM ID[TIGR01228] urocanate hydratase 


Plasmid Coverage information

Num covering plasmid clones68 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGGTG CCCGTCCGGT CCGCGCCCCC CGGGGGACCA GCCTGACCGC CAAGTCCTGG 
ACCACCGAGG CTCCGCTGCG GATGCTGATG AACAACCTCG ATCCGGAGAA CGCCGAGCGA
CCCGATGATC TGGTCGTTTA CGGCGGCACC GGCAAGGCCG CGCGGGACTG GAACTCGTTC
GACGCGATGG TCCGCACGCT GACCACGCTG GAGGCCGACG AGACCATGCT GGTCCAGTCC
GGCCGGCCGG TCGGGGTCAT GCGCACCCAC GAGTGGGCGC CGCGGGTCAT CCTGGCCAAC
TCCAACCTGG TGCCGGACTG GGCCAACTGG CCCGAGTTTC GCCGGCTGGA GAAGCTGGGC
CTGACCATGT ACGGCCAGAT GACGGCCGGT TCGTGGATCT ACATCGGCTC CCAGGGCATC
GTGCAGGGCA CCTACGAGAC GTTCGCCGCC ATCGCCGAGA AGCGGTTCAA CGGCACGCTG
GCCGGCACCC TCACGCTGAC CGGCGGAGCC GGCGGCATGG GTGGCGCCCA GCCGCTGGCC
GTCACCCTCA ACGGCGGCGC CTGCCTGATC GTCGACGTCG ACGAGTCCCG ACTGCAGCGC
CGGGTCGAGC ACCGCTACCT GGACGAGATC GCGGTCGACA TCGACGACGC CATCGCCAAG
TCGCTGCAGG CCAAGGCGCA GCGCAAGGCC TGGTCGGTGG GTCTGGTCGG CAACTGCGCG
GAGGTCTTTC CCGAGCTGCT GCGGCGCGGG GTGGACATCG ACATCGTCAC CGACCAGACC
AGCGCGCACG ACCCGCTGTC GTACCTGCCG GCCGGCGTCT CGATCGAGGA CTGGCCCGAC
TATGCCGAGC GCAAGCCCGA GGAGTTCACC GACCGGGCCC GGGAGTCGAT GGCCCGCCAC
GTCGAGGCGA TGGTCGGCTT CCAGGACGCC GGCGCAGAGG TCTTCGACTA CGGCAACAGC
ATCCGGGACG AGGCCCGGCA GGGGGGCTAC GAGCGGGCCT TCGACTTCCC CGGCTTCGTA
CCGGCCTACA TCCGGCCGCT GTTCTGCCAG GGCAAGGGGC CGTTCCGGTG GGCTGCGCTG
TCCGGCGACC CGAAGGACAT CTACGCCACC GACCAGGCGG TGATGGATCT GTTCCCGGAC
AACGACCGCC TGCAGAAGTG GATGCGGGGG GCCCGCGAGA AGATCAGCTT CCAGGGTCTG
CCGGCGCGGA TCTGCTGGCT GGGCTACGGC GAGCGGGACC GGGCCGGCCT GCGGTTCAAC
GAGATGGTCG CGTCCGGCGA GCTGTCCGCG CCGATCGTCA TCGGCCGGGA CCACCTGGAC
TGCGGGTCGG TCGCCTCGCC CTACCGGGAG ACCGAGTCGA TGGCCGACGG GTCGGACGCG
ATCGCCGACT GGCCGCTGCT CAACGCGCTG ATCAACACGG CCAGCGGCGC GTCCTGGGTG
TCCATCCATC ACGGCGGCGG CGTCGGCATC GGCCGGTCCA TCCACGCCGG CCAGGTCTCG
CTGGCCGACG GCACGGCGCT GGCCGCCGAG AAGCTGGCCC GGGTGCTGAC CAACGACCCG
GGCATGGGCG TGATCCGGCA CGTGGACGCC GGGTACGAGC TGGCCGAGCA GGTCGCCGCC
GACCAGGGCG TGCGCATCCC GATGAAGGAG GGCTGA
 
Protein sequence
MEGARPVRAP RGTSLTAKSW TTEAPLRMLM NNLDPENAER PDDLVVYGGT GKAARDWNSF 
DAMVRTLTTL EADETMLVQS GRPVGVMRTH EWAPRVILAN SNLVPDWANW PEFRRLEKLG
LTMYGQMTAG SWIYIGSQGI VQGTYETFAA IAEKRFNGTL AGTLTLTGGA GGMGGAQPLA
VTLNGGACLI VDVDESRLQR RVEHRYLDEI AVDIDDAIAK SLQAKAQRKA WSVGLVGNCA
EVFPELLRRG VDIDIVTDQT SAHDPLSYLP AGVSIEDWPD YAERKPEEFT DRARESMARH
VEAMVGFQDA GAEVFDYGNS IRDEARQGGY ERAFDFPGFV PAYIRPLFCQ GKGPFRWAAL
SGDPKDIYAT DQAVMDLFPD NDRLQKWMRG AREKISFQGL PARICWLGYG ERDRAGLRFN
EMVASGELSA PIVIGRDHLD CGSVASPYRE TESMADGSDA IADWPLLNAL INTASGASWV
SIHHGGGVGI GRSIHAGQVS LADGTALAAE KLARVLTNDP GMGVIRHVDA GYELAEQVAA
DQGVRIPMKE G