Gene Noca_4385 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4385 
Symbol 
ID4596903 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4636438 
End bp4637613 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content75% 
IMG OID639778995 
Productimidazolonepropionase 
Protein accessionYP_925569 
Protein GI119718604 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID[TIGR01224] imidazolonepropionase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCAAGCA CCCTGATCAC GAACATCGGC GAGCTCGTCA CCAACGACCC CGAGGCCGCC 
GACGACGGCC GGGGGCTGCT CGGGATCGTC GAGCAGGCCG CGCTCGTGAT CGACGGCTCG
ACGGTCGCCT GGGTGGGCCG CGCCGCCGAC GCCCCGGACG CCGACCAGCT CGTCGACGCC
GGCGGCCGGG CGGTGCTGCC CGGCTTCGTG GACAGTCACA GTCATCTGGT CTTCGCCGGC
GACCGGGCCG CGGAGTTCGC GGCGCGGATG GCCGGGACGC CGTACGCCGC CGGCGGCATC
CGCACGACGG TGGCCGCCAC CCGGGCCGCC ACCGACGAGC AGCTCACCAG CCACGTCGCC
CGGCTGGTCG AGGAGATGCG CCGCCAGGGC ACGACCACGG TCGAGATCAA GAGCGGCTAC
GGGCTGACCG TCCACGACGA GGCCCGCAGC CTCGCGGTCG CGCGGCAGTT CACCGAGGAG
ACGACGTTCC TCGGGGCGCA CGTCGTGCCC GACGGCGACC CGGGGGAGTA CGTCGACCTG
GTCACCGGCC CGATGCTCGA CGCGGCCCGG GAGCACGCCC GCTGGATCGA CGTGTTCTGT
GAGCGCGGCG CCTTCGACGC CGACCAGGCC CGGGCGATCC TGGACGCCGG CGCGGCGGCC
GGACTGCGCG GCCGGCTGCA CGCCAACCAG CTGACCTACG GAGAGGGCGT CCGGCTCGCC
GCCGAGCTCG GCCTGGTCGC GGTCGACCAC TGCACCTACC TGGCCGACGA GGACGTCGCC
GCGCTGCGCG ACAGCGGCAC CATCGCCACG CTGCTGCCCG GGGTCGAGTT TTCGACCCGG
CAGCCCTATC CCGACGCCCG CCGCCTCCTC GACGCGGGCG TCCGGGTGGC TCTGGCCAGC
GACTGCAACC CCGGGTCCTG CTTCACCAGC TCGATCCCGC TGTGCATCGC GCTCGCCGTC
CGTGAGATGG GGATGACCCC CGCGGAGGCG GTGCACGCGG CGACCTACCG CGGTGCCCAG
GCCCTGGACC GCGACGGGCA GCACGGGATC GGCGCGCTCG TGCCCGGGCG CCGCGCCGAC
CTCGCGGTGC TCGACGCCCC CTCCCACGTC CACCTCGCCT ACCGCCCCGG CGTCCCTCTC
GTCCGCCAGA CCTGGGTCGC CGGCCGTCCG CTGTAA
 
Protein sequence
MASTLITNIG ELVTNDPEAA DDGRGLLGIV EQAALVIDGS TVAWVGRAAD APDADQLVDA 
GGRAVLPGFV DSHSHLVFAG DRAAEFAARM AGTPYAAGGI RTTVAATRAA TDEQLTSHVA
RLVEEMRRQG TTTVEIKSGY GLTVHDEARS LAVARQFTEE TTFLGAHVVP DGDPGEYVDL
VTGPMLDAAR EHARWIDVFC ERGAFDADQA RAILDAGAAA GLRGRLHANQ LTYGEGVRLA
AELGLVAVDH CTYLADEDVA ALRDSGTIAT LLPGVEFSTR QPYPDARRLL DAGVRVALAS
DCNPGSCFTS SIPLCIALAV REMGMTPAEA VHAATYRGAQ ALDRDGQHGI GALVPGRRAD
LAVLDAPSHV HLAYRPGVPL VRQTWVAGRP L