Gene Noca_1417 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_1417 
Symbol 
ID4597324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp1500651 
End bp1502327 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content72% 
IMG OID639776015 
Productdihydroorotase 
Protein accessionYP_922618 
Protein GI119715653 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3653] N-acyl-D-aspartate/D-glutamate deacylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACCGGCG GGCACGCTCG CGTCGATCTG CTCCTCCGCG GGGGGCGAGT CGTCGACGGC 
ACCGGCGCAC CGTGGCGCCA TGCCGACGTA CTGGTCGGGA ACGGTCTGAT CGCCGGCGTC
GTCCCGCCCG GTCGGTTCGA CGGCGAGGTG GACGAGGTCA TCGACGCCGA CGGTCTCGTC
GTCGCTCCCG GATTCGTCGA CATCCACTCC CACTCCGACC TCGCCCTCCT GGCTCGGCCC
GGCGCGGAGG AGAAGCTCAG GCAGGGCGTC ACCACGGAGG TCGTGGGCAA CTGCGGCATC
TCGGTCGCTC CGGTCACCGC CGAGCACCTC CTCGACTGCC AGCAGTACGC GCACCCGGTG
CTCGCGTTCG AGGAGATCGA GTGGGACTGG ACCGACACCG CCGGGTACCT CGAGCGGATC
GAGCGTGCCG GACCGGCGGT GAACGTGGCC ACCTACGTCG GCCTGGGCAC GGTCCGCTGC
GCAGTCTCCG CCTTCGACCC GACCGCGCCG AACCCCGAGC GGCGCCGCAC CATGCGTGAG
CTCACGCGGC AGGCCCTCGC GCAGGGGGCG GTCGGGGTCA GCACGGGTCT CGTCTACGCA
CCCGGGTCGT ACGCCTCCGA CGAGGAGATC TGCGAGCTCG TCGACGAGGC CGCCCGCGTC
GGCGCGCTCT ACTCGAGCCA CATGCGCGAC CAGGGTGACG GCTTCCTCGA GTCCGTCGCC
GAGACGCTCG ACGTCGGACG CCGTACCGGC GCGGCGGTCC AGATCGCCCA TCACAAGGTC
GTCGGCCCAC GCAACTGGGG TCAGGTGTCC ACCTCCCTCG CGATGCTCCG CGCCGCCCGA
GCAGAAGGCA TCGACGCCGG GTCGGACGTG TACCCCTACC TCGCCGGCAG CACCACCATG
ACCGCGCTCC TCCCCGACTG GACGCTCGTC GGGGGGCGGC ACGCGATGCT TCGCCGGCTC
GCCGACCCCA CCGACCGCGA GCGCATCAAG CGCGACTGGG TGCAGGGCAT CCCGCTCTGG
GACAACCGCG TCGCGAGCCT CGGCTGGGAC AACATCTACA TCGCCCACGT GGCGACGGAC
GTGAACCAGG ATGTCGTCGG GCTCAGCGTC GCGGCGGCAG CGGTCGCGCG CCAGCGTGGT
GCAGACCCCG GGGACTTCCT GCTCGACCTG CTGCTCGACG AGCAAGGGGC CGTCGGCAAC
GTGCAGATGG CCTGCGCCGA GGATGACCTG CGCCTCGTGA TGAGCGACCC GACGACCACG
TTCGGCAGCG ACGGCCTCTA CGCCGGCCGC CGGCCGCATC CTCGGCTGCA CGGGACGTTT
CCGCGGATCC TCGGGGAGTA CGTGCGCGAG ACCGGCCTGC TCACCCTCGA GGAGGCGATC
AGCAAGATGA CCTCGCGGCC CGCGCGCCGC CTCGGGCTCG ACGGGATCGG CCTGATCGAG
CCCGGCTACG CCGCCGACCT GGTCGTCCTC AACGCCCACA CCGTCGCCGG CCCGGCCGAC
TACGACCGTC CCACCCTGCA CCCGCGCGGG ATCCAGCACG TCCTGGTCTC GGGGCGGGTG
GCGCTTCGCG ACGGCGACCC GACCGGCGTG CGTGCCGGAC GCCCACTGCG CCGCAACCAG
ACCGGCCTCC CCGCACCTGC AGGGAGCGAC CATCGCAACC TCGAAGGAGC ATCGTGA
 
Protein sequence
MTGGHARVDL LLRGGRVVDG TGAPWRHADV LVGNGLIAGV VPPGRFDGEV DEVIDADGLV 
VAPGFVDIHS HSDLALLARP GAEEKLRQGV TTEVVGNCGI SVAPVTAEHL LDCQQYAHPV
LAFEEIEWDW TDTAGYLERI ERAGPAVNVA TYVGLGTVRC AVSAFDPTAP NPERRRTMRE
LTRQALAQGA VGVSTGLVYA PGSYASDEEI CELVDEAARV GALYSSHMRD QGDGFLESVA
ETLDVGRRTG AAVQIAHHKV VGPRNWGQVS TSLAMLRAAR AEGIDAGSDV YPYLAGSTTM
TALLPDWTLV GGRHAMLRRL ADPTDRERIK RDWVQGIPLW DNRVASLGWD NIYIAHVATD
VNQDVVGLSV AAAAVARQRG ADPGDFLLDL LLDEQGAVGN VQMACAEDDL RLVMSDPTTT
FGSDGLYAGR RPHPRLHGTF PRILGEYVRE TGLLTLEEAI SKMTSRPARR LGLDGIGLIE
PGYAADLVVL NAHTVAGPAD YDRPTLHPRG IQHVLVSGRV ALRDGDPTGV RAGRPLRRNQ
TGLPAPAGSD HRNLEGAS