Gene Noca_4086 details

Gene Information

Locus tag: Noca_4086
Symbol:
ID: 4596600
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 4315576
End bp: 4317042
Gene Length: 1467 bp
Protein Length: 488 aa
Translation table: 11
GC content: 69%
IMG OID: 639778692
Product: amidohydrolase
Protein accession: YP_925270
Protein GI: 119718305
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID:
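
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which the numbers on this page are consistent with) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(4315576, 4317042)
print(length, protein_length_aa(length))  # 1467 488
```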


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGAGC TGGATCTGAT CATCACCAAC GTGTCGGTGG TCGTCGACGA CGGGGCGGAG 
CCGCACCCCG TGGACATCGG GATCAAGGAC GGGAAGATCG CTCGGCTCGA GGCAGGGCTG
TCGCGCCAGA GCGCTGGCCG GGTGATCGAC GGGGCCGGAA AGGTCGCGTT CCCAGGGGTG
GTCGACGCCC ACCAGCACTG GGGCATCTAC AACCCGCTTC CCGAGGACGC AGTCAGCGAG
AGCCGCGCAT CGGCCCAGGG CGGCGTCACC ACGGCGATCA CCTACATGCG CACCGGCCAG
TACTACCTGA ACAAGGGCGG CGCCTACCGC GACTTCTTCC CCGAGGTCCT CTCGCTGACC
GAGGGCAGGT CGGTCGTCGA CTACGCCTTC CACCTCGCCC CGATGAGCGC CGGCCACATC
GCCGAGATCC CGGCTCTGGT CGAAGAGCAC GGCGTGACGT CGTTCAAGAT CTTCATGTTC
TACGGCGGGC ACGGGCTCCA CGGCCGCAGC GCCGACCAGA GCTCGTTCCT GATGACACCC
GAGGGTGAGC GATACGACTA TGCACACTTC GAGTTCGTCA TGCGCGGCAT CGAGGACGCG
CGCAGGAAGT TCCCCGAGAT CGCTGACCAG ATCTCGCTCT CGCTGCACTG CGAGACCGCC
GAGATCATGA CGGCGTACAC GAAGCTCGTC GAGGAGGACG GCACGCTCGC CGGCCTCCCG
GCATACAACG CCTCCCGCCC GCCGCACTCG GAGGGCCTGG CGGTCACGAT CGCGTCGTAC
CTGGCCCACG AGACCGAGCT GCCCACGATC AACCTGCTGC ACCTGACCTC GCGGAAGGCG
GTCGAGGCCG CGCTGACGAT GGCCGACGCG TTCCCGCACA TCGACTTCCG TCGCGAGGTC
ACCGTCGGCC ACCTCCTCGC CGACTGCGAC ACCGCCCACG GCGTCGGTGG CAAGGTCAAC
CCGCCGCTGC GCCCCCGCGA GGACGTCGAA GCCCTGTGGG GCTACCTGCT CGACGGCAAG
ATCGACTGGG TCGTCTCGGA CCACGCCTGC TGCAAGGAGG AGCTCAAGTT CGGCGACCCC
GAAGACGACG TCTTCTTGGC CAAGTCCGGC TTCGGCGGCG CCGAGTACCT CCTCGCCGGC
ATGATCACCG AAGGCCGCAA GCGCGGTCTC GGCCTCGGCC GGATCGCCGC GCTCACCGCT
ACCAACCCCG CCGAGCGCTA TGGTCTCGGT GCCACCAAGG GCTCCCTCCA GGTCGGCAAG
GACGCCGACA TCGCGCTCGT CGACCTCGAC CACACCTGGA CGGTCCGCGC CGAAGACTCC
CCGTCCGCCC AGGAGTACAC GCCCTTCGAG GGTCAGGAGC TCACGGCCAA GGTGACCGAC
ACGTTCGTCC GGGGCCACCA CGTGATGGCC GACGGCGTCG TGGGCGACGA GCCGGTGGGT
CGGTACCTCG CCCGGCCCAC GGCCTGA
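
The GC content field (69% on this page) can be recomputed from the gene sequence. A minimal sketch follows, assuming GC content is defined as the share of G and C among the A, C, G and T bases; the helper name `gc_percent` is illustrative. It skips the spaces and line breaks used in the listing above. Only the first 30 bases are used here, so the printed value is not the whole-gene figure.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]  # drops spaces and newlines
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(b in "GC" for b in bases)
    return 100.0 * gc / len(bases)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGAGCGAGC TGGATCTGAT CATCACCAAC"
print(gc_percent(first_blocks))  # 50.0
```

Passing the full 1467 bp sequence to the same function gives a value that can be compared with the reported GC content.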
 
Protein sequence
MSELDLIITN VSVVVDDGAE PHPVDIGIKD GKIARLEAGL SRQSAGRVID GAGKVAFPGV 
VDAHQHWGIY NPLPEDAVSE SRASAQGGVT TAITYMRTGQ YYLNKGGAYR DFFPEVLSLT
EGRSVVDYAF HLAPMSAGHI AEIPALVEEH GVTSFKIFMF YGGHGLHGRS ADQSSFLMTP
EGERYDYAHF EFVMRGIEDA RRKFPEIADQ ISLSLHCETA EIMTAYTKLV EEDGTLAGLP
AYNASRPPHS EGLAVTIASY LAHETELPTI NLLHLTSRKA VEAALTMADA FPHIDFRREV
TVGHLLADCD TAHGVGGKVN PPLRPREDVE ALWGYLLDGK IDWVVSDHAC CKEELKFGDP
EDDVFLAKSG FGGAEYLLAG MITEGRKRGL GLGRIAALTA TNPAERYGLG ATKGSLQVGK
DADIALVDLD HTWTVRAEDS PSAQEYTPFE GQELTAKVTD TFVRGHHVMA DGVVGDEPVG
RYLARPTA
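
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes the amino acid assignments of translation table 11, which match the standard genetic code (the two differ in the permitted start codons, which this sketch does not model). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons in TCAG order paired with their amino acids (standard assignments,
# assumed here to apply to translation table 11 as well).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # remove block spacing and newlines
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, both of which start in frame.
print(translate("ATGAGCGAGC TGGATCTGAT CATCACCAAC"))  # MSELDLIITN
print(translate("CGGTACCTCG CCCGGCCCAC GGCCTGA"))     # RYLARPTA
```

The two outputs match the first ten and last eight residues of the protein sequence listed above.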