Gene Noca_4386 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4386 
Symbol 
ID4596904 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4637598 
End bp4638890 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content73% 
IMG OID639778996 
ProductN-formimino-L-glutamate deiminase 
Protein accessionYP_925570 
Protein GI119718605 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID[TIGR02022] formiminoglutamate deiminase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.280885 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACCGAGT CCCCCTCCTC GGCGTACCTC CTCGAGCGGG CGTGGGTCGA CGGCGCCGTC 
CGCGACGACG TGCTCGTCGA GATCGAGGAC GGCCGATTCA CATCCGTCAC ACCGGTAGGG
CAAAGCCTCC GCTTCTCGGG TGAAGTTTCG GCCCAGAACG TCCGGTTTCC CCGGCTAGCC
GGGGAAACCG TCCGGCTCGC CGGCCTCACC CTCCCCGGGC TGACCAACGA CCACAGCCAC
GCGTTCCATC GGGCGCTGCG GGGGCGGACC CAGCGGGGGC GGGGCACGTT CTGGACCTGG
CGCGAGCAGA TGTACGCCGT GGCGGAGCGG CTCACCCCGG ACACCTACTT CACCCTGGCC
CGGGCGACGT TCCGGGAGAT GGTGGCGGCG GGCTACACGA GCGTGTCCGA GTTCCACTAC
CTCCACGACC CGGAGCCGGC CACGGGCGAC GCCCTCCGCG AGGCCGCACG AGCGGCGGGC
ATCCGGCTCA CGCTGCTCGA CGCCTGCTAC GTCAGCAGCG GCTTCGGCGC GCCGCCGGAG
GGCGTGCAGC TCCGGTTCAG CGACGGCACC GCCGAGCGGT GGGCCGCACG GGTCGGCGCC
GGCCCGGCCG CGATCCACTC CGTGCGCGCC GTCCCCCGCG AGCAGCTGGC GGTCTTCCGC
GGCCGCGCGC CCCTGCACGT CCACCTCTCC GAGCAGGTCG CCGAGAACGA GGCCTGCCTG
GCGGCGTACG GCGTCACCCC GACCCGGCTG CTCCACGACG AGGGTCTGCT GGGTCCGGAC
ACCACCGTCG TGCACGCCAC CCACCTCACC GACGACGACA TCGCGCTGCT CGGCAGCACC
CGCACGAACG TCTGCATCAC GCCCACCACC GAGCGTGACC TCGCCGACGG GATCGGCCCG
GCGCGCCGGC TCGCGGACGT GGGCTGCCGG ATCAGCATCG GCAGCGACAG CCACGCGGTC
ATCGACCCGT TCGAGGAGCT GCGCGGCCTC GAGATGGACG AGCGGCTCGC GACCCAGGAG
CGCGGGCACT GGTCAGCTGC CGAGCTGTTA GCGATCGGCA CCGCCGGGCT GCAGATCGCG
GTCGGTGCGC CGGCCGACCT GGTCACGATC GACACGCGAA GCACCCGCAC CGCCGGCACC
GGCGCCGACG AGCACACCGC CGTCTTCGCC GCCACCGCCG CCGACGTCAC CCACGTCGTC
GTCGACGGCC GGGTCGTCGC GTCCGAGGAC GACCACGAGG ACATCGGCCG CGAGCTGGCG
GCTGCGATGG AGGCACTGTG GCAAGCACCC TGA
 
Protein sequence
MTESPSSAYL LERAWVDGAV RDDVLVEIED GRFTSVTPVG QSLRFSGEVS AQNVRFPRLA 
GETVRLAGLT LPGLTNDHSH AFHRALRGRT QRGRGTFWTW REQMYAVAER LTPDTYFTLA
RATFREMVAA GYTSVSEFHY LHDPEPATGD ALREAARAAG IRLTLLDACY VSSGFGAPPE
GVQLRFSDGT AERWAARVGA GPAAIHSVRA VPREQLAVFR GRAPLHVHLS EQVAENEACL
AAYGVTPTRL LHDEGLLGPD TTVVHATHLT DDDIALLGST RTNVCITPTT ERDLADGIGP
ARRLADVGCR ISIGSDSHAV IDPFEELRGL EMDERLATQE RGHWSAAELL AIGTAGLQIA
VGAPADLVTI DTRSTRTAGT GADEHTAVFA ATAADVTHVV VDGRVVASED DHEDIGRELA
AAMEALWQAP