Gene Noca_4087 details


Gene Information

Locus tag: Noca_4087
Symbol:
ID: 4596601
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 4317128
End bp: 4318798
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 72%
IMG OID: 639778693
Product: alpha/beta fold family hydrolase/acetyltransferase
Protein accession: YP_925271
Protein GI: 119718306
COG category: [R] General function prediction only
COG ID: [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)
TIGRFAM ID:
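
The start and end coordinates, the gene length, and the protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 4317128
end_bp = 4318798

# Inclusive span on the replicon.
gene_length = end_bp - start_bp + 1

# Number of codons in the CDS, including the terminal stop codon.
codons = gene_length // 3

# The stop codon is not translated, so the protein is one residue shorter.
protein_length = codons - 1

print(gene_length, protein_length)  # 1671 556
```

Both values agree with the "Gene Length" and "Protein Length" fields of the record.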


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.779957
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGATCGAAC GGCACTACGT GAGCCTGGAG TCGGGCTCGA TGCACTACCT GCGAGCCGGA 
CGGGGCACGC CCGTCGTGGT GCTGCACGCC TCGCCCATGT CCTCCCGATC GATGCTCGAC
TGGGTCCGCG GGCTGGCCGG CGACTTCACG GTCTACGCGC CGGACACCGC CGGGTTCGGG
CAGTCCGACC CACTTCCGTA CGGCGGTGAC GCGCCGGCGG TCTCTGACTA TGGCCGCCGC
GTGCTGGAGT TCGCCGACGC CGTCGGCCTG GACCGCTTCC TCCTGGGCGG CACCCACACC
GGCGCCAAGG TGGCGCTGGA GGCCGCGGTC CAGGCGCCCG CGCGCGTCGC CCAGCTCGTG
ATGGACGGGC TCGGTCTGTA CACCCCGGAG GAGATGAGCG ACCAGCTCGA GCACTACACA
CCTCCCATCG TGCCGGTGTG GCACGGTGGG CACCTGACCG AGGTCTGGCA CCGGCTGCGC
AACATGTGGA CGTTCTGGCC TTGGTATCGG CAGGAGGCGG AGTGCCGGCT GGCCGACGCC
ATCCCCGACC TCGACATCCT GCAGCAGATG GCCTTCGACT CCCTCCGTGC GCGTCCCGAC
TGGGGACTGG CCTACCGCGC CGCGTTCCAG TACGACGGGC GACGAGCGCT CGCGCGCCTG
TCCTCCCCGG CCGTGTTGAT CGCCAAGGAG GCCGACCCAC TGCACGAGCA CCTCGAGCGG
CTGGGCCTGG ACACGCCGAT GCTGCAGGTG CGGAGCGTCG CCAACGAGGA CCACCTGAGC
TCGGTCCTGG CCGCCTTCGA CCGTGCCTGC GGCCTCCCGG ACGCGCCAGC GCCGCGGCCG
GCCCGGCACG GGGGCGGGAC CACCCGGCGC TACGTCCGGA CGTCGTCGGG CGAGGTCCAT
GTCCGCGTCG ACGGCGACCC CGGTGCGCCT CCGCTGCTGT TCGTGCATGG GTCACCGGGA
TCGGCGGACA GCTCCGACCC GCTCATCCGC GACCTGTCGC GCGACCATCT CGTGGTGGCA
CCCGACACGC TAGGCAACGG CTATTCCGAG CCGGCGCCCG GGACGGACCC CGGCATCGAC
GTGTTCGCGG ACGCCGTCGC CGAGGTGCTC GCCCAGCTCG GACTCGGGCC GGTGACCGCC
TACGGGAGCC ACACGGGTGC CTGCATCGTC CTGGAGCTGG CCGTCCGCCG GCCGGACCTG
GTCGCGACCG TGGTGGCAGA CGGCCTCCCC GTCTTCGAAG AGGCGGAGCA GGCCGACCTG
CTCGAGAACT ACTTCGTCTC GCTCTCCCCG GAGCGCCACG GCGAGCACCT GCTGCGCGCC
TGGCACACCA TCCGTGACGT CCAGCTGTGG TGGCCCTGGT ACCGGCAGGA CGCCGAGCAC
CGGCGCCCCA CCGGGCCCTC CGACCCCGAG ACGCTGCACC GGCTGGTCGT CGAGTTCATC
AAGAGCGGGA ACACCTACCG TGCGTCGTAC GCGGCGGCGA TCCGGTATCC CTCGCTCGAC
CGGCTCGGAC AGGTGAGTGT TCCCACCGTC GCGTGCGCCG AGCCCACCGA CATGCTGTTC
GCCGGCAGCC GGGAGGCGAG TGCGCTCCCG GGCGTCCGGT TCGTGACGCT GGACGCTCGC
GCGGGACGCG ATGCGGCGTG GGTCGTGCGC GAGGCGGCCG ACTCCGCCTG A
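
The record reports a GC content of 72% for this gene. A minimal sketch of how such a figure could be computed from the space-separated 10-base blocks shown above follows; `gc_percent` is a hypothetical helper name, and the record does not say how the database rounds its value.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces and newlines between the 10-base blocks) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Usage: paste the full gene sequence in place of this first line of it.
first_line = "GTGATCGAAC GGCACTACGT GAGCCTGGAG TCGGGCTCGA TGCACTACCT GCGAGCCGGA"
print(f"{gc_percent(first_line):.1f}% GC")
```

Running the function on the complete 1671 bp sequence, rather than on the first line alone, gives the value to compare with the reported 72%.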
 
Protein sequence
MIERHYVSLE SGSMHYLRAG RGTPVVVLHA SPMSSRSMLD WVRGLAGDFT VYAPDTAGFG 
QSDPLPYGGD APAVSDYGRR VLEFADAVGL DRFLLGGTHT GAKVALEAAV QAPARVAQLV
MDGLGLYTPE EMSDQLEHYT PPIVPVWHGG HLTEVWHRLR NMWTFWPWYR QEAECRLADA
IPDLDILQQM AFDSLRARPD WGLAYRAAFQ YDGRRALARL SSPAVLIAKE ADPLHEHLER
LGLDTPMLQV RSVANEDHLS SVLAAFDRAC GLPDAPAPRP ARHGGGTTRR YVRTSSGEVH
VRVDGDPGAP PLLFVHGSPG SADSSDPLIR DLSRDHLVVA PDTLGNGYSE PAPGTDPGID
VFADAVAEVL AQLGLGPVTA YGSHTGACIV LELAVRRPDL VATVVADGLP VFEEAEQADL
LENYFVSLSP ERHGEHLLRA WHTIRDVQLW WPWYRQDAEH RRPTGPSDPE TLHRLVVEFI
KSGNTYRASY AAAIRYPSLD RLGQVSVPTV ACAEPTDMLF AGSREASALP GVRFVTLDAR
AGRDAAWVVR EAADSA
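
The protein sequence begins with M although the gene sequence begins with GTG, which is consistent with translation table 11 (bacterial), where GTG can act as a start codon. The sketch below illustrates this with a hand-built codon table in plain Python; `translate` is a hypothetical helper, and treating any table 11 start codon in the first position as methionine is a simplifying assumption.

```python
from itertools import product

# Codons in NCBI order (first base varies slowest, bases ordered T, C, A, G).
BASES = "TCAG"
# Amino acids for translation table 11 (identical to the standard code).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(cds: str) -> str:
    """Translate a CDS, reading an initial start codon as M and dropping a final stop."""
    seq = "".join(cds.split()).upper()
    if not seq or len(seq) % 3:
        raise ValueError("CDS must be non-empty and a multiple of 3 bases long")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    protein = [CODON_TABLE[c] for c in codons]
    if codons[0] in START_CODONS:
        protein[0] = "M"  # initiator codon is read as methionine
    if protein[-1] == "*":
        protein.pop()  # the stop codon is not part of the protein
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("GTGATCGAAC GGCACTACGT GAGCCTGGAG"))  # MIERHYVSLE
```

The output matches the first ten residues of the protein sequence listed above.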