Gene Noca_4587 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4587 
Symbol 
ID4598685 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4856993 
End bp4858390 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content71% 
IMG OID639779196 
ProductBeta-glucosidase 
Protein accessionYP_925769 
Protein GI119718804 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID[TIGR03356] beta-galactosidase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.349984 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCCCCCCG CGTCCCCCGA CAGCTCCCCC GGCAGCCGCT CCGGCAGCCC TGCGGGCAGC 
TCCCTCCCGC AGCTCCCTCC CGGCTTCCGG TTCGGCACCA GCACGGCGTC GTACCAGATC
GAGGGCGCGG CGACGGAGGA CGGCAAGGGC CCCAGCGTGT GGGACACCTT CACCGCCGAG
GAGGGCCGGA TCGTCGACGG CTCGAGCGGA GCGGTCGCGT GCGACCACTA CCACCGCTAC
GGCGAGGACG TGGCGCTGAT GAAGCGCCTG GGCGCCGGCG GCTACCGCTT CTCGCTGTCC
TGGCCGCGGA TCCAGCCCAC CGGCTCGGGT CCGGCGAACC CGAAGGGCCT GGACTTCTAC
GACCGCTTGA TCGACGAGCT GCTCGCCAAC GGCGTGCAGC CGATGGCCAC CCTCTACCAC
TGGGACCTGC CCCAGGCGCT CGAGGACGAC GGCGGCTGGC TGAACCGCGC CACCGTCGAC
CGCTTCGCGG AGTACGCCGC GATCGTCGGG GAGCGGTTCG CCGACCGGGT CGAGCACTGG
ATCCCCGTCA ACGAGCCCAA CGTCGTGATG ATGATGGGCT ACGCGGTCGG CTTCCAGGCG
CCCGGCCGGA CGCTGATGTT CGACTCGATG CCGGTCGCCC ACCACCTGCT GCTCGCGCAC
GGCCGCGCCG CAGTCGAGCT GCGCGCCGCC GGCGCCACCA GCATCGGCTG CGCCAACAAC
CACTCGCCGA TGTGGCCGGC CAGCGACGAC GAGGCGGACG TCGGTGCGAC CAAGCTCTTC
GACGCGTTGT GGAACGGCAT GTTCACCGAG CCGATGCTGC TCGGCCGCTA CCCCGCCGAC
CTGCAGCCGC TGATGGCCGA CGTGGTCTGC GACGGCGACC TGTCGGTGAT CCGCCAGCCG
CTCGACTTCT ACGGCGTCAA CTACTACCAC CCGTTCAAGA TCGGCGCCGC CCGCGAGGAC
GCCGAGATGC CCTTCGAGTT CCGCGAGCTG GTCGGCTACC CGACCACGGA CTTCGGCTGG
CCGGTGGTGC CCGACGCGTT GCGCGAGTGG CTGATCACGC TGCGGGCCCG CTACCGGGCC
GCGCTACCGC CGATCTACAT CACCGAGTCC GGCTGTTCCT ACAACATGGG CCCCGACGAG
TTCGGCGTCG TCGACGACCA GCCGCGCATC GACTACCTCG ACGCCCACCT GCGGGCGGTC
GCGACCGCCT GCCAGCGCGG CGTCGACGTA CGCGGCTACT ACACGTGGTC GCTGATGGAC
AACTTCGAGT GGTCCGAGGG CTACACCCAG CGCTTCGGCC TCGTGCACGT CGACTTCGAC
ACCCAGGTGC GCACCCCCAA GCGCTCCTTC CAGTGGTACG CCGACGTGAT CGCCCGGCAG
ACCCGCTCCG TGGGCTGA
 
Protein sequence
MPPASPDSSP GSRSGSPAGS SLPQLPPGFR FGTSTASYQI EGAATEDGKG PSVWDTFTAE 
EGRIVDGSSG AVACDHYHRY GEDVALMKRL GAGGYRFSLS WPRIQPTGSG PANPKGLDFY
DRLIDELLAN GVQPMATLYH WDLPQALEDD GGWLNRATVD RFAEYAAIVG ERFADRVEHW
IPVNEPNVVM MMGYAVGFQA PGRTLMFDSM PVAHHLLLAH GRAAVELRAA GATSIGCANN
HSPMWPASDD EADVGATKLF DALWNGMFTE PMLLGRYPAD LQPLMADVVC DGDLSVIRQP
LDFYGVNYYH PFKIGAARED AEMPFEFREL VGYPTTDFGW PVVPDALREW LITLRARYRA
ALPPIYITES GCSYNMGPDE FGVVDDQPRI DYLDAHLRAV ATACQRGVDV RGYYTWSLMD
NFEWSEGYTQ RFGLVHVDFD TQVRTPKRSF QWYADVIARQ TRSVG