Gene Noca_3917 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3917 
Symbol 
ID4598052 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4123014 
End bp4124381 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content74% 
IMG OID639778523 
ProductBeta-glucosidase 
Protein accessionYP_925102 
Protein GI119718137 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID[TIGR03356] beta-galactosidase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCTGC ACCTGCCGAG CCCCAGCACC CTCGCCTACG GCGCCGCGAC CGCGTCGTAC 
CAGATCGAGG GAGCGACCGC CGAGGACGGT CGCGGCGCCT CGATCTGGGA CACCTTCACA
ACCCGTCCCG GCGCCATCCG GGACGGCTCG GACGGCTCGG TCGCCTGCGA CTCCTACCAC
CGCTACGAGG AGGACGCCGA CCTGGTCGCC GGCCTCGGCG TCGGGTGGTA CCGCTTCTCG
ATCGCCTGGC CGCGGGTGCT GCCCGAGGGC ACCGGCCGGG TCGAGCCGCG CGGCCTCGAC
TACTACGACC GCCTGGTCGA CGCGCTGCTG GCGCGCGGCG TCTCCCCGAC GGCGACCCTC
TACCACTGGG ACCTGCCGCA GGCCCTCGAG GACCGGGGCG GGTGGCTGGA GCGGTCGACC
GCCGAGGCCT TCGCCGACTA CGCGATGGTC GTCCACGAGC GGCTCGGCGA CCGGGTGGGC
GTCTGGGCGA CCCACAACGA GCCGTGGTGT GCGGCCTACC TCGGCTACGC CGCCGGCATC
CACGCCCCCG GCCGGCGCGA GGGCGGCGCC GCCCACCGGG CGGCCCACCA CCTGCTGCTC
GGCCACGGCC TCGCCGCCGC CCGCCTGCAC GAGGCCGGGG TCGGCGACGT CGGCATCGCC
CTCAACCTGG CGCCGTTCTG GCCCGAGTCG CCCGACGCGG TGGCCGCGGC GGACGGGGTC
GACGCCATCC GCAACCGGCT CTGGCTCGGC CCGCTCGTCG ACGGCGCGTA CGACGACGGG
CTGCTCGCCG TCGCGCCCGA GCTGGCCGAC CCCGACGTGG TCCACGAGGG CGACCTCGAG
CTGGTCCGCG GCTCGGCCGA CTGGATCGGG ATCAACTACT ACACGCCGTT CCGGCCGACC
CTCGCCGACC CCGCGCTCGA GACCCACCCG GAGGTCGACG CCTATCCCGG CGCCACCCCG
GTGTCCTTCG TGGTCCGCGA GCCGCGCACC GACATCGGCT GGGAGGTCGA GGCCCGTGGC
CTGGAGGAGC TGCTCGTCGA GACGCACCGG CGCACCGGGC TGCCGCTGAT CGTGACCGAG
AACGGCGCGG CCTACGCCGA CGACACCCTC CGGGAGGGTG CCGCCGGCGT CATCGACGAC
CAGGACCGGA TCGCCTACCT GCGCGACCAC ATCGCCGCGA CCGAGCGGGC CCGGTCGGCC
GGTGCCGACG TGCGGGCCTA CATCGTGTGG ACCCTGCTGG ACAACTTCGA GTGGGCCGAG
GGCTACACCA AGACGTTCGG TGTCGTCCAC GTGGACCCGA AGGACCAGAC CCGGACCCCC
AAGGCGTCCT ACCACTGGCT GGCCGAGCAC GTCGCCGAGG CCCATTGA
 
Protein sequence
MSLHLPSPST LAYGAATASY QIEGATAEDG RGASIWDTFT TRPGAIRDGS DGSVACDSYH 
RYEEDADLVA GLGVGWYRFS IAWPRVLPEG TGRVEPRGLD YYDRLVDALL ARGVSPTATL
YHWDLPQALE DRGGWLERST AEAFADYAMV VHERLGDRVG VWATHNEPWC AAYLGYAAGI
HAPGRREGGA AHRAAHHLLL GHGLAAARLH EAGVGDVGIA LNLAPFWPES PDAVAAADGV
DAIRNRLWLG PLVDGAYDDG LLAVAPELAD PDVVHEGDLE LVRGSADWIG INYYTPFRPT
LADPALETHP EVDAYPGATP VSFVVREPRT DIGWEVEARG LEELLVETHR RTGLPLIVTE
NGAAYADDTL REGAAGVIDD QDRIAYLRDH IAATERARSA GADVRAYIVW TLLDNFEWAE
GYTKTFGVVH VDPKDQTRTP KASYHWLAEH VAEAH