Gene Noca_4513 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4513 
Symbol 
ID4597032 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4773409 
End bp4774575 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content75% 
IMG OID639779124 
Productglycoside hydrolase family 3 protein 
Protein accessionYP_925697 
Protein GI119718732 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.662398 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGTCGT CGCCTCGTCC CGTCCTGGTC GCGGTCGCGG CCGCTGCCCT GGGGGCCGGG 
CTGCTGGCGC CCGCCAGCGG GTCCGCGCAC GCGCCTGCGG TGAGCGGCCG GCACCAGGAG
ACCCGGACGG CGGCGTCGGT GTATCGGGAG ATGAGCCTGG CCCAGCGGGT CGGGCAGCTG
TTCATGGTCG GCACCCCCGC GGACCGGGTC GACCGCCGTA CCCGCGCCCA GATCCATCGC
TTCCACGTCG GCAACGTGAT GCTGACCGGC CGCAGCTATG ACGGGGTCCG CGCGCCGGCC
CGGGTGTCCC GGGCGATGCG CGGCGAGGTC GACGGGAGGT CGACCGCCGG CGTCCGGCTC
TTCGTCGCGA CCGACCAGGA GGGCGGCCAG GTCCGGGTGT TGCAGGGGCC CGGCTTCTCC
GACATCCCCT CGGCCCTGGA GCAGGGCACC TGGCAGCCGC GCCGGCTGCG TGGCGCCGCG
AAGTTGTGGG CCGGGCAGCT GCGCCGGGCC GGCGTGAACC TCGACCTGGC GCCGGTGATG
GACACCGTTC CCAGCCGGCG GGCGGCTCGG CACAACCCGC CGATCGGCCG CTACGACCGC
GAGTTCGGCT TCACGACCAA GGTCGTCGCC CGGCACGGGG TGGCGTTCCT CAACGGCATG
GCCGACGGCG GCGTCGTACC GACGGCGAAG CACTTCCCCG GCCTGGGCCG GGTCCACGCG
AACCCCGACA CCCACGCCGG CGTCACCGAC CGGGTCACGA CCCGGCACGA CGCCTACCTG
CGGCCGTTCG GGGCGGCGAT CGACGCGGGC GTCCCGATCG TGATGATGTC GACGGCGTAC
TACGAGCACC TCGACCCGCG GAACCCCGCG GCGTTCTCAC CGTTCGTGGT CGGCACCATG
CTGCGCGGCG ACCTCGGGTT CCGCGGCGTG GTCATCTCCG ACGACCTGGC CCGGGCCCGG
CAGGTCGCGG GCTTCAGCCC GGCCGGCCGG GCACTGCGGT TCATCGGCGC GGGTGGCGAC
ATCGTGCTCA GCGTCGATGC CGACCCGGTG GGGGAGATGT ACCGCGCGGT CCTCGAGCGC
GCCCGGACCA GCGAGCGGTT CCGCGCCAAG GTCGACGCGG CGGTGCTGCG GGTGCTGCGC
GCCAAGCAGG ACCGGCACCT GCTGTGA
 
Protein sequence
MSSSPRPVLV AVAAAALGAG LLAPASGSAH APAVSGRHQE TRTAASVYRE MSLAQRVGQL 
FMVGTPADRV DRRTRAQIHR FHVGNVMLTG RSYDGVRAPA RVSRAMRGEV DGRSTAGVRL
FVATDQEGGQ VRVLQGPGFS DIPSALEQGT WQPRRLRGAA KLWAGQLRRA GVNLDLAPVM
DTVPSRRAAR HNPPIGRYDR EFGFTTKVVA RHGVAFLNGM ADGGVVPTAK HFPGLGRVHA
NPDTHAGVTD RVTTRHDAYL RPFGAAIDAG VPIVMMSTAY YEHLDPRNPA AFSPFVVGTM
LRGDLGFRGV VISDDLARAR QVAGFSPAGR ALRFIGAGGD IVLSVDADPV GEMYRAVLER
ARTSERFRAK VDAAVLRVLR AKQDRHLL