Gene Noca_3384 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3384 
Symbol 
ID4598182 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp3586037 
End bp3587932 
Gene Length1896 bp 
Protein Length631 aa 
Translation table11 
GC content74% 
IMG OID639777991 
Productglycoside hydrolase family 3 protein 
Protein accessionYP_924572 
Protein GI119717607 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAGCCC GGCCGAGCGC CGCCACCCGT CGCCGCAGGA CCGCCGTCCG GACCCTCGGT 
GCGGGCTTGA TCCTGCTGCT CGCCGCAGCG TTCGGGCCGA CCTTGTCGCC ACCCCCGGCG
ACCGCCGCTG TCGGGGACCG AGCCGCCGCC GGCCCGTCGT ACGACCGGGC GCGCACCGCG
CTGGCGCAGC TGACGCGCAA GCAGAAGGTC GGGCAGCTGT TCGTGATCGA GGTCGCCGGT
CGTGACGCCA ACGACGTGAG CGACGCGGCG AAGGCGGTCA ACCAGAGGCT GTACGGCGTC
GACACCCCGG CCCAGGCGAT CGCGAAGTAC CAGCCCGGCG GGGTCATCTA CTTCACCACC
CGCAACGGGG ACGACAACAT CGGTACGCCG GAGCAGGTGG CCAGGCTGTC GAACGGGCTG
CAGGCCGCGG CGCGGGCGCT GCCGGGCGGG ATCCCGCTGC AGATCTCGGT CGACCAGGAG
GGCGGTGCCC TCGTGGCGCG CTTCGGCGCG GCGTCGGGAG CCACGCAGCT GCCGGGGAAC
ATGGCGCTCG GTGCGGGCGC GCTCGGGACC GGAGGGTCGG CGGCGGACGC CCGCCGCTCG
GCGACCGTGA TCGGCGCCGA GCTCGCGGCG GTCGGGGTGA CGCAGGACTA CGCGCCGGTC
GCGGACGTGA ACGTGAACCC GAACAACCCG GTGATCGGGA TCAGGTCCAT CGGCTCCGAC
CCGGCGCTGG TCTCCGACCT GGTCGCCGCC CAGGTCCGCG GCTTCCACCG CGGCGGAGTC
TCCGCGGTCG CGAAGCACTT CCCGGGACAC GGGGACACGG GCGTGGACAG CCACTTCGGG
CTGCCCGAGG TGACGCACAC GCGGTCGCAG CTGGAGGAGA TCGACCTGCC GCCGTTCCGT
GCAGCGATCG CCGCCGGGGT CGACACGATC ATGACCGCGC ACGTCGTGCT GCCGGCGATC
GACCCGGGTG TCCCCGCGAC GATGTCGCGG AAGATCCTCA CCGGGCTGCT GCGCCGCGAG
CTGGGCTTCG ACGGGCTGAT CGTGACCGAC GCGCTGGACA TGGGTGGCGC GACGGCGACG
TACCCGCCCG ACGTGGCGCC GGTGCGGGCG CTGCTCGCCG GCGCCGACCA GCTGCTGATC
CCGCCCGAGA TGGACACGGC GTACCGCGCG GTGCTGAAGG CGGTGCGCAG CGGGCAGATC
AGCAGGGAGC GGCTCAACGA GTCGGTGTAC CGGATCCTGC TGCACAAGTA CGAGCGCGGG
CTCTTCGGCG ACCCGTACGT CGACCGGGCC GCGGCGGCGG GGATCGTGGG CGCCCCGACG
CACCTCGCGA CCGCGCAGGC GATCACCGAC CGCACGACGA CGCTGCTCAA GAACGACGCC
GGGCTGCTGC CGCTGACCGC CGGGCCGCGG CAGGTCCTGG TCGCCGGGTG GGGCGCGACG
ACGACGCAGA CGCTCGCGAC GGCGCTCGGC ACCCGCGGCG CCACGACGCA GGTCCTCGAG
TCCGGCACCA CGCCCTCGGA CGCGGCGATC GAGGACGCCG TCGCGGCGGC GCAGGACGCC
GACCTGGTCG TCGTGACGAC GAACAACGCG TACGCCGTCG ACGCGTCGAC CGGGGCACCG
ACCAACGCCG CGGCCGCGCA GACCCGGCTG GTGCGCGCCC TGCTCGAAAC GGGTAGGCCG
GTCGTGGTCG CCGCCGTGCG CAACCCGTAC GACGTCGCCT CGTTCCCCTC GGCGCCGACG
GTGCTGGACA CCTACGGCTA CACCGCGGCC CAGGTCGAGT CGCTGGTCCG GGTGCTGTTC
GGCGAGGTCG AGCCGACCGG CCGGCTGCCG GTCGCGATCC CCGGCCCCAA CGGCACCGGC
GAGCTCTTCG AGCTCGGCCA CGGTCTGGGC TACTGA
 
Protein sequence
MPARPSAATR RRRTAVRTLG AGLILLLAAA FGPTLSPPPA TAAVGDRAAA GPSYDRARTA 
LAQLTRKQKV GQLFVIEVAG RDANDVSDAA KAVNQRLYGV DTPAQAIAKY QPGGVIYFTT
RNGDDNIGTP EQVARLSNGL QAAARALPGG IPLQISVDQE GGALVARFGA ASGATQLPGN
MALGAGALGT GGSAADARRS ATVIGAELAA VGVTQDYAPV ADVNVNPNNP VIGIRSIGSD
PALVSDLVAA QVRGFHRGGV SAVAKHFPGH GDTGVDSHFG LPEVTHTRSQ LEEIDLPPFR
AAIAAGVDTI MTAHVVLPAI DPGVPATMSR KILTGLLRRE LGFDGLIVTD ALDMGGATAT
YPPDVAPVRA LLAGADQLLI PPEMDTAYRA VLKAVRSGQI SRERLNESVY RILLHKYERG
LFGDPYVDRA AAAGIVGAPT HLATAQAITD RTTTLLKNDA GLLPLTAGPR QVLVAGWGAT
TTQTLATALG TRGATTQVLE SGTTPSDAAI EDAVAAAQDA DLVVVTTNNA YAVDASTGAP
TNAAAAQTRL VRALLETGRP VVVAAVRNPY DVASFPSAPT VLDTYGYTAA QVESLVRVLF
GEVEPTGRLP VAIPGPNGTG ELFELGHGLG Y