Gene Noca_0056 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_0056 
Symbol 
ID4600107 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp62748 
End bp63995 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content68% 
IMG OID639774670 
Productglycosyl hydrolase family 32 protein 
Protein accessionYP_921292 
Protein GI119714327 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1621] Beta-fructosidases (levanase/invertase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGCCCC TGGTCCACTT CACTGCAGAC GCCGGCTGGA TCAATGACCC CCACGGCCTG 
ACCTTCCACC GCGGCCGGTA TCACCTGTTC CACCAGTACG TCCCCGAGAG CATGGTGTGG
GCGCCCAACT GCCATTGGGG CCACGCCACG AGCTCGAACC TCCTCACGTG GACGCGGCAC
CGGGTGGCGA TCGCCCCTGG AGACGGGGAC GACGGCATCT GGACGGGAAG CCTCGCTCTG
ACGGGCCAGG ACGCCACCAT CCTCTACACC TCGGTCGCCC AACCGGACCT CGGCTTGGGT
CGCGTCCGCC TCGCCACCCC GGCCGACGAC TCGTGGGAGA TCTGGAGCAA GGGTGACATC
GTCGTCACCC CTCCCGACGA GCTCGATCTG ATCGCATTCC GCGATCCCTT CGTCGTTCGC
GACGCGGCGG GCTGGCGCAT GTTCATCGGC GCGGCGACGC GGGAGGGTGA CGCGCTCGCC
CTCACCTACA CCTCGCCGGA CCTGTCGTCC TGGATCTATG AGGGCATCGC CCTCCAGAGG
TCCACGAAAG AGAAGGACCC GGTGTGGATG GGAGCGCTCT GGGAATGCCC CCAGGTCTTC
GAGGTCGACG ACCACTGGGT GATGGTGAGC TCCGTCTGGG ACAACGACGT GCTGCACTAT
GCCGGCTACG CCCTTGGCGA CCGCGACTCC TACAGCGCGG GAAAGCTGAT GCCGACCGAA
TGGGGTCAAC TCAGCTTCGG TGACTCCTAC TACGCCCCGT CCTACTTCCT CGATGAGGAC
GAGCTTCCGT GTCTGATGTT CTGGATGCGC GGCGTGAGCG ATGCAGACGA CGGCTGGGCG
AGCTGCCTGA GCCTGCCCTA TTCCTTGACC GTCCGCGATG ACCGGCTCGT CGCCGAGCCT
CACGCCGCGC TCGCCGAGGC GCGCGGCGAC GCGTTGGCGG CGGGTGCCGA CGCCCGCGCC
TACGACCTGG AGTGGGACCC GACCGCCAGC CAGGCCGAGC TCGTGCTGGC CTCGGACCTC
GGAAAGAGCG CCACCCTGCG CGCAACTGAG GGCAGGATCC ACCTCGAGCG TCCTGGCGTC
GACGCTCAGT CGATGCCCTG GCCAGGCGGC CCCGTCCGAG TCGTCGTCGA CGGTCCCGTT
CTCGAGGTCT CCTGCGCCGG CGGGCTACTC GGAGGACCCA TAGCCCCGGC CACCCGGTGG
GACGGACCAG CCGAGGCGTG CTCGGCCTGG AGGCTTGCTA TCGACTAG
 
Protein sequence
MRPLVHFTAD AGWINDPHGL TFHRGRYHLF HQYVPESMVW APNCHWGHAT SSNLLTWTRH 
RVAIAPGDGD DGIWTGSLAL TGQDATILYT SVAQPDLGLG RVRLATPADD SWEIWSKGDI
VVTPPDELDL IAFRDPFVVR DAAGWRMFIG AATREGDALA LTYTSPDLSS WIYEGIALQR
STKEKDPVWM GALWECPQVF EVDDHWVMVS SVWDNDVLHY AGYALGDRDS YSAGKLMPTE
WGQLSFGDSY YAPSYFLDED ELPCLMFWMR GVSDADDGWA SCLSLPYSLT VRDDRLVAEP
HAALAEARGD ALAAGADARA YDLEWDPTAS QAELVLASDL GKSATLRATE GRIHLERPGV
DAQSMPWPGG PVRVVVDGPV LEVSCAGGLL GGPIAPATRW DGPAEACSAW RLAID