Gene Noca_3501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3501 
Symbol 
ID4595600 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp3709635 
End bp3710675 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content76% 
IMG OID639778109 
Productcellulase 
Protein accessionYP_924688 
Protein GI119717723 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.724652 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGCCGCC GCCCCTCGCC GACCGCCTCG TCGATCCGGC CCCCGGTCCT GATGCTGGTC 
GCCCTGGTGG CCGCCGTGCT CGCGCTCGGC GGGTGCTCGG GCGGCGGCGA GCCGGACGAC
GCGCCGAGCA GCGCGGACCC GGTCGCGGAC AACCCCTACG CGGGCCGGAC CGGCTTCGCC
GACCCGGCCT CGCGGACCGC CCAGGCCGCA GCGCAGGCGA AGGCGGATGG TGACGTCGAG
GCGGAGCGGG TCCTCTCCCG GCTCGCCAGC ACGCCGCAAG GCATCTGGCT GACGCCCGAG
GAGTACCCGC CCGGCTCGGT CGCGCCGTTG GTCGCGCGGG TCGTCCGGGC CGCCGACGCA
GCCGGCCAGG TGCCGACGTT CGTCGTGTAC GGCATCCCCG ACCGCGACTG CACCGGCGGC
TTCTCCGGCG GTGGCCTGAC CGCGGACCAG TACGGGCCCT GGGTGCAGGA GATCGCGGAC
GCCGTCTCGG GCGCCGACCC CTCGGTCCCG GTCGTGGCGG TCGTCGAGCC GGACGCGCTC
GCCTCGGCGA TCGCGTGCGA CCGCCGCTCG GAGCGGGTGC GGCTGATCGC CGACGCCGTG
ACCCGGCTCG CCGACGCCGA GGTGACGACC TACGTCGACG GCGGCCACTC GCACTGGATC
GAGCCGGATC AGCTGGCGAG GCTGCTCGAG CAGGCCGGGA TCGACCAGGC CCGGGGCTTC
GCCACCAACG TCTCGAACTA CCAGACCGAC GCCGACGAGC GTGCGTACGG CGAGCAGCTC
AGCGCCCTGC TCGACGGAGC CCACTACATC GTCGACACCG GACGGAACGG CAACGGCTCG
ACCGAGGATT GGTGCAACCC GACCGGTCGG GCCTACGGCA CCGACCCGGC GCCGGCGCCG
GAGGGCGACG CGGAGCACCT CGACGCCTAC GTCTGGGTCA AGCCGCCGGG GGAGAGCGAC
GGCGAGTGCG GAGGCGGGCC GCCCGCCGGG CGCTTCTGGC GTGAGCGGGC GCTGGAGATG
GCCGTCTCGT CCGGGTGGTG A
 
Protein sequence
MRRRPSPTAS SIRPPVLMLV ALVAAVLALG GCSGGGEPDD APSSADPVAD NPYAGRTGFA 
DPASRTAQAA AQAKADGDVE AERVLSRLAS TPQGIWLTPE EYPPGSVAPL VARVVRAADA
AGQVPTFVVY GIPDRDCTGG FSGGGLTADQ YGPWVQEIAD AVSGADPSVP VVAVVEPDAL
ASAIACDRRS ERVRLIADAV TRLADAEVTT YVDGGHSHWI EPDQLARLLE QAGIDQARGF
ATNVSNYQTD ADERAYGEQL SALLDGAHYI VDTGRNGNGS TEDWCNPTGR AYGTDPAPAP
EGDAEHLDAY VWVKPPGESD GECGGGPPAG RFWRERALEM AVSSGW