Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_3501 |
Symbol | |
ID | 4595600 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 3709635 |
End bp | 3710675 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 639778109 |
Product | cellulase |
Protein accession | YP_924688 |
Protein GI | 119717723 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.724652 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGCCGCC GCCCCTCGCC GACCGCCTCG TCGATCCGGC CCCCGGTCCT GATGCTGGTC GCCCTGGTGG CCGCCGTGCT CGCGCTCGGC GGGTGCTCGG GCGGCGGCGA GCCGGACGAC GCGCCGAGCA GCGCGGACCC GGTCGCGGAC AACCCCTACG CGGGCCGGAC CGGCTTCGCC GACCCGGCCT CGCGGACCGC CCAGGCCGCA GCGCAGGCGA AGGCGGATGG TGACGTCGAG GCGGAGCGGG TCCTCTCCCG GCTCGCCAGC ACGCCGCAAG GCATCTGGCT GACGCCCGAG GAGTACCCGC CCGGCTCGGT CGCGCCGTTG GTCGCGCGGG TCGTCCGGGC CGCCGACGCA GCCGGCCAGG TGCCGACGTT CGTCGTGTAC GGCATCCCCG ACCGCGACTG CACCGGCGGC TTCTCCGGCG GTGGCCTGAC CGCGGACCAG TACGGGCCCT GGGTGCAGGA GATCGCGGAC GCCGTCTCGG GCGCCGACCC CTCGGTCCCG GTCGTGGCGG TCGTCGAGCC GGACGCGCTC GCCTCGGCGA TCGCGTGCGA CCGCCGCTCG GAGCGGGTGC GGCTGATCGC CGACGCCGTG ACCCGGCTCG CCGACGCCGA GGTGACGACC TACGTCGACG GCGGCCACTC GCACTGGATC GAGCCGGATC AGCTGGCGAG GCTGCTCGAG CAGGCCGGGA TCGACCAGGC CCGGGGCTTC GCCACCAACG TCTCGAACTA CCAGACCGAC GCCGACGAGC GTGCGTACGG CGAGCAGCTC AGCGCCCTGC TCGACGGAGC CCACTACATC GTCGACACCG GACGGAACGG CAACGGCTCG ACCGAGGATT GGTGCAACCC GACCGGTCGG GCCTACGGCA CCGACCCGGC GCCGGCGCCG GAGGGCGACG CGGAGCACCT CGACGCCTAC GTCTGGGTCA AGCCGCCGGG GGAGAGCGAC GGCGAGTGCG GAGGCGGGCC GCCCGCCGGG CGCTTCTGGC GTGAGCGGGC GCTGGAGATG GCCGTCTCGT CCGGGTGGTG A
|
Protein sequence | MRRRPSPTAS SIRPPVLMLV ALVAAVLALG GCSGGGEPDD APSSADPVAD NPYAGRTGFA DPASRTAQAA AQAKADGDVE AERVLSRLAS TPQGIWLTPE EYPPGSVAPL VARVVRAADA AGQVPTFVVY GIPDRDCTGG FSGGGLTADQ YGPWVQEIAD AVSGADPSVP VVAVVEPDAL ASAIACDRRS ERVRLIADAV TRLADAEVTT YVDGGHSHWI EPDQLARLLE QAGIDQARGF ATNVSNYQTD ADERAYGEQL SALLDGAHYI VDTGRNGNGS TEDWCNPTGR AYGTDPAPAP EGDAEHLDAY VWVKPPGESD GECGGGPPAG RFWRERALEM AVSSGW
|
| |