Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_0723 |
Symbol | |
ID | 4599827 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 769096 |
End bp | 770184 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639775324 |
Product | glycoside hydrolase family protein |
Protein accession | YP_921935 |
Protein GI | 119714970 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGCCG GCGTCCCCGC GAACACCGGG GCCGCACATC CCGTGGTCGA CGGCAACCGG ATCATCGACG CCCGCAGCGG TCGCGACTTC GTGCCGCGTG GGGTCAACTG GAGCAGCTTC GAGTACGCCT GCGCCCAGGG CTGGGGCATG TCGGCGCTCG ACAACCTGGT CGCCCGGGAC CCGGCCACGA CGGAGGCGAA GGAGATCGCC GGCTGGGGTG CGAACACGGT CCGGCTCCCC CTCAACCAGG ACTGTTGGCT CGGCACCCGC GGAGCACCGG TCAGCGACCA GTACGAGGAA CGCTCGGTGG CCGGCTACCG CAGCGACGTG CACGCGTTCG TGACCGCGCT CAACCGGGCC GGCATCGTGG TCGTGCTGGA CCTGCACAGC CGCAAGCGGA TCGGGCAGCC GGAGTTCGGC AACCTGGCGA TGCCGGACTC CGAGTCGATC GCGTTCTGGA CCTCGGTCGC CACCGAGTAC GCCGACAACC CGTCGGTGCT GTTCGACGCC TTCAACGAGC CCTACTCCCG GTACAGCTCC AGCGGTGCCC GGCTGTTGTT CGGGCTCACC TGGCGGTGCT GGCGGGACGG CGGTTGCCAG GCGCCGGTCG AGGATGACCA GACCGCGACC CTCGGACAGG TCACCTATCC CGTCCAGGGC ATGGCCGCCG TGGTCAACGC CATCCGGGAC GCCGGTGCCG AGCAGCCGAT CCTGCTCGGC GGCCTGGACT ACGCCAACGA CATCAGCCAC TGGCTGGAGT TCGCGCCGGA CGACGACCAG CTGGTGGCGG CGTTCCACTC CTACGACTTC AAGGCGTGCG GCGACCCGGA CTGCTGGAAC GACGTCATCG CCCCGGTCGC CGAGACCGTG CCCGTGCTGA CCTCCGAGCT CGGGGCCCGC CATCCCGAGA ACGGGTACGT CGGGCGCTAC CTCGACTGGG CCGACGAGCA CAACCTGGGT GTGCTGTTCT GGGTCTGGGC GGACCATCCC GGCGACCCGA TGGCACTGGT CACCGACGAG CGCGGTCGGC CGACGGCGTA CGGCCGGGCT GCGCGCACCT GGCTGACCGG GCACCTGCCG GCACGGTGA
|
Protein sequence | MAAGVPANTG AAHPVVDGNR IIDARSGRDF VPRGVNWSSF EYACAQGWGM SALDNLVARD PATTEAKEIA GWGANTVRLP LNQDCWLGTR GAPVSDQYEE RSVAGYRSDV HAFVTALNRA GIVVVLDLHS RKRIGQPEFG NLAMPDSESI AFWTSVATEY ADNPSVLFDA FNEPYSRYSS SGARLLFGLT WRCWRDGGCQ APVEDDQTAT LGQVTYPVQG MAAVVNAIRD AGAEQPILLG GLDYANDISH WLEFAPDDDQ LVAAFHSYDF KACGDPDCWN DVIAPVAETV PVLTSELGAR HPENGYVGRY LDWADEHNLG VLFWVWADHP GDPMALVTDE RGRPTAYGRA ARTWLTGHLP AR
|
| |