Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_1157 |
Symbol | |
ID | 4599332 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 1231160 |
End bp | 1232068 |
Gene Length | 909 bp |
Protein Length | 302 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 639775753 |
Product | LmbE family protein |
Protein accession | YP_922360 |
Protein GI | 119715395 |
COG category | [S] Function unknown |
COG ID | [COG2120] Uncharacterized proteins, LmbE homologs |
TIGRFAM ID | [TIGR03445] 1D-myo-inosityl-2-acetamido-2-deoxy-alpha-D-glucopyranoside deacetylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.103463 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGTTCG ACCAGCGGCT CCTCCTCGTG CACGCCCACC CCGACGACGA GTCGATCGGC CAGGGCGCCA CGATGGCGAA GTACGCCGCC GAGGGCCGCG GCGTCACGCT CGTGACCTGC ACCGGCGGCG AGATGGGGGA GATCCTCGTC CCCGAGCTCA CCCACCTGGC CGCCGACCAG GAGGACCGGC TCGGCGAGCA CCGGCGCGGC GAGCTCGACG CGGCGATGGC CGAGCTGGGG GTCACCGACC ACCGCTACCT CGGCGGCTTC GGCACCTACC GCGACAGCGG GATGAAGTGG CACGAGGACG GGCATGCCGT CCCGGCCGAC GACATCCATC AGAACGCGTT CTGGCACGCC GACCTCACCG AGGCCGCCGA CCACCTGGTC GCGGTGATCC GCGAGGTGCG GCCCCAGGTC CTGGTCACCT ACGACCAGTT CGGCGGCTAC GGCCATCCCG ACCACATCCA GGCGCACCGG GTCGCGACGT ACGCCGCCGC TCTGGCCGCG GTCCCGTCGT ACCGCAAGGA CCTCGGCGCC GCTTGGGACA TCGCGAAGAT CTACTGGGGC GCGATGTCGG AGAGCCGGAT GCGCGCGGCC CTGCGGGCGC TGCGCGAGGC CGGCGACACC ACGGCGTTCG AGGGCATGGA CCCCGACGGC CCGCTGCCGC CGTTCGTCAC CGCCGACGAG GACCTGTCCG CGGTGGTCGA CGCCCAGGAG CACGTCGAGG CGAAGCTGGC AGCGATGCGC GCGCACGCCA CCCAGATCAC CACCGACGGG CCCTTCTTCG CGCTGTCCAA CAACGTGGGC AACGTCGCCT GGGGGCTGGA GTTCTTCCGC CTCGCCAAGG GCGAGCGCGG CGAGCTGAAC CAGGACGGTC TCGAGACCGA CCTGTTCGCC GGGCTCTGA
|
Protein sequence | MTFDQRLLLV HAHPDDESIG QGATMAKYAA EGRGVTLVTC TGGEMGEILV PELTHLAADQ EDRLGEHRRG ELDAAMAELG VTDHRYLGGF GTYRDSGMKW HEDGHAVPAD DIHQNAFWHA DLTEAADHLV AVIREVRPQV LVTYDQFGGY GHPDHIQAHR VATYAAALAA VPSYRKDLGA AWDIAKIYWG AMSESRMRAA LRALREAGDT TAFEGMDPDG PLPPFVTADE DLSAVVDAQE HVEAKLAAMR AHATQITTDG PFFALSNNVG NVAWGLEFFR LAKGERGELN QDGLETDLFA GL
|
| |