Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_0754 |
Symbol | |
ID | 4599639 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 800416 |
End bp | 801291 |
Gene Length | 876 bp |
Protein Length | 291 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 639775355 |
Product | lytic transglycosylase, catalytic |
Protein accession | YP_921966 |
Protein GI | 119715001 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0791] Cell wall-associated hydrolases (invasion-associated proteins) [COG4623] Predicted soluble lytic transglycosylase fused to an ABC-type amino acid-binding protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.970703 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCCAGG TCGCCCCGTC CAGCGCTGCC GGAGCCCTCG GCACGCCCGG CACGGCGAGC GGGGATGCCG TCGTCGCGGC AGCGAAGAAG TACCTCGGCG TCCCCTACGT GTGGGGCGGC ACCGACCCTG CGAAGGGCCT GGACTGCTCG GGCCTCGTGC AGCGCGCCTA CCGGGACCTC GGCTACGAGC TGCCCCGCGT CTCCTATCAG CAGGCCGAGG CCGGTCGACC CGTGAGCGGC GGCCTCGCCA ACGCACAGCC CGGCGACATC CTCGCCTTCG GCTCTCCGGT CCACCATGTC GGCATCTACG TGGGCGACAA CCAGATGATC GAGGCACCCC GGCCCGGGCT GGACGTGCGG ATCGGCCCGG TCTACGAGAC GCCGACCGCG ATCCGGCGGG TGGTCCCCGA CGCCGGCGCG ATCGCACCGG TCACCCCGGC CGGGGGCGTC GTCGCCGGCC CGGTGGCGGC CGGCGCCGCG GCCCGCCTGG CGCCCGGGAC GCCGTACGCG TCCCTGTTCG AGTCCGCCGC CCAGAAGTAC GGCGTCAGCC CCGCGCTGCT GTCCGCCGTC GCCCGCCAGG AGTCCGGCTA CAACCCGCGC GCCACCAGCC CGGCCGGCGC GCAAGGGCTG ATGCAGCTGA TGCCCGGCAC CGCCTCCGGG CTCGGGGTCG ACAACCCCTA CGACCCGACC CAGGCTGTCG ACGGCGCCGC CCGGCTGCTC AGCAGCCTGC TCGACCGCTT CGGGAGCACG CCGCTCGCGC TCGCCGCCTA CAACGCCGGC CCGGGGGCCG TGCTCCGGTA CGGCGGGGTC CCGCCCTACC CCGAGACCAA GAACTACGTC CGTTCCGTGA TGTCGATGCT GGGAGCCGCC GCATGA
|
Protein sequence | MRQVAPSSAA GALGTPGTAS GDAVVAAAKK YLGVPYVWGG TDPAKGLDCS GLVQRAYRDL GYELPRVSYQ QAEAGRPVSG GLANAQPGDI LAFGSPVHHV GIYVGDNQMI EAPRPGLDVR IGPVYETPTA IRRVVPDAGA IAPVTPAGGV VAGPVAAGAA ARLAPGTPYA SLFESAAQKY GVSPALLSAV ARQESGYNPR ATSPAGAQGL MQLMPGTASG LGVDNPYDPT QAVDGAARLL SSLLDRFGST PLALAAYNAG PGAVLRYGGV PPYPETKNYV RSVMSMLGAA A
|
| |