Gene Information |
Locus tag | Noca_4584 |
Symbol | |
ID | 4598682 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 4853655 |
End bp | 4855295 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | 639779193 |
Product | glycosyl transferase family protein |
Protein accession | YP_925766 |
Protein GI | 119718801 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
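
The coordinate and length fields above agree with each other, which a short sketch can confirm. The function names here are illustrative and not part of any database API. The check assumes 1-based inclusive coordinates and an unspliced CDS ending in a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed in the record above.
length_bp = gene_length(4853655, 4855295)
print(length_bp)                  # 1641
print(protein_length(length_bp))  # 546
```

Both values match the Gene Length and Protein Length fields listed in the record.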
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCGCCG ACCTCGCCGC ACCGGCCCTG GTGCTCGTCG GCCCCGGGGT CGACCTGGCG ACCGGCTCCG GCCCGGTGCC CGAGGGCGTG ACCCGCGTGG CGCTCGGGGA CGAGGTCCCG GACGGGGTCT GGGCCAGCGT GCTCCTGGCG GTCGCTGACG AGGCCGCGCT GCGGGTGGCC GTCTCCCGGC TGCCCCGCCT CGGCCGGACC AAGCACGTCG GGTGCCACCT GGCGTCGGCG ACCGGCCCGG TGACCACGGT GCTCCGTCCG GAGTGGCCGC CGCTGGCCAA CCTGGTCGCG GAGACCGCGG ACGGCGGCGC GCACACCCGG CTGACCTTCC GGCGACCCGC CCCGGCCGGG CCGGTGCTCG TCGAGCTCGC ACGGCACACC GGCACCCGGG TCGTGACCGG CAACCACGGG CTGGTCGTCA CCGGTGCCCC GGTGCCGGTG GACCCGACCG CCGTCACCGA CGAGGTCCCG GCCGCGGTGG TCGTCGGCGG CCGCGCCGAC CTGCCCGAGC ACCCGGTCCT GGGCCGGGCG CCGGTCGCGC TGCACCTGCG CGGCGAGCCG GCCGGCCCGC TCGACGAAGC CCCCCTCGAT GAGGCCCTGC TCAACCCGGT CGGCTTCCGC CGCGACTGGG ACCGCGGGCC GGTCCCGCTG CCCGCGGGCG ACCCGACCCC GGAGCTGGTC GCCGCCGTCC GCGACGCCCA GGCCGTCGTG GTGCCCGACG ACGCAGATCC CCGCACGGTG GCCGGGCTGG CGATGGCCGG TGTGCCGCTG CTCGGCGCTG GCTACCCGGG CATCGACCCG GTCGACCCGG CCGACCTGGA CGACCCGCTG CGCCGCGAGG AGCACTCGGT CCGGCTGCGC CGGGCCGCCC TGCGCGAGCA CTCGCACCTG GCCTGGCGGC ACCGGGTCGC CCGGCGCGCC GGGCTGCGGG CAGCGGCGTA CCCCTCGGTC AGCGTGCTGT TGCCGACCCG CCGACCCGAG CAGCTCGCCT TCGCGCTCGA CCAGGTCGCC CGCCAGCGCG GCGTCGCCGA GCTCGAGCTG GTGCTGGCCA CCCACGGGTT CGAGGCCGAC CCCGGGCTGG TGCGCGAGCG GCTGGGCGAG CGGCCGGTCA CGCTGCTGCC GCTGCCGGCC GACACGGTCT TCGGTGACGT GCTCCGCGCC GCGACCGACG CCGCCACGGG CGACGTGGTC GTGAAGATGG ACGACGACGA CTGGTACGGG CCGGACCTCC TCGCCGACCT GCTGCTCGCC AAGCACTACT CCGGCGCGGA CCTGGTCGGC ACGCCCGCGG AGCTGGTCTA CCTGGAGCCG ATCCGCACCA CCGTGCGCCG CCGCGGCCCG AGCGAGAGCT TCGGCGCCGT GGTCGCCGGC GGCACGATGA CCCTCGACCG GGCGCTGCTG CGCGCGGTGG GCGGGTTCCG CAGCGTCCCG CGGCACGTCG ACGCCCGGCT CCTCGAGGAC GTCCGCGCCG CGGGGGGCAG CGTGTACCGG ACCCAGGGCC TCGGGTACGT CCTGCGGCGT ACGACGCACG GCCACACCTG GGACACCGGA CTCGGCTACT TCCTGACCCG GCAGAGCGTC GCCGCGCAGT GGCGCGGCTT CCGGCCCAGC AGACTCCTGG AGGCAGGGTG A
|
Protein sequence | MLADLAAPAL VLVGPGVDLA TGSGPVPEGV TRVALGDEVP DGVWASVLLA VADEAALRVA VSRLPRLGRT KHVGCHLASA TGPVTTVLRP EWPPLANLVA ETADGGAHTR LTFRRPAPAG PVLVELARHT GTRVVTGNHG LVVTGAPVPV DPTAVTDEVP AAVVVGGRAD LPEHPVLGRA PVALHLRGEP AGPLDEAPLD EALLNPVGFR RDWDRGPVPL PAGDPTPELV AAVRDAQAVV VPDDADPRTV AGLAMAGVPL LGAGYPGIDP VDPADLDDPL RREEHSVRLR RAALREHSHL AWRHRVARRA GLRAAAYPSV SVLLPTRRPE QLAFALDQVA RQRGVAELEL VLATHGFEAD PGLVRERLGE RPVTLLPLPA DTVFGDVLRA ATDAATGDVV VKMDDDDWYG PDLLADLLLA KHYSGADLVG TPAELVYLEP IRTTVRRRGP SESFGAVVAG GTMTLDRALL RAVGGFRSVP RHVDARLLED VRAAGGSVYR TQGLGYVLRR TTHGHTWDTG LGYFLTRQSV AAQWRGFRPS RLLEAG
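
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch, using hypothetical helper names, of how the blocks might be cleaned and how a GC fraction could be computed. It is not the method the database uses for its GC content field.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence above, used only as a short demo input.
demo = clean_sequence("ATGCTCGCCG ACCTCGCCGC")
print(len(demo), gc_fraction(demo))
```

Passing the full gene sequence through `clean_sequence` should yield a string whose length equals the Gene Length field.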