Gene Information |
Locus tag | Noca_1399 |
Symbol | |
ID | 4595872 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 1477851 |
End bp | 1478798 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 639775997 |
Product | glycosyl transferase family protein |
Protein accession | YP_922600 |
Protein GI | 119715635 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
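The coordinate and length fields above are mutually consistent, and the arithmetic can be checked directly. The sketch below assumes the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon. Both assumptions match the numbers in this record, but the record itself does not state them. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1477851
end_bp = 1478798

# Assumption: coordinates are 1-based and inclusive, hence the +1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes its stop codon, which encodes no residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 948 315
```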
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCCAGC CGTCCCAGCA CCGCCGCGTG GTGGCGGTCG TGGTCACGTT CAACCGGCTC GCGCTGCTGC AGCGGCTGGT CGAGCGCCTC GACCAGGTGC CGGAGCTGAG CGAGATCCTC GTGGTCGACA ACGCCTCGAC GGACGGGACG GGTGAGTGGC TCGCCGGCCT CGACGATCGG GATCCCGGCG ACGAGCGGCC GACCACCCCG GTGCTGGGCC GCACCCTGGC CGAGAACGGC GGCGGCGCCC GCGGGTTCCA CGACGGGCTC GCGTGGGCGA TGGAGCGTGG CGCCGACCTC GCGTGGCTGA TGGACGACGA TGGGCTGCCC GAGCCGGACT GCCTGTCCCG GCTCCTGGTC GAGGAGGACC TGGACTTCTG GGGACCGGCC GTCGTCGACC AGGACCGCCC GGACCGGCTG GTCTTCCCGA TCCGGCTGCC GGGGGGCACC CGCGTGGCGC ACGACCTCCC CTACGTCGAG CGCGCGGCGA TCGGTGGCCG GATCGACGGC ATCGTGATCC CGTTCAACGG CGTGCTCGTC ACCCGTGACC TCGTCGAGCG GATCGGCCTG CCGCGCGCCG AGTACTTCAT CTGGGGCGAC GACCACGAGT ACCGCCTGCG TGCCGAGGCG GCCGGGGCCC GGATCGCCAC CGTCGTGGGC GCGCGCGTGC TCCACCCCTC GGTCGGGAGC CTCGGCACCC CGATGATGTT CGGGCGGACC ACGTACAACC ACAGCCCGAG CGACCTCAAG CACTACTGCA TGGCCCGCAA CAACCTGCTG AACCTGCGCG AGTACCGCGG CTGGCCGCAC GCGCTCGCAT TCGTGGCGAA GACCGCGTGG TTCTACACGT TCACCCGTCC CGACCCGCGT CGGGTGGCGC TCAGCGCCCG TGCCATGTAC GCCGGGCTCC GCGGCGACTT CACCGGTCAT CGGAGGTTCC TGCGATGA
|
Protein sequence | MSQPSQHRRV VAVVVTFNRL ALLQRLVERL DQVPELSEIL VVDNASTDGT GEWLAGLDDR DPGDERPTTP VLGRTLAENG GGARGFHDGL AWAMERGADL AWLMDDDGLP EPDCLSRLLV EEDLDFWGPA VVDQDRPDRL VFPIRLPGGT RVAHDLPYVE RAAIGGRIDG IVIPFNGVLV TRDLVERIGL PRAEYFIWGD DHEYRLRAEA AGARIATVVG ARVLHPSVGS LGTPMMFGRT TYNHSPSDLK HYCMARNNLL NLREYRGWPH ALAFVAKTAW FYTFTRPDPR RVALSARAMY AGLRGDFTGH RRFLR
|
| |
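The gene sequence begins with GTG, while the protein sequence begins with M. Under translation table 11 (bacterial), GTG can act as an initiation codon and is then read as methionine, whereas an internal GTG encodes valine. The minimal sketch below illustrates this on the first 30 and last 18 bases of the gene sequence. The codon table is deliberately partial (only the codons in those two excerpts), the start-codon set is a subset of the table 11 initiators, and the function names are hypothetical helpers, not part of any database API. A `gc_fraction` helper is included for checking the GC content field against the full sequence; no result for the full sequence is claimed here.

```python
# Partial codon table: only the codons occurring in the two excerpts below.
# A complete implementation would carry all 64 codons of translation table 11.
CODONS = {
    "GTG": "V", "AGC": "S", "CAG": "Q", "CCG": "P", "TCC": "S",
    "CAC": "H", "CGC": "R", "CGG": "R", "AGG": "R", "TTC": "F",
    "CTG": "L", "CGA": "R", "TGA": "*",
}

# Assumption: a subset of the initiation codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq):
    """Remove the spacing between 10-base groups and normalise case."""
    return "".join(seq.split()).upper()


def translate(seq, is_cds_start=False):
    """Translate in frame 0; an initiator codon at position 0 is read as M."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and is_cds_start and codon in START_CODONS:
            residues.append("M")
        else:
            residues.append(CODONS[codon])
    return "".join(residues)


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


first_30 = "GTGAGCCAGC CGTCCCAGCA CCGCCGCGTG"  # start of the gene sequence
last_18 = "CGGAGGTTCC TGCGATGA"                # end of the gene sequence

print(translate(first_30, is_cds_start=True))  # MSQPSQHRRV
print(translate(last_18))                      # RRFLR*
```

The two outputs match the first ten and last five residues of the protein sequence above. Note that the tenth codon, also GTG, is read as V because it is not at the start of the CDS.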