Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_1412 |
Symbol | |
ID | 4597319 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 1493039 |
End bp | 1495894 |
Gene Length | 2856 bp |
Protein Length | 951 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 639776010 |
Product | glycosyl transferase family protein |
Protein accession | YP_922613 |
Protein GI | 119715648 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGGTTCCT CGGTCTCAGC GCTGCTCGTC AGCCACGACG GCGCTCGCTG GCTGCCCTCC GTGATCGAGG GGCTGCAGGC CCAGCGCACG CCGGTCGACG ACGTCGTGGT CGTCGACACG ACCAGCAGGG ACGGCAGCGC CGACCTGCTC CACGACGCGT TCGGGGAGGT CGTCTCGGCG CCGGGCGCCA CGTCGTTCCC GGCCGCGGTG GCCCTGGGAC TCGACGAGCT GCGCCGCCGC GGCTCCACCA GCGAGTGGGT CTGGATCCTC CACGACGACG CCAACCCCGA CCCGGGAGCG CTTGCCGCGC TGCTGGCAGC GGCGGCCGCC GACCCCGGCG TCGACATCCT CGGACCGAAG CTGCGGGAGT GGCCCTCCCT CAAGCGGCTC CTCGAGCTCG GGGTCACCAT CTCCGCCACC GGCCGTCGGC AGACCGGCCT CGAGCGCGGT GAGTACGACC AGGGTCAGCA CGACGAGATC CGCGAGGTCC TCGCCGTCAA CACCGCAGGC ATGCTCGTGC GGCGCGCGGT GCTCGAGGGG CTGGGTGGCC TCGACCCGCA GCTGCCGATC TTCGGCAACG ACATCGACTT CGGCTGGCGG GCCGCCATGG CCGGGCACCG CACGGTCGTC GTCCCGGACG CCGTGGTGTT CCACGCGGAG GCGGCCCATC GCGGCCTGCG GCGCACGCCG CTGACCGGGC GGCACACGCA CTACCAGGAG CGCCGCGCCG CGCTGTACAC CCTGCTGGTC AACGCCCGCC GTCGCTCGCT GCTGTGGCTC ACGATCCGGC TGGCGTTCGG CACCGTGCTG CGGATGATCG GCTTCCTGCT GGTCCGCGCC GTCGGCGAGG CGCTGGACGA CCTGGCCGCG TTGCTGTCGA TCTACACCCA TCCCGGCGAG GTGCGTGCGG CGCGGCGGGC CCGCCGCGCC CGCGGGCCCC TCCTGGAGGA GCGCGCCCGT CCGTTGCTCG CGCCGTGGTG GCTGCCCTAT CGGCACGGCC TGGACTTCCT CGGCGACCTG GTCACGGCGG CGACCAACCA GGCGCAGGAC ATCGCCGAGC GGCGTCGGAT CCCGGCGGCC CAGCACGCGC CCGCGAGCAC CGTCGCCCGA CCGCGCACCG AGGAGGACGA GCTCGCCGAG GACACCGGCG CGGTGGCCCG CTTCCTGACC AACCCGGTGG CCCTCGCGCT CGCCATCGTC GTCCTCGCCG CGCTGGTCGG CGGCCGGGAG GCCCTCGGCA GCGTGGCCGG CGGCGGGCTC TCGCCGACAC CGGCCGGCGC ATCGGACTGG TGGCGCCTCC ACACCGAGAC CTGGCACCAG CTCGGCACCG GGACGGCCGT CCCGGCCCCG CCGTACCTGC TCCCCTTGGC GCTGCTGGCC ACGCTGCTCG GTGGCAGCGC GTCGGCTGCG GTGTCGGTGG TGCTCGTGCT GGCGGTGCCG GTCGGCCTCT GGGGGGCGTG GCGGTTCCTG CGGGTCGTCG GGCGGCTCGT GACCCCGGCC GGCGCACCGC GCCGGGCGCT GCTGTGGGGC TCGGTCACCT GGGCGCTGGT GCCGGTCGTC AGTGGCGGCT GGGGCGACGG GCGCCTCGGC GTCGTGGTGG TCGCGGCGCT GCTGCCCTGG CTCGCCCACG CGGCTCTCGG CTTCGCGGAC CCGGACGCCG ACCGCCGCTG GCGGGCGGCC TGGCGGACCG GGCTGCTCCT GGCGGTGAGT GCCGCCTTCG CGCCCGTGCT CTGGCTGTTC GCGGGGCTGC TCGGCCTGGT GGTCCTCGCT GCGGCGTTCG CGATCGTCCG CGGTACGGGC CGCGACCGGT CGGTCTGGGG GCCGCCGGCC ACCGCGCTCG GTCTGGTCCC GCTGCTGCTC GCCCCCTGGT GGATCCCGGC CATCCAGCGC GGCGCCGCGG AGGCCCTGGT GCTCGACGTC GGCCGGCTGC CCGGACCGGA GGTGGACGGG CTCGGCCTGC TCAGCGGCCG ACTCGGCGAC CTCGGCGCTC CGTGGTGGCT GGGCGTCGTG CTCGCGGTCC TGGCCGTGCT CGCGTTGGTG CCGCGGACGA CGCGGATCCC CGTGCTGGTG TGCTGGGTGG TCGCCGCGGT CGCGGCGGTC CTGGCCGCGG CGCTCGGTGC CGTGACGGTC TCGATGGCGG CGACCTCCGC CGAGGCCGGC CTCGGTGCGC TGGTCGTGGT GCTCCAGGGG GCGCTCGTGG TTGCTGTCAC CACGGGAGCG ATCACCGCCG GCCGGGGGGC CGGGGCGTCG TGGCGCCGGG TCGTCGCTGT GGGGCTCGCG CTGGTGGCCG CCGCCGTGCC GGTCGGTGGG CTCGCCTGGT GGCTCGGCGG AGCCGATCCC GCGATCGCCG ACGGCATCGA GACCGACATC CCCGTGTACA TGGTGCAGAG CTCCGAGAGT GGTGCCTCGC ACGGGATCCT CGTGGTCCGC GGCAGCGTCG AGGACGGCCT GACCTTCACC GTGCGCCGCG GCGACGGCGT GACGCTCGGG GAGGACGAGA TCCTCGACCT CACCGGTGCC GACACGTCGT TCACCGCCCA GGTTCGTGAG CTCGTCTCCC GGCCGACCCC GGGCGTGATC GACCGGATCG CGCAGAGCGG CATCGAGTAC GTCGTTCTGC CCTCGCCCGC CGACGGCGAC GTCGCCGCGG TGCTGGACGC TGCGCCCGGC CTGGTCCAGG CGAGCGCGGA GGACCGCAGC ACGCGGGCCT GGCAGGTGGA CCGCCCGCTC GACTCCTCGG CCCTTGAGGG CCCGACCTCG TGGCTGCGGG TGCTCCTCCT CGTCGTCCAG GGAGCGGCGT TGCTCGTGGT CGCCGTGCTC TGCCTGCCCA CCACCAACCG GAGGCGGTCG TCGTGA
|
Protein sequence | MGSSVSALLV SHDGARWLPS VIEGLQAQRT PVDDVVVVDT TSRDGSADLL HDAFGEVVSA PGATSFPAAV ALGLDELRRR GSTSEWVWIL HDDANPDPGA LAALLAAAAA DPGVDILGPK LREWPSLKRL LELGVTISAT GRRQTGLERG EYDQGQHDEI REVLAVNTAG MLVRRAVLEG LGGLDPQLPI FGNDIDFGWR AAMAGHRTVV VPDAVVFHAE AAHRGLRRTP LTGRHTHYQE RRAALYTLLV NARRRSLLWL TIRLAFGTVL RMIGFLLVRA VGEALDDLAA LLSIYTHPGE VRAARRARRA RGPLLEERAR PLLAPWWLPY RHGLDFLGDL VTAATNQAQD IAERRRIPAA QHAPASTVAR PRTEEDELAE DTGAVARFLT NPVALALAIV VLAALVGGRE ALGSVAGGGL SPTPAGASDW WRLHTETWHQ LGTGTAVPAP PYLLPLALLA TLLGGSASAA VSVVLVLAVP VGLWGAWRFL RVVGRLVTPA GAPRRALLWG SVTWALVPVV SGGWGDGRLG VVVVAALLPW LAHAALGFAD PDADRRWRAA WRTGLLLAVS AAFAPVLWLF AGLLGLVVLA AAFAIVRGTG RDRSVWGPPA TALGLVPLLL APWWIPAIQR GAAEALVLDV GRLPGPEVDG LGLLSGRLGD LGAPWWLGVV LAVLAVLALV PRTTRIPVLV CWVVAAVAAV LAAALGAVTV SMAATSAEAG LGALVVVLQG ALVVAVTTGA ITAGRGAGAS WRRVVAVGLA LVAAAVPVGG LAWWLGGADP AIADGIETDI PVYMVQSSES GASHGILVVR GSVEDGLTFT VRRGDGVTLG EDEILDLTGA DTSFTAQVRE LVSRPTPGVI DRIAQSGIEY VVLPSPADGD VAAVLDAAPG LVQASAEDRS TRAWQVDRPL DSSALEGPTS WLRVLLLVVQ GAALLVVAVL CLPTTNRRRS S
|
| |