Gene Information |
Locus tag | Noca_1402 |
Symbol | |
ID | 4595875 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 1480977 |
End bp | 1482956 |
Gene Length | 1980 bp |
Protein Length | 659 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639776000 |
Product | putative glycosyltransferase |
Protein accession | YP_922603 |
Protein GI | 119715638 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
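The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length.

```python
# Values copied from the record above.
START_BP = 1480977
END_BP = 1482956


def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length)                     # 1980, matching "Gene Length"
print(protein_length_aa(length))  # 659, matching "Protein Length"
```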
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGTCA CCCGCCTGCT CCAGCGACAG ATCCTGCCCG TCGACCGCGA CTTCGACGTG CTCGCGCTGT ACGTCGACCC CGAGGACGCC AAGCTCGACG CGGACAAGTA CGAGATCGGC GGCAGCCGCG CGGCCAAGGA CCTCAACAAC GCCGCGATCC GCCAGTCCAC CGCGACCGGC CACACGATCC ACCCCGACCA GATCGAGTCC CGCACCGCGC TGCGGGTCAA GTCGGGCGAC CGGCTCTCGT TCGGCACCTA CTTCAACGCC TTCCCCGCCA GCTACTGGCG CCGCTGGACG ATCGTCAAGG ACGTCACCTT GACGATCACC GTCGCGGGCC GGGGCGCCAC CGTCCTGGTC TACAAGTCGA TGGCCAAGGG CCACTCGCAG CGCGTCGCGT CCGCCGACAC CGGCGCCGAG GGCCGCAGCA CCTTCAGCTT CGACCTCAGC CTCAAGCCGT TCGTGGACGG CGGCTGGTAC TGGTACGACA TCATCGCCGG CGACGACGAC GTGGTGGTCG AGAGCGCCGA GTGGAGCGCC GAGGTGCCCG AGGACCGGGC CGAGCACGGC ACCGTCGACA TCGCGATCAC CACGATGAAC CGTCCCGACT TCTGCGCGAA GCTGCTCGGC CAGCTCGGCG ACGACCAGGA CGTGCGGCCC TACCTCGACA CCGTCTTCGT CATGGAGCAG GGCACCGACA AGGTGGTCGA CTCGCCGGAC TTCGCGAAGG CCCAGGGCGC GCTCGGCGAC CTGCTGCGCG TGATCGAGCA GGGCAACCTC GGCGGCTCCG GCGGCTACGC CCGCGGCCAG CTCGAGTCGG TCCGCAAGGG CACCGCGACG TACACGATGA TGATGGACGA CGACATCGTC TGCGAGCCCG AGGGCGTGAT CCGGGCGATC ACCTTCGCCG ACCTGGCCCG CCGCCCCACC ATCGTCGGCG GCCACATGTT CAACATCTAC TCCCGCTCCC GGCTGCACAG CTTCGGCGAG ATCGTCCAGC CGTGGCGGTT CTGGTGGCAG TCGCCGCTGG ACACCTACAG CGACTGGGAC CTCGCCGGGC GCAACCTGCG CTCGAGCCGG TGGCTGCACA AGCGCATCGA CGTGGACTTC AACGGCTGGT TCATGTGCCT GGTACCGCGG CAGGTGCTCG AGGAGATCGG GCTCTCGCTG CCGCTGTTCA TCAAGTGGGA TGACTCCGAG TTCGGGCTGC GCGCCAAGGA GGCCGGCTAC CCCACGGTGA CCTTCCCCGG CGCGGCGGTC TGGCACGTGC CGTGGACCGA CAAGAACGAC GGGTTGGACT GGCAGGCCTA CTTCCACCAG CGCAACCGGT TCGTCGCCGC GCTGCTGCAC TCGCCGTACC CCAAGGGGGG TCGGATGGTG CGGGAGAGCC GCAACCACCA GATCTCCCAC TTGGTCTCGA TGCAGTACTC CACGGTCCAG ATCCGCCACC AGGCGCTGCT CGACGTGCTG GCCGGGCCGG ACAAGCTGCA CGAGATGCTC CCGACCCGCC TCGCCGAGAT CAACGCGATG CGCAAGCAGT ACACCGACGC CCAGCTCGAG GCGGACCCGG ACGCGTTCCC GCCGATCCGG CGCAAGAAGC CGCCGCGCAA GGGGCGCGAC GGCAGCGAGA TCCCCGGGCG CCTCTCCCAG CTGGTCAGCG CCGGCCTCCA GCCGCTGCGC CAGCTCAAGC CCCCGCGCGA GCTCGCCCAG GAGCACCCCG AGGCCGAGAT CCGCGCGATG GACGCCAAGT GGTACCGGTT GGCGTCGTAC GACTCCGCGA TCGTCTCGAT GAACGACGGC 
GCCTCCGCGG CGTTCTACCG GCGCGACCCC CAGCTGTTCC GCGAGCTGAT GGTCAAGACC ATCGAGATCC ACGAGCGGCT CAAGCGCGAG TGGCCGCGGC TGGCCGAGGA GTACCGCGCC AAGCTCGGGG AGGTCACCTC GCCGGAGGCA TGGGAGGAGA CCTTCCGGCC GTGGACGTGA
|
Protein sequence | MTVTRLLQRQ ILPVDRDFDV LALYVDPEDA KLDADKYEIG GSRAAKDLNN AAIRQSTATG HTIHPDQIES RTALRVKSGD RLSFGTYFNA FPASYWRRWT IVKDVTLTIT VAGRGATVLV YKSMAKGHSQ RVASADTGAE GRSTFSFDLS LKPFVDGGWY WYDIIAGDDD VVVESAEWSA EVPEDRAEHG TVDIAITTMN RPDFCAKLLG QLGDDQDVRP YLDTVFVMEQ GTDKVVDSPD FAKAQGALGD LLRVIEQGNL GGSGGYARGQ LESVRKGTAT YTMMMDDDIV CEPEGVIRAI TFADLARRPT IVGGHMFNIY SRSRLHSFGE IVQPWRFWWQ SPLDTYSDWD LAGRNLRSSR WLHKRIDVDF NGWFMCLVPR QVLEEIGLSL PLFIKWDDSE FGLRAKEAGY PTVTFPGAAV WHVPWTDKND GLDWQAYFHQ RNRFVAALLH SPYPKGGRMV RESRNHQISH LVSMQYSTVQ IRHQALLDVL AGPDKLHEML PTRLAEINAM RKQYTDAQLE ADPDAFPPIR RKKPPRKGRD GSEIPGRLSQ LVSAGLQPLR QLKPPRELAQ EHPEAEIRAM DAKWYRLASY DSAIVSMNDG ASAAFYRRDP QLFRELMVKT IEIHERLKRE WPRLAEEYRA KLGEVTSPEA WEETFRPWT
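The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the database's own method: the names `CODON_TABLE` and `translate` are hypothetical, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in which codons may act as starts, which this sketch does not model). Whitespace in the input is ignored, so the 10-base blocks shown above can be pasted in directly.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout of genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 18 bases of the gene sequence above.
print(translate("ATGACAGTCA CCCGCCTGCT CCAGCGACAG"))  # MTVTRLLQRQ
print(translate("TTCCGGCCGTGGACGTGA"))                # FRPWT
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing spaces) is one way to check the record for internal consistency.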