Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_4050 |
Symbol | |
ID | 4596564 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 4273477 |
End bp | 4274853 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 639778656 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_925234 |
Protein GI | 119718269 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | [TIGR03449] UDP-N-acetylglucosamine: 1L-myo-inositol-1-phosphate 1-alpha-D-N-acetylglucosaminyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGCTG ACCGGCCGGG GCACCGCTCG AGGGGAATCA ATCCGGGCCC CGGCATGTTC ACCTTGGTTG GACCCGACGA GCGCGACGAC CCCGAGGTGG CAGTGGACCC GATCCGGCGT GTGGCGATGA TCAGCCTGCA CACCTCGCCC CTCGACCAGC CCGGGACGGG TGATGCGGGT GGGATGAACG TCTACGTCAT CGAGCTGTCC AAGCGACTGG CCGCCCAGGG CATCGCCGTC GACATCTTCA CCCGGGCCAC CACCTCGGCG GTCGAGCCGC TGGTCGAGGC GTACGACGGG GTGCAGGTCC GGCACATCCA CGCCGGGCCG TTCGAGGGGC TCACCAAGGC CGAGCTGCCC GGCCAGCTCT GCGTCTTCGC CCGCGAGGTG CTGCGCGCCG AGGCGGCCCA GCCGGTCGGG CACTACGACG TCGTGCACTC CCACTACTGG CTCTCCGGGC AGGTCGGCGC GCTGGCCCGC GACCGGTGGG GTGTGCCGCT GGTGCACTCC ATGCACACGA TGGCGAAGGT CAAGAACGAC GCGCTCGCCG AGGGCGACAC CCCCGAGCCC GCGGCCCGCA TCATCGGCGA GGAGCAGGTC GTCGAGGCCG CCGACATGCT GGTCGCCAAC ACCGACATCG AGGCCAAGCA GCTGGTCAAC ATGTACGACG CCGACCCCAG CCGGGTCGAG GTCGTCCACC CCGGCGTCGA CCTCGGGGTG TTCCGGCCCC AGGACCGCTC GACCGCCCGG GCCCGGCTCG GCCTGCCGGA GGACGCCGCG GTGCTGCTGT TCGCCGGCCG GATCCAGCCG CTCAAGGCCC CGGACGTGCT GCTGCGCGCC GTCGCCGAGC TGCTCGCCCA GACCCCGGAG CTGCGCTCAC GGCTGGTCGT CCCGATCGTC GGCGGGCCCT CGGGCTCCGG GCTCGAGCAC CCCGAGTCGC TGGCCCAGCT GGCGAGCGAG CTCGGGCTCG ACGGCGCCGG CGGCACCGGC CCCGTGGTGC GCTTCGTGCC TCCGGTCTCC CAGGAGGAGC TGGCCCGCTG GTGCGCGGCC GCGACCCTGG TCGCGGTGCC GTCGTACAAC GAGTCCTTCG GGCTGGTCGC GGCCGAGGCC CAGGCCACCG GCACCCCGGT CGTCGCCGCC GCGGTCGGCG GCCTCACGAC GGTCGTGCGC GACGGCCGCA GCGGCCTGCT CGTCGACACC CACGACCCGC GGGACTGGGC CGACGCGCTG CGCCGCGTGG TCGAGAACGA CGCGTTCCGC GACCGGTTGG CCGCGGGGGC GCTCGAGCAG GCGCGGCTGT TCTCCTGGGA GCACACCGCG CGGCAGACCC TCGACGTCTA CCGGCGGGCG CGGGCCGAGA TCCGGGAGGC CGTGTGA
|
Protein sequence | MRADRPGHRS RGINPGPGMF TLVGPDERDD PEVAVDPIRR VAMISLHTSP LDQPGTGDAG GMNVYVIELS KRLAAQGIAV DIFTRATTSA VEPLVEAYDG VQVRHIHAGP FEGLTKAELP GQLCVFAREV LRAEAAQPVG HYDVVHSHYW LSGQVGALAR DRWGVPLVHS MHTMAKVKND ALAEGDTPEP AARIIGEEQV VEAADMLVAN TDIEAKQLVN MYDADPSRVE VVHPGVDLGV FRPQDRSTAR ARLGLPEDAA VLLFAGRIQP LKAPDVLLRA VAELLAQTPE LRSRLVVPIV GGPSGSGLEH PESLAQLASE LGLDGAGGTG PVVRFVPPVS QEELARWCAA ATLVAVPSYN ESFGLVAAEA QATGTPVVAA AVGGLTTVVR DGRSGLLVDT HDPRDWADAL RRVVENDAFR DRLAAGALEQ ARLFSWEHTA RQTLDVYRRA RAEIREAV
|
| |