Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_5074 |
Symbol | |
ID | 8336428 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 5826215 |
End bp | 5827471 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 644958173 |
Product | UDP-N-acetylglucosamine |
Protein accession | YP_003115775 |
Protein GI | 256394211 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | [TIGR03449] UDP-N-acetylglucosamine: 1L-myo-inositol-1-phosphate 1-alpha-D-N-acetylglucosaminyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.838773 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.526532 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGCGC GACGGGGATC CGCGCCGCAC CGGATCGCGA CGATCAGCGT GCTGACATCG CCGCTGGCGC AGCCGGGCGG AGGCGACGCC GGAGGGCTCA ACGTCTATGT GGTCGAGACC GCCCGACGCT TCGCCGAGAC TGGCGTCCAG GTGGACATCT TCACCCGGGC CGCCGCTCCG CGCCTGCCGC CGATCGTCGA ACTCTGCGAC GGCGTCGTGG TGCGGCACGT ACCGGCCGGT CCGCCGCGCG AGGTGGACAA GGGCGCTTTG CCCAGAGTCC TCGGCGAGTT CACCGCCGGC ATGCTGCGCG CCCCGGGGGA CTACGACGTG GTGCACGCCC ACCACTGGCT CTCCGGGCGC GTCGGCGCTC TGGTGGCCCG CTCCCGGGGT GTTCCGCTCG TCCAGTCGAT GCACTCGCTC GGGCTGGTGA AGAACGCGGT CCTGCCCGGC GAGGACGGAT CGGCGCCGCC GGCTCAGATA GCGGGAGAGA GCGCGGTGAT CGCCGCCGCC GACCGCCTCG TCGCCAACAC CGCGCAGGAG GCGGATCAGC TCATCGCGCT CTACGGCGCG GCTCCCGAAC GCGTGCACAC CGTGCATCCC GGCGTGGATC TGGAGCTGTT CCGGCCGGGT GATCGAGACC AAGCCCGAGC CCGGCTCGGT CTTCCTCATG ACGCCTTCGT CCTGCTGTTC GCCGGGCGCG TGCAGCGGCT CAAAGGTCCT GACATCCTCA TGCGCGCCGC CGCGCAGTTG CTGCATGCGG ACCTTGACCT TGCTCAGCGC CTCGTGGTGG CCTTCGTCGG CGGTCCGAGC GGTGAATTGC AAGCAGACCC AGACCAGCTC ACGAAGCTCG CGACGGATCT GGGGATCGGC GAGCAGGTGC GCGTGGAACC ACCGTGTCCG CATCCGGAAC TCGCCGACTG GTATCGCGCC GCGACCCTCG TCGTCGTCCC GTCGCGGGCT GAGACCTTCG GTCTGGTGGC GGTGGAGGCG CAGGCTTGCG GGACGCCGGT GGTCGCCGCG GCGGTCGGCG GTTTGCAGAC CGCCGTGCGA GCAGGGGTCT CAGGAGTCCT GGTGGAAGGA CACGATCCGG CGCGGTACGC GGAGGTGATC AGAGCTCTGA TCGACGATCC GGCGCGGCTG ACGGCGTTGC GGGCGGGCGC GTTGCAGCAC GCCGCCGGAT TCGGCTGGAG CGAGGCCGTG GACCGGCTGC TCGCGGTCTA CCGGTCCGCT ATCGAAGGCG GTCGCGGACG GCCGTGA
|
Protein sequence | MTARRGSAPH RIATISVLTS PLAQPGGGDA GGLNVYVVET ARRFAETGVQ VDIFTRAAAP RLPPIVELCD GVVVRHVPAG PPREVDKGAL PRVLGEFTAG MLRAPGDYDV VHAHHWLSGR VGALVARSRG VPLVQSMHSL GLVKNAVLPG EDGSAPPAQI AGESAVIAAA DRLVANTAQE ADQLIALYGA APERVHTVHP GVDLELFRPG DRDQARARLG LPHDAFVLLF AGRVQRLKGP DILMRAAAQL LHADLDLAQR LVVAFVGGPS GELQADPDQL TKLATDLGIG EQVRVEPPCP HPELADWYRA ATLVVVPSRA ETFGLVAVEA QACGTPVVAA AVGGLQTAVR AGVSGVLVEG HDPARYAEVI RALIDDPARL TALRAGALQH AAGFGWSEAV DRLLAVYRSA IEGGRGRP
|
| |