Gene Information |
Locus tag | Namu_3524 |
Symbol | |
ID | 8449143 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 3870571 |
End bp | 3871983 |
Gene Length | 1413 bp |
Protein Length | 470 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645042602 |
Product | exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase |
Protein accession | YP_003202838 |
Protein GI | 258653682 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2148] Sugar transferases involved in lipopolysaccharide synthesis |
TIGRFAM ID | [TIGR03023] Undecaprenyl-phosphate glucose phosphotransferase; [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase |
|
|
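The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. The record does not state either convention, although its own numbers agree with both. The function names are hypothetical and chosen for illustration only.

```python
# Values copied from the Gene Information table above.
START_BP = 3870571
END_BP = 3871983


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a complete, unspliced CDS: codons minus one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(f"gene length: {length_bp} bp, protein length: {length_aa} aa")
```

Under these assumptions the computed values agree with the reported Gene Length (1413 bp) and Protein Length (470 aa).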
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0000171122 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.258747 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGCAT TGCGCCCGGC ACCGGTCACG CGTGACACGA ATGCCAGCAA GAAACCGTCG GAACGACGAG GGTCGCGCAC GCGTCGCTTC CTCTGGTTGA TCCGAGCCTG GATGATCGTC CTGCCGGTCG ACGCGATCAT GCTGCTCATG CCCCTGCTGT GGGCGCCGGA CGAGTACGTC GCGATCATCT CGATGACGGC GTTGTCGCTG TTCCTGATCG TCGACCGCGG TCGCTATCAC GCGCGGTTGC ACGTCAGCGT GCTGGACGAG CTGCCCAGCC TGGTCGCCAG CCTGCTGACC GCGGCGGCCA TCGTGGCGAC CGGGATCGCC CTGCTGCCCG AGGGTCCCAT GGTCACCGCG TTCATGAAGA ACGCGGCGAT CGCCATCGTG TTGGTGGTCA TCGGCCGGAT CATCACCTCG CAGCTGATCA TCTTCAGCCG CCGCCGCCGT TACAGCCACC ACCGGACGGT GGTGATCGGG GGCGGCCAGA CCGCCGCCGA GCTCATCCGG ATCCTCAAAC AGCATCAGCG GTACGGACTG TCGGTCGTCG GCTTCGTCGA CGACAAGGAC GCCGCGGCCA GCATGGTCAG CGAACGGCTG GGCAACATCA GCGATCTGGA CTCGGTGGTC ACCCGCTACC GGGTGGACGT GCTGCTGGTG GCCGATTCCG CGGTGCCCGA GCGCACGCTG GTGGAGGAGG TCCGGGCCCC GGCCAGTTCC ACCTGCGACC TGCTGGTGGT GCCCCGGATG CACCAGTTCC GCACCGTGTC CGGCGGGTCG GACCACATCG GTTCGATCCC GATCATGCGC ATCGGCAACC CGCGACTGAG CGGGCCGGCC ATCACCGTCA AGCGGGTCTT CGACATCCTC GTCTCCGGCA CCGCGCTGAT TCTGGTCTCG CCGATCCTGG CCGTCTGCGC GCTGGCCGTG CGCATCGACG GCGGGCCGGG CGTGATCTTC CGCCAACCCC GGGTCGGCCG GAACGGGGAG CTGTTCGACT GCCTCAAGCT GCGGTCGATG CGGCCGGCCA CCAGTGCCGA ATCGGCCACC AACTGGTCGA TCGCCACCGA CAACCGGGTC AGCAAGGTCG GCCGGTTCCT GCGCCGCACC TCGCTGGACG AATTGCCGCA GCTGTGGAAC ATTCTGCGCG GCGACATGTC TCTGGTCGGG CCGCGTCCCG AGCGGCCGCA CTTCGTCGAG CAGTTCTCCG ACAAGTACGA CCGCTACGCC TACCGGCACC GGGTCAAGGT CGGGCTGACC GGGCTGGCCC AGGTCAGCGG GCTGCGCGGG GACACCTCGA TCGCCGACCG GGCCCGGTAC GACAACTACT ACATCGAGAA CTGGTCGCTC TGGCTCGACG TCAAGATCAT CATCCGGACG TTCTTCGAGG TCGTCTTCGC CCGCGGCCGG TGA
|
Protein sequence | MTALRPAPVT RDTNASKKPS ERRGSRTRRF LWLIRAWMIV LPVDAIMLLM PLLWAPDEYV AIISMTALSL FLIVDRGRYH ARLHVSVLDE LPSLVASLLT AAAIVATGIA LLPEGPMVTA FMKNAAIAIV LVVIGRIITS QLIIFSRRRR YSHHRTVVIG GGQTAAELIR ILKQHQRYGL SVVGFVDDKD AAASMVSERL GNISDLDSVV TRYRVDVLLV ADSAVPERTL VEEVRAPASS TCDLLVVPRM HQFRTVSGGS DHIGSIPIMR IGNPRLSGPA ITVKRVFDIL VSGTALILVS PILAVCALAV RIDGGPGVIF RQPRVGRNGE LFDCLKLRSM RPATSAESAT NWSIATDNRV SKVGRFLRRT SLDELPQLWN ILRGDMSLVG PRPERPHFVE QFSDKYDRYA YRHRVKVGLT GLAQVSGLRG DTSIADRARY DNYYIENWSL WLDVKIIIRT FFEVVFARGR
|
| |
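The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following minimal sketch shows one way to do that and to compute a GC percentage. The helper names are hypothetical, and only the first two blocks of the gene sequence are used as sample input, so this does not re-derive the reported 68% figure for the full gene.

```python
def normalize_sequence(raw: str) -> str:
    """Remove all whitespace between blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already normalized sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the Gene sequence field, copied from the record.
sample = "ATGACCGCAT TGCGCCCGGC"
cleaned = normalize_sequence(sample)
print(cleaned, len(cleaned), gc_percent(cleaned))
```

Pasting the complete Gene sequence field into `sample` would give the GC content of the whole gene, which could then be compared with the value in the table.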