Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_4441 |
Symbol | |
ID | 8450068 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 4925503 |
End bp | 4927287 |
Gene Length | 1785 bp |
Protein Length | 594 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645043488 |
Product | exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase |
Protein accession | YP_003203716 |
Protein GI | 258654560 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2148] Sugar transferases involved in lipopolysaccharide synthesis |
TIGRFAM ID | [TIGR03022] Undecaprenyl-phosphate galactose phosphotransferase, WbaP [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.796924 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.559619 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGTTGA CCGACCCGAC CGGGGGCCGC AAGCTGAGCC TGCCCGGTCG GCGCCATCGC GGTCGTCCGG GGGCGCCCGC CGTGCTCACG GTCGTGCAGG GCAGCGCCGT CCAGCCTTCG GTCGCCAATG AGTTGCCCGA GATCGACGGG GTCGTCATGC GGCTGCCGGC TGGCGGCACC GCGTCGACCA CGACCGGACG GAACGTCGAA CCGGCGGCCG CCGATCGGTC CGCAGCCGGT CTGGCCGAGG ACTGCGAAGA CACGATCGAG TTCAGCATCC GCCAGTTCCG AGACCAGCAG GACGAGGATC TCGAGGTCGG CCGGACGGTC ACGGTCCCGC CGGCCAGCAC CCGAGGGGTG CGACGGACCT GGAGCGATCG CTACGCCCGC AAGCTGTTCC TGACCGACCT GATCTGCCTG ATCTGGGCGG CCTTCGGCGT GCACATCGTC GCGCCGCCGA TCCCCACCCG GATTTCCTCC GAGCCGACCA ACCTGGCGTT CGTCGCCGCC ACCGCGCTGC TCGTCGTGGT CTGGCTGCTC GCCCTGGACT GGTCCGGCAG CCGGGACGGC ACCGTCACCG GGCACGGTCC CACCGAGTAC AAGCGGGTCA TCCAGGCCTC GCTGAGCGTC TTCGGGCTGG CCGCCATCGG ATCGTTCCTG TTCGACCTGG ACATGCCCCG CAGCTACGTC ATCATGATGC TGCCGGCCGG CCTGACGATG CTGCTGGCCA GCCGATACGT GTGGCGCCGC TGGTTGTACC GGCGTCGGGA CAACGGCGCG ATGATGGCTA CGGTGGTTGC CGTGGGCGAC CGTCATGCGG TCGGCGAACT GGTGGCCGAC CTCGCCCGCT CGCCACGCGC CGGCTACCGG GTGGTCGGGG CGTGCGTGAG CCCGGATCCG GCGGCCCCGG ACGCCACCCA CTTCGCCGGG GTGCCCGTGC TCGGCGTGCC GGCCGACGTC GCCGAGGTGG CCACCGAACT GGGGGCCGAT GCGGTCGCGG TGACCGCGTC CGCCAGCTTC GGCCCGAGCG CCGTTCGCCG GCTCAGCTGG GACCTGGAGG GCACGGACAC CGAGCTGATC CTGGCGCCCG CCCTGACCAA CATCGCCGGC CCGCGGGTGC ACACCCAGCC GGTTGCCGGG CTGCCGCTGA TCCACGTCGA CCATCCCACC TATCGGGGTG CGAACCGCAT CGTCAAGCGC GTCTTCGACG TGTTCGGCAG CCTCGCGCTG GTCGTCCTGT TCTCGCCGGT GCTGCTGGCG GTCGCGGTGG CGATCAAGGC CACCAGCAAG GGACCGGTCT TCTTCCGGCA GGACCGGGTG GGGATCAACG GCGAGACCTT CCGGATGATC AAGTTCCGGT CGATGGTGAT CGACGCGGAG TCGCGCCTGG AGACGCTCAA GGCCGAGCAG CGGGACGCCG GCAACCAGGT CCTGTTCAAG ATGAAGAACG ACCCCCGGAT CACCCCGGTC GGCAAGTTCA TCCGCCGGTT CAGCATCGAC GAGCTGCCGC AGCTGTTCAA CGTGGTCGCG GGCTCCATGA GCCTGGTCGG TCCGCGCCCG CCGCTGCGTT CCGAGGTCGA CCTGTACGGC GATGACGCGC TGCGACGGCT GCTGGTCAAG CCGGGAATGA CCGGGCTCTG GCAGGTGTCC GGCCGGTCCG ACCTGACCTG GGACGACAGC GTCCGGCTCG ACGTGTACTA CGTGGAGAAC TGGTCCATCA CCGGTGATCT GGCCATCCTG TGGCGCACCG CCAAGGCGGT CCTCGGCTCG TCCGGGGCCT ATTGA
|
Protein sequence | MALTDPTGGR KLSLPGRRHR GRPGAPAVLT VVQGSAVQPS VANELPEIDG VVMRLPAGGT ASTTTGRNVE PAAADRSAAG LAEDCEDTIE FSIRQFRDQQ DEDLEVGRTV TVPPASTRGV RRTWSDRYAR KLFLTDLICL IWAAFGVHIV APPIPTRISS EPTNLAFVAA TALLVVVWLL ALDWSGSRDG TVTGHGPTEY KRVIQASLSV FGLAAIGSFL FDLDMPRSYV IMMLPAGLTM LLASRYVWRR WLYRRRDNGA MMATVVAVGD RHAVGELVAD LARSPRAGYR VVGACVSPDP AAPDATHFAG VPVLGVPADV AEVATELGAD AVAVTASASF GPSAVRRLSW DLEGTDTELI LAPALTNIAG PRVHTQPVAG LPLIHVDHPT YRGANRIVKR VFDVFGSLAL VVLFSPVLLA VAVAIKATSK GPVFFRQDRV GINGETFRMI KFRSMVIDAE SRLETLKAEQ RDAGNQVLFK MKNDPRITPV GKFIRRFSID ELPQLFNVVA GSMSLVGPRP PLRSEVDLYG DDALRRLLVK PGMTGLWQVS GRSDLTWDDS VRLDVYYVEN WSITGDLAIL WRTAKAVLGS SGAY
|
| |