Gene Namu_4441 details


Gene Information

Locus tag: Namu_4441
Symbol:
ID: 8450068
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 4925503
End bp: 4927287
Gene Length: 1785 bp
Protein Length: 594 aa
Translation table: 11
GC content: 70%
IMG OID: 645043488
Product: exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase
Protein accession: YP_003203716
Protein GI: 258654560
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2148] Sugar transferases involved in lipopolysaccharide synthesis
TIGRFAM ID: [TIGR03022] Undecaprenyl-phosphate galactose phosphotransferase, WbaP
            [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase
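
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 4925503
end_bp = 4927287

# An inclusive range of positions contains end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Each codon is three bases; the last codon is assumed to be a stop codon,
# so it does not contribute an amino acid to the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1785 594
```

Both values agree with the Gene Length and Protein Length fields of the record.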


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.796924
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.559619
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGTTGA CCGACCCGAC CGGGGGCCGC AAGCTGAGCC TGCCCGGTCG GCGCCATCGC 
GGTCGTCCGG GGGCGCCCGC CGTGCTCACG GTCGTGCAGG GCAGCGCCGT CCAGCCTTCG
GTCGCCAATG AGTTGCCCGA GATCGACGGG GTCGTCATGC GGCTGCCGGC TGGCGGCACC
GCGTCGACCA CGACCGGACG GAACGTCGAA CCGGCGGCCG CCGATCGGTC CGCAGCCGGT
CTGGCCGAGG ACTGCGAAGA CACGATCGAG TTCAGCATCC GCCAGTTCCG AGACCAGCAG
GACGAGGATC TCGAGGTCGG CCGGACGGTC ACGGTCCCGC CGGCCAGCAC CCGAGGGGTG
CGACGGACCT GGAGCGATCG CTACGCCCGC AAGCTGTTCC TGACCGACCT GATCTGCCTG
ATCTGGGCGG CCTTCGGCGT GCACATCGTC GCGCCGCCGA TCCCCACCCG GATTTCCTCC
GAGCCGACCA ACCTGGCGTT CGTCGCCGCC ACCGCGCTGC TCGTCGTGGT CTGGCTGCTC
GCCCTGGACT GGTCCGGCAG CCGGGACGGC ACCGTCACCG GGCACGGTCC CACCGAGTAC
AAGCGGGTCA TCCAGGCCTC GCTGAGCGTC TTCGGGCTGG CCGCCATCGG ATCGTTCCTG
TTCGACCTGG ACATGCCCCG CAGCTACGTC ATCATGATGC TGCCGGCCGG CCTGACGATG
CTGCTGGCCA GCCGATACGT GTGGCGCCGC TGGTTGTACC GGCGTCGGGA CAACGGCGCG
ATGATGGCTA CGGTGGTTGC CGTGGGCGAC CGTCATGCGG TCGGCGAACT GGTGGCCGAC
CTCGCCCGCT CGCCACGCGC CGGCTACCGG GTGGTCGGGG CGTGCGTGAG CCCGGATCCG
GCGGCCCCGG ACGCCACCCA CTTCGCCGGG GTGCCCGTGC TCGGCGTGCC GGCCGACGTC
GCCGAGGTGG CCACCGAACT GGGGGCCGAT GCGGTCGCGG TGACCGCGTC CGCCAGCTTC
GGCCCGAGCG CCGTTCGCCG GCTCAGCTGG GACCTGGAGG GCACGGACAC CGAGCTGATC
CTGGCGCCCG CCCTGACCAA CATCGCCGGC CCGCGGGTGC ACACCCAGCC GGTTGCCGGG
CTGCCGCTGA TCCACGTCGA CCATCCCACC TATCGGGGTG CGAACCGCAT CGTCAAGCGC
GTCTTCGACG TGTTCGGCAG CCTCGCGCTG GTCGTCCTGT TCTCGCCGGT GCTGCTGGCG
GTCGCGGTGG CGATCAAGGC CACCAGCAAG GGACCGGTCT TCTTCCGGCA GGACCGGGTG
GGGATCAACG GCGAGACCTT CCGGATGATC AAGTTCCGGT CGATGGTGAT CGACGCGGAG
TCGCGCCTGG AGACGCTCAA GGCCGAGCAG CGGGACGCCG GCAACCAGGT CCTGTTCAAG
ATGAAGAACG ACCCCCGGAT CACCCCGGTC GGCAAGTTCA TCCGCCGGTT CAGCATCGAC
GAGCTGCCGC AGCTGTTCAA CGTGGTCGCG GGCTCCATGA GCCTGGTCGG TCCGCGCCCG
CCGCTGCGTT CCGAGGTCGA CCTGTACGGC GATGACGCGC TGCGACGGCT GCTGGTCAAG
CCGGGAATGA CCGGGCTCTG GCAGGTGTCC GGCCGGTCCG ACCTGACCTG GGACGACAGC
GTCCGGCTCG ACGTGTACTA CGTGGAGAAC TGGTCCATCA CCGGTGATCT GGCCATCCTG
TGGCGCACCG CCAAGGCGGT CCTCGGCTCG TCCGGGGCCT ATTGA
 
Protein sequence
MALTDPTGGR KLSLPGRRHR GRPGAPAVLT VVQGSAVQPS VANELPEIDG VVMRLPAGGT 
ASTTTGRNVE PAAADRSAAG LAEDCEDTIE FSIRQFRDQQ DEDLEVGRTV TVPPASTRGV
RRTWSDRYAR KLFLTDLICL IWAAFGVHIV APPIPTRISS EPTNLAFVAA TALLVVVWLL
ALDWSGSRDG TVTGHGPTEY KRVIQASLSV FGLAAIGSFL FDLDMPRSYV IMMLPAGLTM
LLASRYVWRR WLYRRRDNGA MMATVVAVGD RHAVGELVAD LARSPRAGYR VVGACVSPDP
AAPDATHFAG VPVLGVPADV AEVATELGAD AVAVTASASF GPSAVRRLSW DLEGTDTELI
LAPALTNIAG PRVHTQPVAG LPLIHVDHPT YRGANRIVKR VFDVFGSLAL VVLFSPVLLA
VAVAIKATSK GPVFFRQDRV GINGETFRMI KFRSMVIDAE SRLETLKAEQ RDAGNQVLFK
MKNDPRITPV GKFIRRFSID ELPQLFNVVA GSMSLVGPRP PLRSEVDLYG DDALRRLLVK
PGMTGLWQVS GRSDLTWDDS VRLDVYYVEN WSITGDLAIL WRTAKAVLGS SGAY
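
The sequences in this record are printed in blocks of ten characters, six blocks per line, so they need to be joined before any analysis. The following is a minimal sketch of how that might be done, along with a GC percentage calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical and introduced only for illustration.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no arguments splits on any run of whitespace,
    # so spaces between blocks and line breaks are both removed.
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C characters in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(seq.count(base) for base in "GC") / len(seq)


# First line of the gene sequence in this record, copied verbatim.
first_line = "ATGGCGTTGA CCGACCCGAC CGGGGGCCGC AAGCTGAGCC TGCCCGGTCG GCGCCATCGC"
seq = clean_sequence(first_line)

# Six blocks of ten bases give 60 characters, starting with the ATG start codon.
print(len(seq), seq[:3], round(gc_percent(seq), 1))
```

Passing the complete gene sequence block through `clean_sequence` should give a string whose length matches the Gene Length field (1785 bp), and `gc_percent` on that string can be compared with the GC content field.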