Gene Namu_3524 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3524 
Symbol 
ID8449143 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp3870571 
End bp3871983 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content68% 
IMG OID645042602 
Productexopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 
Protein accessionYP_003202838 
Protein GI258653682 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2148] Sugar transferases involved in lipopolysaccharide synthesis 
TIGRFAM ID[TIGR03023] Undecaprenyl-phosphate glucose phosphotransferase
[TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0000171122 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.258747 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCAT TGCGCCCGGC ACCGGTCACG CGTGACACGA ATGCCAGCAA GAAACCGTCG 
GAACGACGAG GGTCGCGCAC GCGTCGCTTC CTCTGGTTGA TCCGAGCCTG GATGATCGTC
CTGCCGGTCG ACGCGATCAT GCTGCTCATG CCCCTGCTGT GGGCGCCGGA CGAGTACGTC
GCGATCATCT CGATGACGGC GTTGTCGCTG TTCCTGATCG TCGACCGCGG TCGCTATCAC
GCGCGGTTGC ACGTCAGCGT GCTGGACGAG CTGCCCAGCC TGGTCGCCAG CCTGCTGACC
GCGGCGGCCA TCGTGGCGAC CGGGATCGCC CTGCTGCCCG AGGGTCCCAT GGTCACCGCG
TTCATGAAGA ACGCGGCGAT CGCCATCGTG TTGGTGGTCA TCGGCCGGAT CATCACCTCG
CAGCTGATCA TCTTCAGCCG CCGCCGCCGT TACAGCCACC ACCGGACGGT GGTGATCGGG
GGCGGCCAGA CCGCCGCCGA GCTCATCCGG ATCCTCAAAC AGCATCAGCG GTACGGACTG
TCGGTCGTCG GCTTCGTCGA CGACAAGGAC GCCGCGGCCA GCATGGTCAG CGAACGGCTG
GGCAACATCA GCGATCTGGA CTCGGTGGTC ACCCGCTACC GGGTGGACGT GCTGCTGGTG
GCCGATTCCG CGGTGCCCGA GCGCACGCTG GTGGAGGAGG TCCGGGCCCC GGCCAGTTCC
ACCTGCGACC TGCTGGTGGT GCCCCGGATG CACCAGTTCC GCACCGTGTC CGGCGGGTCG
GACCACATCG GTTCGATCCC GATCATGCGC ATCGGCAACC CGCGACTGAG CGGGCCGGCC
ATCACCGTCA AGCGGGTCTT CGACATCCTC GTCTCCGGCA CCGCGCTGAT TCTGGTCTCG
CCGATCCTGG CCGTCTGCGC GCTGGCCGTG CGCATCGACG GCGGGCCGGG CGTGATCTTC
CGCCAACCCC GGGTCGGCCG GAACGGGGAG CTGTTCGACT GCCTCAAGCT GCGGTCGATG
CGGCCGGCCA CCAGTGCCGA ATCGGCCACC AACTGGTCGA TCGCCACCGA CAACCGGGTC
AGCAAGGTCG GCCGGTTCCT GCGCCGCACC TCGCTGGACG AATTGCCGCA GCTGTGGAAC
ATTCTGCGCG GCGACATGTC TCTGGTCGGG CCGCGTCCCG AGCGGCCGCA CTTCGTCGAG
CAGTTCTCCG ACAAGTACGA CCGCTACGCC TACCGGCACC GGGTCAAGGT CGGGCTGACC
GGGCTGGCCC AGGTCAGCGG GCTGCGCGGG GACACCTCGA TCGCCGACCG GGCCCGGTAC
GACAACTACT ACATCGAGAA CTGGTCGCTC TGGCTCGACG TCAAGATCAT CATCCGGACG
TTCTTCGAGG TCGTCTTCGC CCGCGGCCGG TGA
 
Protein sequence
MTALRPAPVT RDTNASKKPS ERRGSRTRRF LWLIRAWMIV LPVDAIMLLM PLLWAPDEYV 
AIISMTALSL FLIVDRGRYH ARLHVSVLDE LPSLVASLLT AAAIVATGIA LLPEGPMVTA
FMKNAAIAIV LVVIGRIITS QLIIFSRRRR YSHHRTVVIG GGQTAAELIR ILKQHQRYGL
SVVGFVDDKD AAASMVSERL GNISDLDSVV TRYRVDVLLV ADSAVPERTL VEEVRAPASS
TCDLLVVPRM HQFRTVSGGS DHIGSIPIMR IGNPRLSGPA ITVKRVFDIL VSGTALILVS
PILAVCALAV RIDGGPGVIF RQPRVGRNGE LFDCLKLRSM RPATSAESAT NWSIATDNRV
SKVGRFLRRT SLDELPQLWN ILRGDMSLVG PRPERPHFVE QFSDKYDRYA YRHRVKVGLT
GLAQVSGLRG DTSIADRARY DNYYIENWSL WLDVKIIIRT FFEVVFARGR