Gene Namu_3526 details


Gene Information

Locus tag: Namu_3526
Symbol:
ID: 8449145
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 3873280
End bp: 3875286
Gene Length: 2007 bp
Protein Length: 668 aa
Translation table: 11
GC content: 71%
IMG OID: 645042604
Product: O-antigen polymerase
Protein accession: YP_003202840
Protein GI: 258653684
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3307] Lipid A core - O-antigen ligase and related enzymes
TIGRFAM ID:
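
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record; it assumes the start and end coordinates are 1-based and inclusive, and that the coding sequence ends in one stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3873280
end_bp = 3875286
gene_length_bp = 2007
protein_length_aa = 668

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons, remainder = divmod(gene_length_bp, 3)
coding_codons = codons - 1

print(span_bp == gene_length_bp)
print(remainder == 0 and coding_codons == protein_length_aa)
```

Under these assumptions both checks print `True`: the span is 2007 bp, which divides into 669 codons, one of which is the stop codon.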


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.000131402
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.176407
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGGCG GCGGCCTGAT CGTGGTCGCC TTCGTCGTGC TCGCCATCAC CTACCCCCGG 
GTGTCGCTGC TGACCCTGGT GGCGCTGGAC GTCTCCAACA TCAACGGGGT CATCGCCGAC
CAGCTGGGCA CCAGCCCGTA CAAGCCGCAG CTGGCGCTGG CCGTCGTCGT GGTGCTGGTG
ATGATGCGCC GCAGGATGTT CCGGTTCGCC TGGTCGCCGG TGCTGCTGGG GGCGATGGTG
CTGTTCGCCG GGTTCTGCGT GAGCTTCGTG GCCGCGGCCG ACCCGGTGAC CTCGCTGGAT
CTGCTGTCCT CCCAGGCCCG CGACCTGCTG TATTTCGTGG TCGTCTACGC CCTGGTGCTG
TCCACCGACC AGGTCAAACC GACCGCCGCG GTCGCCGTCC TGGTGCTGGC CGCGTTGGCC
GGGCTGACCG TGGTGCACGA GTTCGTGCTG CACAACCAGG GCAGCCTGGG CGGCCTGAGC
CGGGTGCCGC TGGTCCAGGA GGACGGGGCG CTGACCCCCC GGCACGCGGG CACGTCCTCG
GACGTGAATT TCTGGGGCCG GCTGCTGATC CTGTTCACGC CCATGTCGCT CTCGCTGCTG
GCCATGAGCA AGTCCTGGCG GGACCGCGTG CTGTGGGGTG GGGCGACCGT GTCGCTGATG
CTGGGGGTGT ACCTGACCCA GTCCCGCGGC GGCTTCATCG CGTTGTTCAT CGGCCTGGTG
ATGTGGGCCG TGCTGGCCGG CGGGGCCTAC CGAAAGGCGT TGCTGTACCT GCCGGTCGCG
CTGATCGTCA TCGTCCCGTT GACCGGGATC GGCAGCCGGC TGGGCACGCT GACCGCGGTC
GTCTCCGGCT CCACGACCAC CGCGGACCCG TCGGTGGTGA CCCGTAAGCG CTTTCAGCTC
GACGCCTGGC ACATGTTCCT GGACCGACCG ATCACCGGCC ACGGGATCGG CAGCTACGGC
GGGCTATTCG CGGAGTACGA CCGGCTGGCC AACTTCTATG AACCAGTCAA CATCGTCGTC
GCCGCGCACA ATTTCTACCT GGAGCAGGCC GCCGACGGCG GGGTGGTCCT GCTGATCTGC
TGGGCGTTCT TCTTCGGCAC GATCCTGTTC GTCGCCCTGC GCACCATGAT CGGGGCCGGA
CGAACCGGTG ACGACACCCG CCGGTTCCTG GCCCTGGGGG TGATCGGCGG GCTGATCGGC
TGGCTGATCG CCAGCGTCTT CCTGCACCTG TCGGACTTCC GGGCCCTGCT GCTGATCGCG
GTCATCGCCG CGGCCGTCGA CGTGCAAAGC CGGGCCGCGA TCGACCGTCG CGGGATCGCC
CCCGAGCCTG AGCCGGGCCC GTGGCGCCCC GGCCGCGGCG TGGTCGCCGG GCTCGCTGTG
ATGGCCGCGC TGAGCCTGCT GGGCACGGTC GCCGTCCTCA CCAATCGCAC GACGGTCTAC
ACCAGCACGA CCGCCCTGGG CATCGTGCCG TCGGGGGCGG TGAATCCGGC CAGTGCCTAC
TCGCTGGACC TCATCACCCG GGGGCTGATC GGCCCGACGT TCACCGAGGT GCTCAGCCGG
TCGGTCGACG CCACCGCGGT CACCCAGCGG GCCGGCCCCG GCGGCGCTGA CAGTGCTGAC
AGTGCTGACG GGGCTGACCG TGCGGATGTC GAGATCGCTT TCGCCCAGTC GCGACTGGGC
GGTGGGGTGG TGCTCACGGT CACCGCCGAC GACAGCGACG CGGCCACCGA ACTGACCGCC
GCGGCGGTCG CGGCGAGCAA GGCTCGGATC GCCGACCTGG ACACCGGCTA CCAGCTGATC
GGCGATCCCG CCCAGCCGCA ACCGGTCTCG TCGCCGGAGT GGTGGTGGCT GCCGGCGCTG
GCGTTGTCCA CGATCGGCTG TGCCGCGCTC GCCGTGATCA CCGGCCGGCG GCGCCGGGCC
CTTCGACCGG CCGACCCGGA CCGCGGGCCG GCCGGAACCG CCGATCGGGC CGAGCAGACC
CGCGTCCCGG CCCTGGCGGC CCGGTGA
 
Protein sequence
MAGGGLIVVA FVVLAITYPR VSLLTLVALD VSNINGVIAD QLGTSPYKPQ LALAVVVVLV 
MMRRRMFRFA WSPVLLGAMV LFAGFCVSFV AAADPVTSLD LLSSQARDLL YFVVVYALVL
STDQVKPTAA VAVLVLAALA GLTVVHEFVL HNQGSLGGLS RVPLVQEDGA LTPRHAGTSS
DVNFWGRLLI LFTPMSLSLL AMSKSWRDRV LWGGATVSLM LGVYLTQSRG GFIALFIGLV
MWAVLAGGAY RKALLYLPVA LIVIVPLTGI GSRLGTLTAV VSGSTTTADP SVVTRKRFQL
DAWHMFLDRP ITGHGIGSYG GLFAEYDRLA NFYEPVNIVV AAHNFYLEQA ADGGVVLLIC
WAFFFGTILF VALRTMIGAG RTGDDTRRFL ALGVIGGLIG WLIASVFLHL SDFRALLLIA
VIAAAVDVQS RAAIDRRGIA PEPEPGPWRP GRGVVAGLAV MAALSLLGTV AVLTNRTTVY
TSTTALGIVP SGAVNPASAY SLDLITRGLI GPTFTEVLSR SVDATAVTQR AGPGGADSAD
SADGADRADV EIAFAQSRLG GGVVLTVTAD DSDAATELTA AAVAASKARI ADLDTGYQLI
GDPAQPQPVS SPEWWWLPAL ALSTIGCAAL AVITGRRRRA LRPADPDRGP AGTADRAEQT
RVPALAAR
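
The relationship between the two sequences can be illustrated with a short script. This is a sketch under stated assumptions rather than the method used to produce the record: the helper names `clean`, `gc_content`, and `translate` are hypothetical, and the codon table is written out by hand using the residue assignments of NCBI translation table 11, which are the same as those of the standard code (the tables differ only in their permitted start codons). Only the first three 10-letter blocks of the gene sequence are used as sample input.

```python
BASES = "TCAG"
# Residues for the 64 codons in TCAG order (NCBI translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for the 10-letter display blocks."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence above.
first_blocks = "ATGGCGGGCG GCGGCCTGAT CGTGGTCGCC"
print(translate(first_blocks))  # MAGGGLIVVA
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `gc_content` would give a value to compare with the GC content field, and passing it to `translate` would give a string to compare with the full protein sequence.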