Gene Namu_4935 details


Gene Information

Locus tag: Namu_4935
Symbol:
ID: 8450566
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 5511072
End bp: 5512472
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 74%
IMG OID: 645043974
Product: UDP-N-acetylglucosamine
Protein accession: YP_003204198
Protein GI: 258655042
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID: [TIGR03449] UDP-N-acetylglucosamine: 1L-myo-inositol-1-phosphate 1-alpha-D-N-acetylglucosaminyltransferase
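
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5511072
end_bp = 5512472
gene_length_bp = 1401
protein_length_aa = 466

# Assumption: coordinates are 1-based and inclusive, so both ends count.
computed_gene_length = end_bp - start_bp + 1

# Assumption: one codon (3 bp) per residue, plus one untranslated stop codon.
computed_protein_length = computed_gene_length // 3 - 1

print(computed_gene_length, computed_protein_length)  # prints "1401 466"
```

Under these assumptions both computed values agree with the listed gene length (1401 bp) and protein length (466 aa).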


Plasmid Coverage information

Num covering plasmid clones: 54
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCAAGC ACTCCGCGGC CGAGGACGAC GCCTCCGCCT TCATCCCCCC GGACGGTGGC 
GGGCCGGCTG ATCCGCCGAT CCGACGCCTG GCCGCGATCT CGCTGCACAC CTCGCCGCTG
GCCCAACCGG GGACCGGCGA CGCGGGTGGG ATGAATGTCT ACATCGAGGC CACCGCCCGC
CGGCTGGCCG CGCGCGGGGT CGAGGTGGAG ATCTTCACCC GGGCCACCTC CAGCGACTTG
CCGCCCATCG TGGAGATGGT GCCCGGGGTG CTCGTCCGGC ACATCGTGGC CGGGCCGTTC
GAGGGCCTGG ACAAGGAAGA TCTGCCCGGG CAGCTGTGCT CGTTCGCCGC CGGGGTGATG
CGCGCCGAGG CCCGCAACGC CCCCGGCCAC TACGACCTCG TGCACTCGCA CTACTGGCTC
TCCGGCCAGG TCGGGTATCT GGCCAAGGAC CGCTGGGGAG TGCCGCTGGT GCACTCCGCG
CACACCCTGG CCAAGGTCAA GAACGCGGCC ATGGCCGAGG GTGACGACCC CGAACCGCGC
GGCCGGATCA TCGGCGAGGA GCAGGTGGTG GTCGAGGCGG ATCGGTTGAT CGCCAACACC
GCCGCCGAGC GGGCCGAGCT GGTCGGCCTG TACGGCGCGG ACGAGCGGTT GATCGACGTG
GTCCCCCCGG GGGTGGACAC CGAGGTGTTC AGCCCGGGTG ACCGGGCCGC GGCCCGGCAG
GCGCTGGGCA TCGGGCCGGA CGAGAAGGTC ATCGTCTTCG CCGGCCGCAT CCAGCCGCTC
AAGGGACCGG ACGTGGTGGT GCGGGCGGTG CATCAGCTGG CCGACCGGTA CCCCGACCAG
CGATGGCGGC TGGTCATCGT GGGCGGCGCC TCCGGCGCCG GCCGCCGGCC CGGGCATCAA
CTGCACGAGC TGGTCGACCT GCTCGGCAGC CGCGACACGA TCGACTTCCG GCCGGCGGTG
CCGGCCGCCG AGCTGGCCGT GATCTACCGG GCCGCCGACG TGGTCGCCGT GCCCAGCTAC
AACGAGTCCT TCGGGTTGGT GGCGATCGAG GCCCAGGCGT CGGGGACGCC GGTGGTCGCG
GCCGCGGTCG GCGGCCTCAC GGTGGCCGTG GCCGACGGCG TCAGCGGGTC GTTGGTCAAC
GGTCACGACC CGGGTCGGTG GGCCGACGCG TTGGCCGCGG TCACCCTCGA CGCGCCCCGG
CGGGATCGCC TCTCGGTGGG TGCCCGGCAG CAGGCCGCCC AGTTCTCCTG GGACGCGACC
GTCGACGGCC TGCTGCGTAG CTACCGGGCC GCCCGCGACG GCGCCCGGGT GGACCAGGGC
CGGACCGGGA TCGCCCGGAT AGATCAGATC GGCCGGATCG AGGCGAACGG CAGCCCGCTG
GCCCGGGTGG CCGGCCGATG A
 
Protein sequence
MGKHSAAEDD ASAFIPPDGG GPADPPIRRL AAISLHTSPL AQPGTGDAGG MNVYIEATAR 
RLAARGVEVE IFTRATSSDL PPIVEMVPGV LVRHIVAGPF EGLDKEDLPG QLCSFAAGVM
RAEARNAPGH YDLVHSHYWL SGQVGYLAKD RWGVPLVHSA HTLAKVKNAA MAEGDDPEPR
GRIIGEEQVV VEADRLIANT AAERAELVGL YGADERLIDV VPPGVDTEVF SPGDRAAARQ
ALGIGPDEKV IVFAGRIQPL KGPDVVVRAV HQLADRYPDQ RWRLVIVGGA SGAGRRPGHQ
LHELVDLLGS RDTIDFRPAV PAAELAVIYR AADVVAVPSY NESFGLVAIE AQASGTPVVA
AAVGGLTVAV ADGVSGSLVN GHDPGRWADA LAAVTLDAPR RDRLSVGARQ QAAQFSWDAT
VDGLLRSYRA ARDGARVDQG RTGIARIDQI GRIEANGSPL ARVAGR
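
The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched in a few lines of Python. This is an illustrative sketch, not the tool that generated this record. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon table uses the amino acid assignments of NCBI translation table 11, which match the standard code; the alternative start codons that table 11 permits are not handled here, which is sufficient for this gene because it begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence, in the spaced format shown on this page.
first_bases = clean("ATGGGCAAGC ACTCCGCGGC CGAGGACGAC")
print(translate(first_bases))  # prints "MGKHSAAEDD"
```

The translated fragment matches the first ten residues of the protein sequence. Passing the full cleaned gene sequence to `gc_content` and `translate` would allow the same comparison against the GC content and protein sequence listed in this record.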