Gene Sare_3439 details


Gene Information

Locus tag: Sare_3439
Symbol:
ID: 5703289
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 3964155
End bp: 3965261
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 71%
IMG OID: 641272866
Product: UDP-N-acetylglucosamine--N-acetylmuramyl-(pentapeptide) pyrophosphoryl-undecaprenol N-acetylglucosamine transferase
Protein accession: YP_001538232
Protein GI: 159038979
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0707] UDP-N-acetylglucosamine:LPS N-acetylglucosamine transferase
TIGRFAM ID: [TIGR01133] undecaprenyldiphospho-muramoylpentapeptide beta-N-acetylglucosaminyltransferase
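
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes one terminal stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields.
START_BP = 3964155
END_BP = 3965261


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Length in aa, assuming the CDS ends with a single stop codon."""
    codons, remainder = divmod(gene_len, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode an amino acid


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "1107 368"
```

Both results agree with the listed Gene Length (1107 bp) and Protein Length (368 aa).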


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0197262
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTCCGC TGCGTTCGGT GGTGCTTGCG GGAGGTGGCA CCGGGGGCCA CATCTACCCG 
CTGCTCGCCT TCGCCGACTG CCTGCGCCGG CACGACTCCG GCGTCCGGGT CACCTGCCTG
GGCACCCCCA AGGGCCTGGA GAACGAGCTG ATCCCGCCGG CCGGGTACGA CCTGCGGCAG
ATCCCCGCGC ACCAGCTTCC CCGTTCGGTC AACCTGGACC TGGTGAAGAC CCCGGGGCGG
ATGTGGACCG CGGCCCGCGC CGCCGGCAAG GTCATCGACG AGGTGGAGGC CGATGTGGTG
GTGGGGTTCG GCGGGTACGT CTCGGTCCCG GCCTATCTCG CCGCGTGGCG ACGCGAGCTG
CCGATCGTCA TCCACGAGGT CAATGTGCCA CCGGGGGTGG CCAACCGACT GGGCATGAAG
TTCACCAGGC ACGTTGCGGT GGGCTTCCCG CACCAGCCGG CGCAGGCCGA GTCGCTGCGC
CAGGCCCGGG TAGTCGGGGT ACCGCTGCGC CGGGGTATCG CCGGCCTGGA CCGGGCCGCC
ATGCGCGATG CCGCCCGCGC CCACTTCGGG CTCCGTCCGG ACCTGCCGGT ACTCTTCGTC
GCCGGTGGAT CGCAGGGCGC GCGCTCGATC AACCTGGCGG TTTCGGGGGC GGCCAAGGAG
TTGGCCCGCA ACGGAGTGCA GGTGCTGCAC GTGATCGGTG CGCGTAACGA GACGGTGTCG
GTGCCGACCG ATTTGCCGGC GCCGTATGTG ACCCTGCCGT ACCTGTCGCA GATGGAGCTG
GGCTACGCCG CAGCCGATCT GATGCTCGGC CGCGGCGGGG CGATGACCTG CGCGGAGGTG
GCGGCGATCG GGTTGCCGAC GGTCTACGTT CCGTACCCAC ACAGCAACCA GGAGCAGCGG
CGCAACGCGT TGCCGGTGGT GGAGGCCGGT GGTGGACTAC TCGTTGACGA CGCTGAGCTG
ACGCCGGCCT GGGTGGAGGG CAATGTGATA CCGCTGGCCC GCGACCCGCA CCGGCTGGCC
GCGATGGGGG CTGCCGCCGC CGCGTACGGG AATCGCGACG GCGATGAGGC CCTGCTCAAC
TTCGTTTACG AGGCGGTGGT CCGGTGA
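
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be cleaned before its length or GC content can be compared with the Gene Length and GC content fields. The following is a minimal sketch with hypothetical helper names; for brevity it runs on the first row of the sequence only, and the full sequence would need to be pasted in to reproduce the record-level figures.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First row of the gene sequence only (an excerpt, not the whole gene).
excerpt = "ATGGGTCCGC TGCGTTCGGT GGTGCTTGCG GGAGGTGGCA CCGGGGGCCA CATCTACCCG"

seq = clean_sequence(excerpt)
print(len(seq), f"{gc_fraction(seq):.0%}")
```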
 
Protein sequence
MGPLRSVVLA GGGTGGHIYP LLAFADCLRR HDSGVRVTCL GTPKGLENEL IPPAGYDLRQ 
IPAHQLPRSV NLDLVKTPGR MWTAARAAGK VIDEVEADVV VGFGGYVSVP AYLAAWRREL
PIVIHEVNVP PGVANRLGMK FTRHVAVGFP HQPAQAESLR QARVVGVPLR RGIAGLDRAA
MRDAARAHFG LRPDLPVLFV AGGSQGARSI NLAVSGAAKE LARNGVQVLH VIGARNETVS
VPTDLPAPYV TLPYLSQMEL GYAAADLMLG RGGAMTCAEV AAIGLPTVYV PYPHSNQEQR
RNALPVVEAG GGLLVDDAEL TPAWVEGNVI PLARDPHRLA AMGAAAAAYG NRDGDEALLN
FVYEAVVR
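
The protein sequence above can be related to the gene sequence by translating codon by codon. The following is a minimal sketch, assuming the codon assignments of translation table 11, which are the same as those of the standard genetic code. Table 11 also permits alternative start codons that are read as methionine; that case is not handled here and does not arise for this gene, which starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence (60 bases, 20 codons).
first_row = "ATGGGTCCGC TGCGTTCGGT GGTGCTTGCG GGAGGTGGCA CCGGGGGCCA CATCTACCCG"
print(translate(first_row))  # prints "MGPLRSVVLAGGGTGGHIYP"
```

The output matches the first 20 residues of the protein sequence. The last three codons of the gene, GTC CGG TGA, translate to V, R and a stop, consistent with the protein ending in VR.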