Gene Sare_3539 details

Gene Information

Locus tag: Sare_3539
Symbol:
ID: 5704607
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4079755
End bp: 4081203
Gene Length: 1449 bp
Protein Length: 482 aa
Translation table: 11
GC content: 68%
IMG OID: 641272966
Product: UDP-N-acetylglucosamine 1-carboxyvinyltransferase
Protein accession: YP_001538332
Protein GI: 159039079
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0766] UDP-N-acetylglucosamine enolpyruvyl transferase
TIGRFAM ID: [TIGR01072] UDP-N-acetylglucosamine 1-carboxyvinyltransferase
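
The start and end coordinates, the gene length, and the protein length listed above are consistent with one another if the coordinates are read as 1-based and inclusive and the final codon is a stop codon. The record does not state that convention, so it is an assumption here. A minimal Python sketch of the arithmetic, with hypothetical helper names `gene_length` and `protein_length`:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa of the product, assuming an unspliced CDS whose
    last codon is a stop codon (so it is not counted as a residue)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(4079755, 4081203)
print(length_bp)                  # 1449
print(protein_length(length_bp))  # 482
```

Both results match the Gene Length and Protein Length fields of the record.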


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.553813
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000875806
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGGGTGTCC CGGTTACCGA CCCCTACCAC CGCGGATCGC AGGACATCCA ACCCCACCGA 
TACCCGGCAC CACTCGACGC GCTGGAGGTT GCGTTGACCG ACGACGTCCT GGTCGTACAC
GGAGGCACCC CGCTGGAAGG GCGAATCCGC GTACGCGGCG CGAAGAACCT GGTCTCCAAG
GCAATGGTCG CCGCGCTGCT CGGTGACAGC CCGAGTCGGC TGTACGACCT GCCGAAGATC
CGTGACGTCG AGGTCGTCCG CGGCCTGCTC GGGGTACACG GGGTCAAGGT CACCGATGGC
GACGAGGACG GCGCGCTGGT CCTCGACCCC GCCAACGTGG AGAGCGCCAG CACCGACCAG
ATCAACGTGC ACGCTGGCTC AAGCCGGATC CCGATCCTGT TCTGCGGGCC GCTGCTGCAC
CGGCTCGGCC ACGCCTTCAT TCCCGATCTT GGCGGCTGCC ACATCGGCCC CCGCCCGATC
GACTTCCACC TCCAGGCGCT GCGCGAGTTC GGGGCGACCG TCGACAAGCA GCCGGAGGGC
CTGCACCTGT CGGCGCCGAA CGGACTACAC GGCACCAAGT TCGCTCTGCC CTACCCGAGC
GTCGGCGCCA CCGAGCAGGT GCTGCTGACC GCCGTGATGG CCGAGGGCGT CACCGAGCTG
CGCAACGCGG CGGTCGAACC GGAGATCGTC GACCTGATCT GTGTCCTGCA GAAGATGGGC
GCGATCATCA AGGTGCACAC CGACCGGGTG ATCGAGATCC AGGGTGTGCC GAAGCTCCAC
GGCTACTCCC ACCGCCCGAT CCCGGACCGG ATCGAGGCGG CCAGTTGGGC CGCCGCCGCG
CTCGCCACCC GTGGTCACGT CGAGGTGCTT GGCGCGGAGC AGGCCGACAT GATGACGTTC
CTCAACATCT TCCGCTCGGT CGGCGGTGAG TACGAGGTCA CCGATGCCCG CCCGCCCCGG
TTGAACGATC CCGGCCAGGA GGGCGGCATC CGATTCTGGC ACCCGGGCGG GGAGCTGAAG
TCGGTCGCAC TGGAGACCGA CGTACACCCG GGTTTCATGA CCGACTGGCA GCAACCCTTG
GTCGTGGCAC TGACCCAGGC CCGTGGTCTG TCGATCGTCC ACGAGACGGT GTACGAGCAG
CGGCTCGGCT ACACCGAAGC CCTCAACTCG ATGGGCGCGA ACATCCAGAT CTACCGGGAC
TGCCTGGGTG GCACCCCGTG TCGCTTCGGC CGACGCGACT TCAAGCACTC GGCGGTTATC
GCCGGGCCGA GCAAACTGCA CGCCGCCGAT CTGGTCATCC CCGACCTGCG GGCAGGGTTC
AGCCATCTGA TCGCGGCACT CGCCGCCGAG GGCACCTCCC GGGTGTACGG CGTCGACCTG
ATCAACCGCG GCTACGAGGA CTTCGAGGCG AAGCTCGCCG ACCTGGGCGC GCACGTCGAG
CGGCCGTGA
 
Protein sequence
MGVPVTDPYH RGSQDIQPHR YPAPLDALEV ALTDDVLVVH GGTPLEGRIR VRGAKNLVSK 
AMVAALLGDS PSRLYDLPKI RDVEVVRGLL GVHGVKVTDG DEDGALVLDP ANVESASTDQ
INVHAGSSRI PILFCGPLLH RLGHAFIPDL GGCHIGPRPI DFHLQALREF GATVDKQPEG
LHLSAPNGLH GTKFALPYPS VGATEQVLLT AVMAEGVTEL RNAAVEPEIV DLICVLQKMG
AIIKVHTDRV IEIQGVPKLH GYSHRPIPDR IEAASWAAAA LATRGHVEVL GAEQADMMTF
LNIFRSVGGE YEVTDARPPR LNDPGQEGGI RFWHPGGELK SVALETDVHP GFMTDWQQPL
VVALTQARGL SIVHETVYEQ RLGYTEALNS MGANIQIYRD CLGGTPCRFG RRDFKHSAVI
AGPSKLHAAD LVIPDLRAGF SHLIAALAAE GTSRVYGVDL INRGYEDFEA KLADLGAHVE
RP
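
The sequences above are printed in space-separated blocks of 10 characters, so the whitespace has to be removed before any length or composition check. The following is a minimal Python sketch of that cleanup plus a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example runs on the first line of the gene sequence only, not on the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence as printed in the record.
first_line = "GTGGGTGTCC CGGTTACCGA CCCCTACCAC CGCGGATCGC AGGACATCCA ACCCCACCGA"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 3))
```

Applying the same two functions to the complete gene sequence should give a length equal to the Gene Length field and a GC fraction that can be compared with the GC content field.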