Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3539 |
Symbol | |
ID | 5704607 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 4079755 |
End bp | 4081203 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641272966 |
Product | UDP-N-acetylglucosamine 1-carboxyvinyltransferase |
Protein accession | YP_001538332 |
Protein GI | 159039079 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0766] UDP-N-acetylglucosamine enolpyruvyl transferase |
TIGRFAM ID | [TIGR01072] UDP-N-acetylglucosamine 1-carboxyvinyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.553813 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000875806 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGGTGTCC CGGTTACCGA CCCCTACCAC CGCGGATCGC AGGACATCCA ACCCCACCGA TACCCGGCAC CACTCGACGC GCTGGAGGTT GCGTTGACCG ACGACGTCCT GGTCGTACAC GGAGGCACCC CGCTGGAAGG GCGAATCCGC GTACGCGGCG CGAAGAACCT GGTCTCCAAG GCAATGGTCG CCGCGCTGCT CGGTGACAGC CCGAGTCGGC TGTACGACCT GCCGAAGATC CGTGACGTCG AGGTCGTCCG CGGCCTGCTC GGGGTACACG GGGTCAAGGT CACCGATGGC GACGAGGACG GCGCGCTGGT CCTCGACCCC GCCAACGTGG AGAGCGCCAG CACCGACCAG ATCAACGTGC ACGCTGGCTC AAGCCGGATC CCGATCCTGT TCTGCGGGCC GCTGCTGCAC CGGCTCGGCC ACGCCTTCAT TCCCGATCTT GGCGGCTGCC ACATCGGCCC CCGCCCGATC GACTTCCACC TCCAGGCGCT GCGCGAGTTC GGGGCGACCG TCGACAAGCA GCCGGAGGGC CTGCACCTGT CGGCGCCGAA CGGACTACAC GGCACCAAGT TCGCTCTGCC CTACCCGAGC GTCGGCGCCA CCGAGCAGGT GCTGCTGACC GCCGTGATGG CCGAGGGCGT CACCGAGCTG CGCAACGCGG CGGTCGAACC GGAGATCGTC GACCTGATCT GTGTCCTGCA GAAGATGGGC GCGATCATCA AGGTGCACAC CGACCGGGTG ATCGAGATCC AGGGTGTGCC GAAGCTCCAC GGCTACTCCC ACCGCCCGAT CCCGGACCGG ATCGAGGCGG CCAGTTGGGC CGCCGCCGCG CTCGCCACCC GTGGTCACGT CGAGGTGCTT GGCGCGGAGC AGGCCGACAT GATGACGTTC CTCAACATCT TCCGCTCGGT CGGCGGTGAG TACGAGGTCA CCGATGCCCG CCCGCCCCGG TTGAACGATC CCGGCCAGGA GGGCGGCATC CGATTCTGGC ACCCGGGCGG GGAGCTGAAG TCGGTCGCAC TGGAGACCGA CGTACACCCG GGTTTCATGA CCGACTGGCA GCAACCCTTG GTCGTGGCAC TGACCCAGGC CCGTGGTCTG TCGATCGTCC ACGAGACGGT GTACGAGCAG CGGCTCGGCT ACACCGAAGC CCTCAACTCG ATGGGCGCGA ACATCCAGAT CTACCGGGAC TGCCTGGGTG GCACCCCGTG TCGCTTCGGC CGACGCGACT TCAAGCACTC GGCGGTTATC GCCGGGCCGA GCAAACTGCA CGCCGCCGAT CTGGTCATCC CCGACCTGCG GGCAGGGTTC AGCCATCTGA TCGCGGCACT CGCCGCCGAG GGCACCTCCC GGGTGTACGG CGTCGACCTG ATCAACCGCG GCTACGAGGA CTTCGAGGCG AAGCTCGCCG ACCTGGGCGC GCACGTCGAG CGGCCGTGA
|
Protein sequence | MGVPVTDPYH RGSQDIQPHR YPAPLDALEV ALTDDVLVVH GGTPLEGRIR VRGAKNLVSK AMVAALLGDS PSRLYDLPKI RDVEVVRGLL GVHGVKVTDG DEDGALVLDP ANVESASTDQ INVHAGSSRI PILFCGPLLH RLGHAFIPDL GGCHIGPRPI DFHLQALREF GATVDKQPEG LHLSAPNGLH GTKFALPYPS VGATEQVLLT AVMAEGVTEL RNAAVEPEIV DLICVLQKMG AIIKVHTDRV IEIQGVPKLH GYSHRPIPDR IEAASWAAAA LATRGHVEVL GAEQADMMTF LNIFRSVGGE YEVTDARPPR LNDPGQEGGI RFWHPGGELK SVALETDVHP GFMTDWQQPL VVALTQARGL SIVHETVYEQ RLGYTEALNS MGANIQIYRD CLGGTPCRFG RRDFKHSAVI AGPSKLHAAD LVIPDLRAGF SHLIAALAAE GTSRVYGVDL INRGYEDFEA KLADLGAHVE RP
|
| |