Gene Information |
Locus tag | Sare_3974 |
Symbol | |
ID | 5705251 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4513961 |
End bp | 4515439 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641273399 |
Product | undecaprenyl-phosphate galactose phosphotransferase |
Protein accession | YP_001538755 |
Protein GI | 159039502 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2148] Sugar transferases involved in lipopolysaccharide synthesis |
TIGRFAM ID | [TIGR03022] Undecaprenyl-phosphate galactose phosphotransferase, WbaP; [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase |
|
|
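The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names `gene_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Codons in the CDS minus one terminal stop codon (assumed unspliced CDS)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Sare_3974).
length_bp = gene_length(4513961, 4515439)
print(length_bp)                           # 1479
print(expected_protein_length(length_bp))  # 492
```

Both results match the listed Gene Length (1479 bp) and Protein Length (492 aa).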
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.321336 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00335965 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAACTCGA CGATGCTGTT GACCCCCGGC CGGTCAGCGG TGGTCAATGG CCTGTTCCGG CCGTGGACAC GTGCTGCTGT GCGGTCCTAC ATCCAGACTC TGGTGGTGCT CGACAGTGCG GTGCTGATCG TGGCCGTCCT CGTCGCGTAC GTCGCCCACT TCGGCGGCGG GCTTCCCCGC GGTGCCGAGA TTCCGTACGC CGTGGCCGCC CCTGGCCTGG TGCTGGCGTG GCTTGTCTCG CTCAGGGCGC TACGGTGCTA CGACGATCGG ATCATCGGCT ATGGCGCCGA CGAGTATCGG CGGGTGAGTT CGGCCAGCCT GCGCCTCGCT GGTGCCGTGG TGATCGCCGG CTACGTCTTC GATGTCGAGG TGCCGAGGGG TTTTCTCGCC ATCGCCTTCG CCGTCGGCAC CGTCGGGCTC GAGTCGGCCC GGTTCACCGC CCGTAAGCGA CTGCATCGAT CCCGGTCGCG GGGCGGCGGA TGGTCACGGC GAGTCCTCGT GGTCGGCGAC ACCGCGCACG TCCTGGAGTT GGTAGACACG CTGCGGCGTG AGCCGTACGC GGGCTACCAG GTGGTCGGGG CGTGCATCCC GGACGCACTG CTTGCTCCGG TCCCACAGCA GCTGGGCGAC GTGCCGGTGG TCGGTTCGTT CCGGAGTATC CCCGAAGCAG TTGCCACCAT CGATGCTGAC ACCGTGGCGG TGACCGCCTC CGGGCAGCTG ACCGCTACCC GGCTTCGCCG GCTCGGCTGG CAGCTGGAGG GAACCGGCGT TGACCTGGTG GTCGCGCCGG CACTGACCGA CGTCGCGGGC CCTCGGATCC ATACCCGTCC GGTGGCCGGA CTGCCACTGA TACATGTCGA GGCCCCTGAG TTCCGGGGCG TGGGCAAGCT GGTGAAAGGG CTGGTCGACC GGCTGGCCGC GCTGCTCGTA CTGATGCCGC TGCTGCCGTT GCTGGCGCTG ATCGCGTTGG CGGTCACGGT CGACAGTCGG GGATCGGCGT TGTTCCGGCA GACCCGGGTC GGGCAGGGGG GCCGTGAGTT CGGCGTGTGG AAGTTCCGCA CAATGGTGAT CAACGCGGAC GCCATGCTGG CGGAGCTGAC CGCCCGCAAC GAGACCGACG GCCTGATGTT CAAGCTGCGG GACGACCCCC GGGTGACCCG GATCGGTCGC GTGCTGCGCA AGTGGTCCCT GGACGAACTG CCCCAGCTCG TCAACGTCCT GTTCGGGCAG ATGAGCCTGG TGGGCCCCCG CCCACCGCTG CCGTCGGAGG TCGCACGTTA CGACGGCGAC ATCGCCCGGC GGCTGCTGGT CAAGCCCGGC ATGACGGGTC TCTGGCAGGT CAGCGGTCGG TCTGACCTGA GCTGGGAGGA TGGCCTCCGA CTCGACCTCT ACTACGTGGA GAACTGGTCC CTCACCGCCG ACCTGACCAT CTTGTGGAAG ACTTTCGGGG CGGTGCTGAA GCGTCGTGGT GCCTACTAG
|
Protein sequence | MNSTMLLTPG RSAVVNGLFR PWTRAAVRSY IQTLVVLDSA VLIVAVLVAY VAHFGGGLPR GAEIPYAVAA PGLVLAWLVS LRALRCYDDR IIGYGADEYR RVSSASLRLA GAVVIAGYVF DVEVPRGFLA IAFAVGTVGL ESARFTARKR LHRSRSRGGG WSRRVLVVGD TAHVLELVDT LRREPYAGYQ VVGACIPDAL LAPVPQQLGD VPVVGSFRSI PEAVATIDAD TVAVTASGQL TATRLRRLGW QLEGTGVDLV VAPALTDVAG PRIHTRPVAG LPLIHVEAPE FRGVGKLVKG LVDRLAALLV LMPLLPLLAL IALAVTVDSR GSALFRQTRV GQGGREFGVW KFRTMVINAD AMLAELTARN ETDGLMFKLR DDPRVTRIGR VLRKWSLDEL PQLVNVLFGQ MSLVGPRPPL PSEVARYDGD IARRLLVKPG MTGLWQVSGR SDLSWEDGLR LDLYYVENWS LTADLTILWK TFGAVLKRRG AY
|
| |
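The gene sequence above begins with GTG and ends with TAG, so it appears to be listed in coding orientation even though the gene lies on the minus strand. Under translation table 11 the codon assignments are the same as in the standard code, but alternative start codons such as GTG are read as methionine at the first position, which is why the protein begins with M. The sketch below illustrates this on the first 30 bases of the record; `translate_cds`, `gc_content`, and `clean` are hypothetical helper names written for this example, not functions of an existing library.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Drop the spaces between 10-base blocks and normalise case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate_cds(seq: str) -> str:
    """Translate a CDS in coding orientation, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        # An alternative start codon at position 0 is read as methionine.
        aa = "M" if i == 0 and codon in START_CODONS else CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in the record above.
prefix = "GTGAACTCGA CGATGCTGTT GACCCCCGGC"
print(translate_cds(prefix))  # MNSTMLLTPG
```

Passing the full gene sequence to `translate_cds` and `gc_content` should allow a comparison against the listed protein sequence and GC content.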