Gene Sare_3974 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3974 
Symbol 
ID5705251 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4513961 
End bp4515439 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content68% 
IMG OID641273399 
Productundecaprenyl-phosphate galactose phosphotransferase 
Protein accessionYP_001538755 
Protein GI159039502 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2148] Sugar transferases involved in lipopolysaccharide synthesis 
TIGRFAM ID[TIGR03022] Undecaprenyl-phosphate galactose phosphotransferase, WbaP
[TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.321336 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00335965 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAACTCGA CGATGCTGTT GACCCCCGGC CGGTCAGCGG TGGTCAATGG CCTGTTCCGG 
CCGTGGACAC GTGCTGCTGT GCGGTCCTAC ATCCAGACTC TGGTGGTGCT CGACAGTGCG
GTGCTGATCG TGGCCGTCCT CGTCGCGTAC GTCGCCCACT TCGGCGGCGG GCTTCCCCGC
GGTGCCGAGA TTCCGTACGC CGTGGCCGCC CCTGGCCTGG TGCTGGCGTG GCTTGTCTCG
CTCAGGGCGC TACGGTGCTA CGACGATCGG ATCATCGGCT ATGGCGCCGA CGAGTATCGG
CGGGTGAGTT CGGCCAGCCT GCGCCTCGCT GGTGCCGTGG TGATCGCCGG CTACGTCTTC
GATGTCGAGG TGCCGAGGGG TTTTCTCGCC ATCGCCTTCG CCGTCGGCAC CGTCGGGCTC
GAGTCGGCCC GGTTCACCGC CCGTAAGCGA CTGCATCGAT CCCGGTCGCG GGGCGGCGGA
TGGTCACGGC GAGTCCTCGT GGTCGGCGAC ACCGCGCACG TCCTGGAGTT GGTAGACACG
CTGCGGCGTG AGCCGTACGC GGGCTACCAG GTGGTCGGGG CGTGCATCCC GGACGCACTG
CTTGCTCCGG TCCCACAGCA GCTGGGCGAC GTGCCGGTGG TCGGTTCGTT CCGGAGTATC
CCCGAAGCAG TTGCCACCAT CGATGCTGAC ACCGTGGCGG TGACCGCCTC CGGGCAGCTG
ACCGCTACCC GGCTTCGCCG GCTCGGCTGG CAGCTGGAGG GAACCGGCGT TGACCTGGTG
GTCGCGCCGG CACTGACCGA CGTCGCGGGC CCTCGGATCC ATACCCGTCC GGTGGCCGGA
CTGCCACTGA TACATGTCGA GGCCCCTGAG TTCCGGGGCG TGGGCAAGCT GGTGAAAGGG
CTGGTCGACC GGCTGGCCGC GCTGCTCGTA CTGATGCCGC TGCTGCCGTT GCTGGCGCTG
ATCGCGTTGG CGGTCACGGT CGACAGTCGG GGATCGGCGT TGTTCCGGCA GACCCGGGTC
GGGCAGGGGG GCCGTGAGTT CGGCGTGTGG AAGTTCCGCA CAATGGTGAT CAACGCGGAC
GCCATGCTGG CGGAGCTGAC CGCCCGCAAC GAGACCGACG GCCTGATGTT CAAGCTGCGG
GACGACCCCC GGGTGACCCG GATCGGTCGC GTGCTGCGCA AGTGGTCCCT GGACGAACTG
CCCCAGCTCG TCAACGTCCT GTTCGGGCAG ATGAGCCTGG TGGGCCCCCG CCCACCGCTG
CCGTCGGAGG TCGCACGTTA CGACGGCGAC ATCGCCCGGC GGCTGCTGGT CAAGCCCGGC
ATGACGGGTC TCTGGCAGGT CAGCGGTCGG TCTGACCTGA GCTGGGAGGA TGGCCTCCGA
CTCGACCTCT ACTACGTGGA GAACTGGTCC CTCACCGCCG ACCTGACCAT CTTGTGGAAG
ACTTTCGGGG CGGTGCTGAA GCGTCGTGGT GCCTACTAG
 
Protein sequence
MNSTMLLTPG RSAVVNGLFR PWTRAAVRSY IQTLVVLDSA VLIVAVLVAY VAHFGGGLPR 
GAEIPYAVAA PGLVLAWLVS LRALRCYDDR IIGYGADEYR RVSSASLRLA GAVVIAGYVF
DVEVPRGFLA IAFAVGTVGL ESARFTARKR LHRSRSRGGG WSRRVLVVGD TAHVLELVDT
LRREPYAGYQ VVGACIPDAL LAPVPQQLGD VPVVGSFRSI PEAVATIDAD TVAVTASGQL
TATRLRRLGW QLEGTGVDLV VAPALTDVAG PRIHTRPVAG LPLIHVEAPE FRGVGKLVKG
LVDRLAALLV LMPLLPLLAL IALAVTVDSR GSALFRQTRV GQGGREFGVW KFRTMVINAD
AMLAELTARN ETDGLMFKLR DDPRVTRIGR VLRKWSLDEL PQLVNVLFGQ MSLVGPRPPL
PSEVARYDGD IARRLLVKPG MTGLWQVSGR SDLSWEDGLR LDLYYVENWS LTADLTILWK
TFGAVLKRRG AY