Gene Csal_1693 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1693 
Symbol 
ID4028531 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp1923769 
End bp1924809 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content46% 
IMG OID637966881 
Productundecaprenyl-phosphate galactosephosphotransferase 
Protein accessionYP_573744 
Protein GI92113816 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2148] Sugar transferases involved in lipopolysaccharide synthesis 
TIGRFAM ID[TIGR03023] Undecaprenyl-phosphate glucose phosphotransferase
[TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000262579 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAGTGA TCTCATATCT ATTAGCTAGT AGTGCTCGAG TAATGGCACA AATTATGTTG 
CGGCAGGCAC GCTTGAGAGG ACGTGCTAGG CGCGCTGTAT TTCTCGTTGG CCCTGGCCAG
CAACTATTGA AGCTAGCGAA GAGTATGCGT GCTTCTCCAG GTGAAGGGTA TTCGATTGCC
GGTTTTGAGC GTCTCCCCCG ATTACCTGAT GAGCAATGTC TAGAACGGAT AGTTAGGCGT
GTTAGTGAAA CGCAAGCTCG CGAAGTCTGG ATATCAGTTC CGCTGGAAAT GGGGAGTGCG
GTACGAGATA TTTTTTATGC GCTTCGCAAT CATACCGCAG AGGTGCGTTT TTTGCCGGAC
TTCCCAGATA TGCAGTTGTT GAACCATCGC ATGAGTGAAG TCGCGGGGCA TCTTTCAGTT
GATCTAAGTG TTACACCTAT GAGTGGCATG GCGCGTGTGC TAAAACGTAT GGAAGACCTC
CTTCTCGGCT TTCTAATAAC AGCCATGATT CTTCCGTTTT GTTTGGCTAT CGCTGTAGGA
GTTAAAGTAA CTTCGCCAGG GCCCATATTG TTTAAGCAGT ATAGAACGGG CGCGAATGGA
AAAAAATTTA AAGTTTATAA GTTTCGCTCT ATGCGGATGC ACAAAGAAGA GCAAGGCCAA
GTTACCCAAG CGTCGAGGTC TGATCCTCGC GTGACTCCTA TAGGGTCTTT TCTTCGGCGC
ACCTCCTTGG ACGAGCTGCC ACAGTTCTAC AATGTGTTGC AAGGGCGGAT GTCTATTGTG
GGTCCACGGC CTCATGCGCT TTCTCATAAC GAACATTACA AAGAATTAGT GGAGTCCTAT
ATGAGACGCC ACAAAGTAAA ACCTGGTATT ACCGGGTGGG CTCAGGTCAA TGGTTTTCGT
GGCGAGACTG ATACTCTTGA AAAAATGGAA CAGCGTGTAA AGCACGATTT ATGGTATATC
GACAATTGGT CGTTATGGCT GGATGTAAAA ATTATTTTTT TGACTGTTTT TAAAGGTTTT
GTGGGTCGGA ATGCTTATTA G
 
Protein sequence
MLVISYLLAS SARVMAQIML RQARLRGRAR RAVFLVGPGQ QLLKLAKSMR ASPGEGYSIA 
GFERLPRLPD EQCLERIVRR VSETQAREVW ISVPLEMGSA VRDIFYALRN HTAEVRFLPD
FPDMQLLNHR MSEVAGHLSV DLSVTPMSGM ARVLKRMEDL LLGFLITAMI LPFCLAIAVG
VKVTSPGPIL FKQYRTGANG KKFKVYKFRS MRMHKEEQGQ VTQASRSDPR VTPIGSFLRR
TSLDELPQFY NVLQGRMSIV GPRPHALSHN EHYKELVESY MRRHKVKPGI TGWAQVNGFR
GETDTLEKME QRVKHDLWYI DNWSLWLDVK IIFLTVFKGF VGRNAY