Gene Sala_2157 details

Gene Information

Locus tag: Sala_2157
Symbol:
ID: 4080191
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 2267982
End bp: 2269139
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 66%
IMG OID: 638010535
Product: glycosyl transferase, group 1
Protein accession: YP_617199
Protein GI: 103487638
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
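
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it matches the reported gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2267982
end_bp = 2269139
gene_length_bp = 1158
protein_length_aa = 385


def span_length(start, end):
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one for the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons should hold for an unspliced, non-pseudo CDS.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Here 2269139 - 2267982 + 1 = 1158 bp, and 1158 / 3 = 386 codons, one of which is the stop codon, leaving 385 residues.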


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.750957
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.801991
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGTTT CGGACCTTCG CATCGCCCTG TTCAGCGGCA ATTACAACAT GACGACCGAC 
GGCGCGAACA AGGCGCTCAA TCGCCTCGTC GGATATCTGC TGGCGCAGGG CGCGGCGGTG
CGCGTCTATT CGCCGACCGT CGCCCACCCC GACTTCGAAC CCACCGGCGA CCTCGTCAGC
GTGCCGTCGA TGGCGATCCC CGGGCGCAGC GAATATCGCA TACCCTTGAG CTTTTCATCG
AGGGTGCGCC AGGACATCGC CACCTTTGCG CCCAACATCG TCCATATCTC CAGCCCCGAT
CGCGTGGCGC GACAGGCGGC GGCGTGGGCG CGGCGGCGCC GGATTCCGGT GGCCTGTTCG
GTCCATACGC GCTTCGAAAC CTATTTCCGC TATTATAATC TGTCGTTCCT CGAACCGCTC
GTGGTCGCCT GGCTGCGCAA ACTCTATCGC CGCTGCGACG CGCTGATCGC GCCGTCCGAA
AGCTTTGCGC AGGTGCTCCG CGACCAGCGG ATGAATTATG ACATCGGCAT CTGGACGCGC
GGCGTCGAAC AGGGGATTTT TCACCCCGGC CGCCGCGACA TGGCCTGGCG CCGGTCGCTC
GGCATCGCCG ACGACACGCC CACTATCGCC TTCCTCGGGC GGCTGGTGAT GGAAAAGGGG
CTCGACGTCT TTGCCGATGC CATCGACGTG CTGACGCGCC GCGGCGTGCC GCATCAGGTG
GTGGTGATCG GCGAGGGGCC GGCGGGCGAC TGGTTCGAAT CGCGCCTGCC CAACGCGCAT
TTCGTGGGCT TTCAGGGCGG CGCCGATCTC GCTCATGCGC TTGCGTCGTG CGACATCTTC
TTCAACCCGT CGGTCACCGA AACCTTTGGC AATGTCACGC TCGAGGCCAT GGCGTGCGGG
CTGCCGGTGG TGGCGGCGCG CGCGACGGGC AGCGCGAGTA TCGTCAAGCA TGGCCAGACG
GGCTATCTCG TCGCACCGGG ATCGATCTCG GGCTTTGCCG ACCATCTCGA GCGTTATTGC
AACGATACCG CGCTGCGCGC CGACCATGGC GCCGCGGCGG TGCGCGAAAG CGGCGCCTAT
CAGTGGGATG CGATCAATCA GGCGGTTGCC GACACCTATT TGCGCCTGAT CCGCCAGAAA
CAGCGGCACG GGGGCTGA
 
Protein sequence
MDVSDLRIAL FSGNYNMTTD GANKALNRLV GYLLAQGAAV RVYSPTVAHP DFEPTGDLVS 
VPSMAIPGRS EYRIPLSFSS RVRQDIATFA PNIVHISSPD RVARQAAAWA RRRRIPVACS
VHTRFETYFR YYNLSFLEPL VVAWLRKLYR RCDALIAPSE SFAQVLRDQR MNYDIGIWTR
GVEQGIFHPG RRDMAWRRSL GIADDTPTIA FLGRLVMEKG LDVFADAIDV LTRRGVPHQV
VVIGEGPAGD WFESRLPNAH FVGFQGGADL AHALASCDIF FNPSVTETFG NVTLEAMACG
LPVVAARATG SASIVKHGQT GYLVAPGSIS GFADHLERYC NDTALRADHG AAAVRESGAY
QWDAINQAVA DTYLRLIRQK QRHGG
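
The protein sequence can be reproduced from the gene sequence with a plain codon lookup, and the same cleaned sequence gives the GC percentage. The following is a minimal sketch, not the pipeline that generated this record. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares for internal codons. Table 11 also permits alternative start codons, which this sketch does not handle, and that does not matter here because the gene starts with ATG. The helper names `clean`, `translate` and `gc_percent` are hypothetical. Only the first 60 bases of the gene sequence are embedded as an example.

```python
# Standard codon assignments in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 1 and stop at the first stop codon.

    Raises KeyError on ambiguous bases such as N, which this sketch
    does not attempt to handle.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence above (60 bases, 20 codons).
first_row = "ATGGACGTTT CGGACCTTCG CATCGCCCTG TTCAGCGGCA ATTACAACAT GACGACCGAC"
print(translate(first_row))  # MDVSDLRIALFSGNYNMTTD
```

Passing the full 1158-base gene sequence to `translate` and `gc_percent` would be the way to compare against the 385-residue protein and the reported GC content.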