Gene Sala_0601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_0601 
Symbol 
ID4080629 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp610286 
End bp611737 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content67% 
IMG OID638008960 
Productbacteriophage N4 adsorption protein B 
Protein accessionYP_615655 
Protein GI103486094 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCATCGG TGGGGCGGGG CTGGCAAGTG GAGCTATCGA CCGGGTGGCT GGAATGGCTG 
GTGCTGGGGG CGGGCCGCGA ACTGATGCTG TTCGCATCGG TCGGCATATT GCTGATCGGG
CTCGACGATC TGTTGCTCGA CGCGCTGTGG CTGGCGACGC GCGGGCAGCG CCGAGGCGAA
ACCGCGAGAG CGCCGCCGAT TGAGGGGCGT ATCGCCATTT TCGTGCCGGC GTGGGACGAG
GCCGCGGCGC TGCCCGCGAT GCTTTGCCGG ACCCTTGCCG CGTGGGACGG CGAGGATTTC
CGGCTCTATG TCGGATGCTA TCCCAATGAC ACGGCGACGA TCTATGCCGT CTCGCAATTG
GTCGCGCGCG ACGCGCGGCT GCGGCTGGTG ATCGGCGAGA GCGAAGGCCC GACGACCAAG
GGCGACAATC TGAACAGGCT CTGGGCCGCG CTTTGTGCCG ACGAACGGGT GGAGGCCCGG
CGCTTTGCCG CGATCGTGCT TCACGATGCC GAAGATCATG TCCATCGGCA CGAACTTGCG
CTCTATCGGC AGCATTTGGC TCATAATGCG ATGGTGCAGA TTCCCGTCGT GCCGATAATC
GACCGGCGTG CGCGCTGGAT CGGCGGCCAT TATGCGGATG AGTTTGCCGA GGCGCACGGC
AAGGATATGC CGGTGCGCTC GCGCCTTGGC CTGCCGCTGC CCTCGGCCGG CGTCGGCTGC
GCCTTGACCC GCAGCGCGTT GGCCCTGCTC GCGATGGAGC GAGGGGGGTG TCCCTTTTCG
AGCGACAGCC TGACGGAGGA TTATGAGATC GGGATGGTGA TCGGCGCCTA TGGCCTCGGC
GCGCGCTTCG TCGATGCGGC CGATCCCGCA GGCGACCGGA TTGTGTCGCG GGGCGCGTTT
CCGGGCCGCA TCGACGCCGC GGTTCGGCAA AAGTCGCGCT GGATCGCCGG CATCGCAATG
GCGGGCTGGG ATCATCTGGG TTGGCCCGGC TGTCGCCTGG GTCACAAGCA ACGATCGACG
GGACGCGACC TGCTCGCGCG CTGGATGCTC TGGCGCGACC GTCGCGCGCC GCTCGCGGCG
CTCATCCTGC TGGCCGCCTA TGCGGGGCTC ATTCTCGTCG CAGCGGGGGT GGCGGGACAA
TTGCTGCTGG GCTGGAATGC GATCGAACCG GGGCCGACAT TGCAATGGCT GCTCGTCGTA
AACGCGCTGC TTCTCGGCTG GCGCATGGCG CTGCGTATCC ATTTCACCGC GCGCCTTCAT
GGCTGGCGCG AAGCGTCGTT TGCCGTACCG CGTGCCTTTG TGGCGAACAT CATCGCCATG
CTCGCGGCAC GGCGTTCCGT GCTGCTTTAC TGGCGGATAT TGCGCTCGGG CGAAGTGGTG
TGGGACAAGA CCGACCACAG CGAAACCGGC CTCGCGGTCG CGGATGCGCC GGTGCGCGTG
GCGATGCGGT GA
 
Protein sequence
MPSVGRGWQV ELSTGWLEWL VLGAGRELML FASVGILLIG LDDLLLDALW LATRGQRRGE 
TARAPPIEGR IAIFVPAWDE AAALPAMLCR TLAAWDGEDF RLYVGCYPND TATIYAVSQL
VARDARLRLV IGESEGPTTK GDNLNRLWAA LCADERVEAR RFAAIVLHDA EDHVHRHELA
LYRQHLAHNA MVQIPVVPII DRRARWIGGH YADEFAEAHG KDMPVRSRLG LPLPSAGVGC
ALTRSALALL AMERGGCPFS SDSLTEDYEI GMVIGAYGLG ARFVDAADPA GDRIVSRGAF
PGRIDAAVRQ KSRWIAGIAM AGWDHLGWPG CRLGHKQRST GRDLLARWML WRDRRAPLAA
LILLAAYAGL ILVAAGVAGQ LLLGWNAIEP GPTLQWLLVV NALLLGWRMA LRIHFTARLH
GWREASFAVP RAFVANIIAM LAARRSVLLY WRILRSGEVV WDKTDHSETG LAVADAPVRV
AMR