Gene Sala_1667 details

Gene Information

Locus tag: Sala_1667
Symbol:
ID: 4081050
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 1754744
End bp: 1755844
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 67%
IMG OID: 638010041
Product: hypothetical protein
Protein accession: YP_616713
Protein GI: 103487152
COG category: [S] Function unknown
COG ID: [COG2855] Predicted membrane protein
TIGRFAM ID: [TIGR00698] conserved hypothetical integral membrane protein
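
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon; the function names are hypothetical and not part of the database.

```python
# Values copied from the record above.
START_BP = 1754744
END_BP = 1755844
GENE_LENGTH_BP = 1101
PROTEIN_LENGTH_AA = 366


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)       # True
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

Under these assumptions, 1755844 - 1754744 + 1 gives 1101 bp, and 1101 / 3 - 1 gives 366 aa, matching the listed values.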


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAAGACT GCGAAGCGAT GACCCGCGGC CCCGCCCCTT CCGAACCGTA TCGCGGGGAT 
CTTTTCGGGG AAATCCACCT CGCCGATATG ACCGATGCTG CGCCAACGCC CGGGATCGCC
CGATATTTCC CCGGCCTGGC GATCTGCGCC GCGGCGGCTG GCGCGGCCGG ATGGCTCTCG
GACCATTATG GTGTGCCCGT CATCCTGCTC GGCCTGCTCA TCGGGCTCGC GCTCAATTTC
GTCGCGCGCG ATGCGCGCAC CCATCGCGGT CTCGACTTCG CCTCGCACAC CTTCCTGCGG
ATCGGAATTG TGCTTCTCGG CTTTCAGGTC AGCATCGCAC AGATCGTCGC GCTGGGAGCG
CTCCCGTTTG CCGCACTGAT CCTCATCATG GCGGTGGCCT TTGCCGCCGG ACTTGCCGGC
GCTCGCCTGT CGCGCCAGTC ACCATATGCG GGCCTCCTTG CCGGTGGCGC GACGGCGATT
TGCGGCGCCA GCGCCGCGCT CGCGCTCTAT GGCATCGTCG GCAAAGAGCG GCTCAGCCAG
GCACAATTTG CGCTGACGCT GGTGGGTGTG TCGATGGCCA GTGCGCTGGC GATGTCGCTT
TATCCCGCCA TTGCGGCCGA ACTGGAACTC AGCGACGCGC AGGCCGGTTA CCTGATCGGC
GCCTCGATAC ACGATGTCGG CCAGGCAATC GGCGGCGCTT ATGCTGTTTC GGACGCAGCA
GGCATCGATG CCACGATCGT CAAGCTGGCG CGCGTTACGC TGCTTGCCCC CGTCGTGCTG
CTCGTTTCGC TGGTGATCGG CCCGGCGCGC GCCGGGCCGT CCCGACCCAG CTGGCGGCGA
CTGGGCATGC CGTGGTTCAT CACGCTCTTT CTTGCCGTTG TCGCGGTCAA CAGCCTGATC
GACCTTCCTG CCGTCGCGGC AACCAAGGCG CTTGCCGCAT CCAAGGCGCT GCTGCTGCTC
GCCGTGACGG CCACCGCCAT GCGTTCACGC ACCGACCTGC TCCTCGAGCT CGGCTGGCGG
GCGGCCGCTC CCGTCATGGC GGCTTCGCTG GCAAGCTTTG CGGCCGCACT TTTCTTCGTA
ATGATCGGGG TGGGTGACTG A
 
Protein sequence
MKDCEAMTRG PAPSEPYRGD LFGEIHLADM TDAAPTPGIA RYFPGLAICA AAAGAAGWLS 
DHYGVPVILL GLLIGLALNF VARDARTHRG LDFASHTFLR IGIVLLGFQV SIAQIVALGA
LPFAALILIM AVAFAAGLAG ARLSRQSPYA GLLAGGATAI CGASAALALY GIVGKERLSQ
AQFALTLVGV SMASALAMSL YPAIAAELEL SDAQAGYLIG ASIHDVGQAI GGAYAVSDAA
GIDATIVKLA RVTLLAPVVL LVSLVIGPAR AGPSRPSWRR LGMPWFITLF LAVVAVNSLI
DLPAVAATKA LAASKALLLL AVTATAMRSR TDLLLELGWR AAAPVMAASL ASFAAALFFV
MIGVGD
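
The sequences above are printed in space-separated blocks of ten, so they need to be joined before any analysis. The following is a minimal sketch of how the blocks might be joined and a GC percentage computed; the helper names are hypothetical, and the sample input is only the last printed line of the gene sequence, not the full gene. Note that the gene begins with GTG, which translation table 11 accepts as an initiation codon read as methionine, consistent with the leading M of the protein.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Last printed line of the gene sequence, used only as a short sample.
tail = join_blocks("ATGATCGGGG TGGGTGACTG A")
print(len(tail))      # 21
print(tail[-3:])      # TGA (stop codon)
print(gc_percent(tail))
```

Passing the full gene sequence through `join_blocks` and then `gc_percent` would allow the listed GC content to be compared against the sequence itself.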