Gene Sala_1012 details


Gene Information

Locus tag: Sala_1012
Symbol:
ID: 4081700
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 1039457
End bp: 1040686
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 69%
IMG OID: 638009372
Product: protein of unknown function DUF894, DitE
Protein accession: YP_616062
Protein GI: 103486501
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
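
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 1039457
END_BP = 1040686
GENE_LENGTH_BP = 1230
PROTEIN_LENGTH_AA = 409


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1
```

Under these assumptions, 1040686 - 1039457 + 1 gives 1230 bp, and 1230 / 3 - 1 gives 409 residues, which agrees with the listed gene and protein lengths.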


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.171336
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGTCCTG AACCGCCCTC CGCCTTCGCG CCCTTTCGCT ACCCGGCGTT CCGCGCGATC 
TGGATCGCCA ACCTCGCGTC GAACATGGGA TCGATGATTC AGTCGGTCGC GGCGGCATGG
CTGATGACCG ACCTCACCGA CTCGCACCTC CTGATCGCGC TGGTGCAGGC CGGCACAACA
ATCCCGATCA TGCTTCTCGG CATTTTCGCG GGTGCCATCG CCGACAATTT CGACCGGCGC
CGTATCATGC TCGCGGCGCA GACGGGGATG CTGCTCGTTT CGGCAGCGCT GACGGTGACC
ACCTGGCTCG GTGCGACCAC GCCGCTGTCG CTGCTCTTCT TCACCCTCGC GGTCGGCTGC
GGGACCGCGC TCAACGGCCC CGCATGGCAG GCGTCGGTGC GGCTTCAGGT GGGACCGAAG
GATCTGCCGC AGGCGATCAC GCTCAACACC ATCGCCTTCA ATCTTGCGCG TAGCGCAGGG
CCGGCGCTCG GCGGCCTGCT GATCTCGATC GTCGGCGCCG CCGCGGCGTT CGGCCTCAAC
GCGCTGAGCT ATGTCGCACT GATCGTCGTA TTGCTGCGCT GGCATCCCGA CACCGTGCCG
CCGCGCCGCA CGCCGATGCT GTCCGCGATC GCGGCAGGGC TGACGTTCTG CGCGCATTCG
GACCCCTTGC GCCGTGTCCT CGTTCGCGGG TTCGCCTTCG GTTTCGGCGC GGCGGGATTC
CAGGCGCTGC TTCCCTCACT CGTCCGCGAC CGGCTGGGCG GAACCGAAAT CATCTACGGC
CTCTGCCTTG CGGCCTTTGG CGCGGGATCG ATCTTTGCCG CGCTGTGGGT CGGCGCGGCG
CGGCGCCGCT GGGGCAGCGA CCGCGTGGTG ACAGCCGCGT CGCTGGTCTT TTCCGCCGCG
ATGCTGCCTG TCGCGATGAC CATCAGCCTG CCCGCGCTGA TGCTGGCCGC ATTCGTCGCG
GGCGGCGCCT GGGTATCGAC GCTGACGACG CTCAACGTCG CGATGCAAAT GCGCTCGCCC
GAAGAGATTC TGGGGCGCTG CCTGTCGATC TATCAGGCGG TGACTTTTGG CGCGATGGCG
CTCGGCGCCT ATGCGTTCGG GCTGCTGGCC GACCTCGCCG CGCTGCCTGC CGCGATCCTC
GCATCGGCAG GCTGGCTGCT GGTCTCGGCG CTCATATTGC GCCTCATCGC GCCGATGCCG
CGCCGCGACG AGGGCCGCGT CCTGCCCTGA
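
The GC content field can be recomputed from a sequence pasted in the grouped layout used above. This is a minimal sketch; `gc_percent` is a hypothetical helper name, and it assumes the sequence contains only A, C, G and T plus whitespace.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)
```

For example, the first ten bases of the gene, `GTGAGTCCTG`, contain six G or C bases, so `gc_percent` returns 60.0 for that fragment.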
 
Protein sequence
MSPEPPSAFA PFRYPAFRAI WIANLASNMG SMIQSVAAAW LMTDLTDSHL LIALVQAGTT 
IPIMLLGIFA GAIADNFDRR RIMLAAQTGM LLVSAALTVT TWLGATTPLS LLFFTLAVGC
GTALNGPAWQ ASVRLQVGPK DLPQAITLNT IAFNLARSAG PALGGLLISI VGAAAAFGLN
ALSYVALIVV LLRWHPDTVP PRRTPMLSAI AAGLTFCAHS DPLRRVLVRG FAFGFGAAGF
QALLPSLVRD RLGGTEIIYG LCLAAFGAGS IFAALWVGAA RRRWGSDRVV TAASLVFSAA
MLPVAMTISL PALMLAAFVA GGAWVSTLTT LNVAMQMRSP EEILGRCLSI YQAVTFGAMA
LGAYAFGLLA DLAALPAAIL ASAGWLLVSA LILRLIAPMP RRDEGRVLP
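
The protein sequence can be reproduced from the gene sequence using translation table 11. The sketch below assumes the NCBI codon layout (bases ordered T, C, A, G) and the initiation codons NCBI lists for table 11; `translate_cds` is a hypothetical helper, and it treats the first codon as an initiator, which is why the leading GTG is read as M rather than V.

```python
from itertools import product

# NCBI genetic-code layout: codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
# Amino acids for table 11; the codon assignments match the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed by NCBI for table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Whitespace is ignored. Ambiguous bases are not handled and raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An initiator codon at position 0 is translated as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)
```

Applied to the first row of the gene sequence, `translate_cds` yields `MSPEPPSAFAPFRYPAFRAI`, the first 20 residues of the protein listed above.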