Gene Sala_2028 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_2028 
Symbol 
ID4079926 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp2139974 
End bp2141053 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content69% 
IMG OID638010404 
Producthypothetical protein 
Protein accessionYP_617072 
Protein GI103487511 
COG category[S] Function unknown 
COG ID[COG4427] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0314361 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAGC GTATCCAGCC CGCCGGAATC GGCGCCACCG GGAAAGGCGC TGTGCGCGCT 
GCCTTCGCAA ATCAAGTGGC CTATTGCCGC GCCAACGATG CCCCCATCAC CGCGCGTATC
GTCGCCGCGA TCGCCAGCCT GCTGGACGAC CCCGCGAGCA ATTTTGCGCG CCGCATCGCC
AACTGGCCGG GCGCGCCGCT CGCCGACGCG CTGCCGCTTC GCGCCGCGGG GGGCTTTCAC
GCGCTGCACC TGTCGGAGGC TGCGCATGAA CTCGCCCCCA TTTATGCCGA CGCCGAGGAC
ATCAACGACG CCGCGATCGT CGCAGGTGTG GTTGCACGGC ATGAAGCCGC GCTGCTCCCC
TGGCTCGACG GCCCGCCGCA GACCAACGAG GCGGGGCGCT CGTCAAACTT CATCGCGGCG
ATGCTGTGGC TCGCCGAACA GGGGTTGCCA GCGCATTTCG ACTGCCTTGA AATCGGATCG
AGCGCGGGCA TCAATCTGAT GATCGACCGT TATCATTATG ACCTCGGCGG CGTGCATGTC
GGGCCGCAGC CCGGCGCGAT GGCCTTCACC CCCGATTGGC GCGGCAACCA TCCGCCCATG
CACGCAATCG CCATTGCCGG GCTCAGGGGC TGCGACGTTG CGCCGGTCGA TCTCACCGAC
CCGGCGCAGG CGCTCCGCCT CAAAGCCTAT ATCTGGCCCG AACATGACGT CCGCTTCGCG
CGCATGGAAG CGGCGATCGC CGCCGCGTAT GTGGAAAAGC CCTGTCTCAT CCGCGCCAAC
GCCGCCGATT TCGTCGAGGC CGAGCTGGCA CGGCCACAGG CGGCGGGAAC GACGCGCGTG
CTGATGCACT CGATCGTCTG GCAATATGTC CCCGCCGAGC AGCAGGCGCG CGTCACCGCC
GCCATGGAAG TCGCGGGCGC CCGCGCCACC GCCGACCGCC CCGTCGCATG GATCGCGCTC
GAAGCGAACC GGACCGTCCA CCATCACGAA CTGGTCGTGC GCTACTGGCC GGGCGGCGAC
GTCCCCCGCA AGCTGGGCCA TGCCCACGCG CACGGCGCGT GGATCGAGTG GCTGGCGTAA
 
Protein sequence
MSERIQPAGI GATGKGAVRA AFANQVAYCR ANDAPITARI VAAIASLLDD PASNFARRIA 
NWPGAPLADA LPLRAAGGFH ALHLSEAAHE LAPIYADAED INDAAIVAGV VARHEAALLP
WLDGPPQTNE AGRSSNFIAA MLWLAEQGLP AHFDCLEIGS SAGINLMIDR YHYDLGGVHV
GPQPGAMAFT PDWRGNHPPM HAIAIAGLRG CDVAPVDLTD PAQALRLKAY IWPEHDVRFA
RMEAAIAAAY VEKPCLIRAN AADFVEAELA RPQAAGTTRV LMHSIVWQYV PAEQQARVTA
AMEVAGARAT ADRPVAWIAL EANRTVHHHE LVVRYWPGGD VPRKLGHAHA HGAWIEWLA