Gene Sala_1874 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_1874 
Symbol 
ID4082619 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp1969871 
End bp1971496 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content71% 
IMG OID638010250 
Productsporulation related 
Protein accessionYP_616919 
Protein GI103487358 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.138389 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.137004 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCAAG CGATATTGAC GGCGAAGCCG GCGCGCACAC GGCGTAACGT GGCAGCGCTG 
GGCGCGTTGT TGTTGGGGAC TCTCGGCATT CCGTCGGTGC ATGCGATGCA GGCGGCCGCG
CCCGACGCGG CAACGCGCGC GGCGATGGAA AAGCGCAGCG CCGCGCGCGC GCTGCTTTCC
GCGTCGCTGG CACGGCTGGC GTCGAACAAT AATGATGCGA CCGCGCTGCT CGACGCCGGG
CGTGCGTCGA TCTCGCTCGA GGATTATCGC GCCGCCCTCG GCTTTCTGCT CCGCGCCGAA
CAGGCCAGGC CGCGCGACGG CGCGGTCAAG GCGGCGCTCG GCTCGGCGAT GGTGCATTCC
GAAAATCCGA CGCGCGCGCT CGACTATTTC GGCGAGGCGC AGCTTCTGGG CGCACCCGAA
CGGCTGTTCC TCGCCGACCG CGGGCTCGCG CGCGACCTGC TCGGACAACA GGATGCGGCA
CAGCGCGATT ATCAGCTCGC GCTGTCGATC GCGCCCGATG CCGAACTGAC CCGGCGTTAT
GCCCTGTCGC TGGGGATCAG CGGCGACCCC GATCGCGCGA TCCAGCTGCT GACGCCGCAA
TTGCGCGCGC AGGATCGCGG CGCGTGGCGG CTTCGCGCGA TGATCCTGGC GATGAACGGC
CGCGACAGGG AGGCAAGCGA GATCGTCAAC GCGACGATGC CCGCGCCGAT GGCAGCCAAT
ATCCTGCCCT ATCTGGTGCA GATGGACCGG CTCAATCCCG CGCAAAAGGC TGCCGCGGCA
CATTTCGGTC GCTTTCCGAG CGGCCAGCCC GCCGCGGCGC AGCGGCCGGT TCAGGTGGCG
ACCGCGACGC CGACGCCCCG GCCGGCGCCC GCTCCGCGCC GCAGCGCGCC GACCTCCACG
CCCGCGGCCG CCGCTCCGGT CCCGAAGCCG CCGCCGCCGC CCGCCGCGAT GCCGCCCAGC
CGCCCGCGCG CCGAAACGCC CGTGCCAGCC TCTTCCCCGC CCGCAAATCC CCCGACGTCG
GCGGTGAAAG CGCCGGCGGG GCCGGGCTTC TCGATCGCCG ACATCGCGCC CGCACCGCCC
GCCGCGGCCC CGGCGGCGCC GCGGCCCGCT GCGCAGGCAC CCGCCGCCGC GCCGCTCGCC
TCGCTCGCCG ACATCGTTGG GTCGATCGAG ATACCCCCCG AGGAACTCGC GCGCCCCGAC
GATGCGATCG GCGCCGAGAC GCTCGCCAAG CTGCTCGATG ACAAGCGCAA GGCCGAGGCT
GCCGAAGCGG CAAAGCGCGA GAAGGAAGAA GCCGCCGCCA GGGCAAGGGC CGAGGCCGAC
GCCAGGGCGA AAGAAGAAGC CGCGAAGAAA AAGGCGCATC CCGCGCGTAT CTGGGTGCAG
ATCGCGACCG GCGCCAACCC GAAGGCGCTC GCTTTCGACT ATAACCGCTT CGCCAAGCGC
AATGCGGCGC TGTTCAAGGG CAAGGCGGGC GCGACCGCCG AATGGGGTCA GACCCGGCGC
CTGCTCGTCG GCCCGTTCGC GAACCGCAAG GCGGCGCAGG ACTGGCTCGC CGATTACAAA
AAGGCCGAAG GCGACGGTTT TCTTTTCAGC TGCGAGGTCG GCGAGATCGT CGAACCGCTG
CAGTGA
 
Protein sequence
MRQAILTAKP ARTRRNVAAL GALLLGTLGI PSVHAMQAAA PDAATRAAME KRSAARALLS 
ASLARLASNN NDATALLDAG RASISLEDYR AALGFLLRAE QARPRDGAVK AALGSAMVHS
ENPTRALDYF GEAQLLGAPE RLFLADRGLA RDLLGQQDAA QRDYQLALSI APDAELTRRY
ALSLGISGDP DRAIQLLTPQ LRAQDRGAWR LRAMILAMNG RDREASEIVN ATMPAPMAAN
ILPYLVQMDR LNPAQKAAAA HFGRFPSGQP AAAQRPVQVA TATPTPRPAP APRRSAPTST
PAAAAPVPKP PPPPAAMPPS RPRAETPVPA SSPPANPPTS AVKAPAGPGF SIADIAPAPP
AAAPAAPRPA AQAPAAAPLA SLADIVGSIE IPPEELARPD DAIGAETLAK LLDDKRKAEA
AEAAKREKEE AAARARAEAD ARAKEEAAKK KAHPARIWVQ IATGANPKAL AFDYNRFAKR
NAALFKGKAG ATAEWGQTRR LLVGPFANRK AAQDWLADYK KAEGDGFLFS CEVGEIVEPL
Q