Gene Sala_2094 details


Gene Information

Locus tag: Sala_2094
Symbol:
ID: 4080068
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 2200586
End bp: 2201761
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 51%
IMG OID: 638010469
Product: hypothetical protein
Protein accession: YP_617136
Protein GI: 103487575
COG category:
COG ID:
TIGRFAM ID:
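
The start, end, gene length, and protein length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The record itself does not state either convention.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (an assumption)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(2200586, 2201761)
print(length, protein_length_aa(length))  # prints: 1176 391
```

Under those assumptions the computed values match the 1176 bp and 391 aa reported in the record.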


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.139728
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.387563
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAATTG ACTCCGCACA TTATATTGGG CGTGAGCAAG CGCTCGTAAA GCACACATTT 
CTTGATCGAT ATCTTCCATC TCTAATTGGA AAAGTGTGCT CACGATACGA CGAGTTCGTC
TATGTCGATG GCTTTGCCGG TCCTTGGCAA TCTGCCGCGG GAGAAAGCTT CGACGATACT
TCATTTGGTA TCGCGCTTAC TCACATGACG GCGCAGCGCC TTCTATACCT CAGTAAAGGT
CGAAATATAA GAATGCGAGC ATTCCTCGTG GAAAAGGACC CTAGCTCATT TGCGCAACTG
GAACGCGCCA TAGCGCGGTT TCCAAAAATT GAGATTATTC CTTTAAATGG GTTGATGGAG
GCGCATGCCG CAAGGATCGC TTCGTGCATT CCACAATCAG CGTTCTCCTT TACACTGATT
GACCCCAAAG GATTTCCAGA TATCGGAGCA ATGCTCCCTC TTCTTAAGAG GGAACATGCA
GAAGCACTCG TTAATTTCAT GTTCGATTTT GCTAATCGGT TTGCAGGTAC TGACCTTATA
CCAGCGTTAG AAGATTGGCT TTCCGCATTG GGAAGCGTGG GTTGGCGCCA AGAGGTCGAG
GGGCTCTCAG GCTCCGAGCG CGAACGGAAG CTAGAAAGAT TGGCTGCCGA AGCATTACAG
ATTACCGGCG CTTACTCGTT TTCACCTGTT ATTACGGTGG ACAAAGTTCT TCATAATCGG
CCGCTGTACA AGCTAATCTT TCTTTCAAGG CATGCCGAAG GCTTGAAGGT CTTCCGAGAC
AGCGAGGCGA AAGCGCTGGA CACGCAAGCA ACGGCTCGGT CTGCATCAAA AGCAAAGAAG
AGGGCCGAAA GCTCGCCAAT TGGAGATTTG TTTGCCGACG GGGAAGATGC GGTACCAAAT
GATCGAAGCT CTCAGGTGAT CAGGCAAAGC CGGCAAGATG CCATTCGTGC CCTTGGAGCG
CAAATAATGA CCGCCGGCTC AAGCGGAATG GTTTGGGGAA ACCTTTGGCC TCCTATCCTA
GAGGATTTTT CCGTCACGCG ATCTTGGCTT GGCCACCAAG TGAATGACAT GCGTAAAGCG
GGCCGGATTT TAGCACCGGG GTGGCCAAGC GAACGAAAGC AGATCCCCGA GGACAGCCAA
CGCTTGATTT TGGCCCAAGC CGTCTCGCCC ACCTAG
 
Protein sequence
MAIDSAHYIG REQALVKHTF LDRYLPSLIG KVCSRYDEFV YVDGFAGPWQ SAAGESFDDT 
SFGIALTHMT AQRLLYLSKG RNIRMRAFLV EKDPSSFAQL ERAIARFPKI EIIPLNGLME
AHAARIASCI PQSAFSFTLI DPKGFPDIGA MLPLLKREHA EALVNFMFDF ANRFAGTDLI
PALEDWLSAL GSVGWRQEVE GLSGSERERK LERLAAEALQ ITGAYSFSPV ITVDKVLHNR
PLYKLIFLSR HAEGLKVFRD SEAKALDTQA TARSASKAKK RAESSPIGDL FADGEDAVPN
DRSSQVIRQS RQDAIRALGA QIMTAGSSGM VWGNLWPPIL EDFSVTRSWL GHQVNDMRKA
GRILAPGWPS ERKQIPEDSQ RLILAQAVSP T
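
The sequences above are displayed in space-separated blocks of ten letters, so they need to be joined before any further analysis. The following is a minimal sketch of how that might be done, together with a simple GC percentage calculation. The helper names `join_blocks` and `gc_percent` are hypothetical and are not part of the source database.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to display a sequence in blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Last displayed line of the gene sequence in this record.
last_line = join_blocks("CGCTTGATTT TGGCCCAAGC CGTCTCGCCC ACCTAG")
print(len(last_line), last_line[-3:])  # prints: 36 TAG
```

Applying `join_blocks` to the full gene sequence and passing the result to `gc_percent` would give a value to compare with the GC content field of the record.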