Gene Sala_3087 details

Gene Information

Locus tag: Sala_3087
Symbol:
ID: 4082823
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 3238300
End bp: 3239571
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 68%
IMG OID: 638011473
Product: hypothetical protein
Protein accession: YP_618124
Protein GI: 103488563
COG category: [S] Function unknown
COG ID: [COG1262] Uncharacterized conserved protein
TIGRFAM ID: [TIGR03440] conserved hypothetical protein TIGR03440
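
The length fields above follow from the coordinates if one assumes 1-based, inclusive start and end positions and a gene length that includes the stop codon. The record does not state these conventions, so treat them as assumptions. A minimal sketch of the arithmetic, with hypothetical helper names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count, assuming the gene length includes one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(3238300, 3239571)
print(length)                     # 1272
print(protein_length_aa(length))  # 423
```

Both results agree with the Gene Length and Protein Length fields listed in this record.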


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAGCC GCACCGACGC TTCGACCTGC TCCGACCCGA TCGAGGCACT GACCGGGCGC 
TTCGCGGCGA CGCGGCGGCT GTCGCTCGAT CTCGTCGCGT CGCTTTCCGA CGCCGACGCC
AGCGCGCAGT CGATGCCCGA CGCCTCGCCC GCCAAATGGC ATCTCGCGCA CACCACATGG
TTTTTTGAAA CCTTCGTGCT CCGCGACCAT GTGCCGGGTT ATGCGCTTTT CGACGACCGC
TTCCCCTATC TCTTCAACAG CTATTACGAG GCCGAAGGGC CGCGCCACGC GCGCCCGCAG
CGGGGCCTGT TGACGCGCCC GTCGCTGGAC GCGGTGCGTG CATGGCGTGC GCATGTCGAT
GCGGCGGTGG CGGACGCCTT GCCCGGCCTG TCCCCTGCGG CGCTCGCGCT GGTCGATCTC
GGCATCCACC ATGAACAACA GCATCAGGAA CTGCTGCTCA CCGACATCAA GCATCTCTTC
GCGCAGAATC CGCTCGGCCC GGCGGTGTGG CCGCGCGACC ATGCAAGCGC GAGTAAAGTT
GCGCAGTTTT CCGCCATGAA ATGGATCGAG GGCAAAGCCG GTATCGCCGC GGCAGGACAT
GATGGCGACG GCTTTGCGTT CGATTGCGAA GGTCCGCGCC ACGACGCGCT GCTGACCCCG
CACGCGCTCG CGAGCCGCCC TGTCACCAAC GGCGAATGGC AGCAGTTTAT CGACGACGGC
GGCTATCGCA CCCCCTCGCT CTGGCTCAGC GACGGCTGGG CGTGGGTGCA GGCGGAGGGC
ATCGAGGCGC CCGCCTATTG GCGCGATCGG CGATATTTCA CCCTCGCTGG ATGGCAGGAC
ATCGACCCTG CCGCGCCGGT GACGCACATC GGTTTTTTCG AGGCCGACGC CTTTGCCAGC
TGGGCGGGCG CGCGCCTGCC GACCGAGGTT GAATGGGAAG CGGCGGCGGC GGTGCTCGAT
CCGAACGGCG GCGACCAGCT CGATGCCGCC GGTCCGGTGC AACCTGCGGC GGCCGGCGGC
GACACCGGAT TGCAACAGAT GTTCGGCAGC GTCTGGGAAT GGACCGGCAG CGCCTATCGC
CCCTATCCCG GCTTCCGTGC CGCGCCCGGC GCGGTCGGCG AATATAATGG CAAGTTCATG
AGCGGCCAGT TCGTGCTGCG CGGCGGCAGC TGCGCCACCC CGCGCGGTCA CATGCGCGCG
ACATACCGCA ATTTCTTCTA CCCCCACCAG CGCTGGCAGT TCACCGGCGT GCGGCTCGCA
AAGGATCTCT GA
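
The gene sequence is printed in space-separated groups of ten bases. The sketch below shows one way to join such a block into a single string and compute its length and GC fraction, which could then be compared with the Gene Length and GC content fields. The function names are hypothetical, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence as printed in this record.
first_line = (
    "ATGGCCAGCC GCACCGACGC TTCGACCTGC "
    "TCCGACCCGA TCGAGGCACT GACCGGGCGC"
)
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"GC fraction of this line: {gc_fraction(seq):.2f}")
```

Passing the full block of lines to `clean_sequence` works the same way, since line breaks are treated like spaces.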
 
Protein sequence
MASRTDASTC SDPIEALTGR FAATRRLSLD LVASLSDADA SAQSMPDASP AKWHLAHTTW 
FFETFVLRDH VPGYALFDDR FPYLFNSYYE AEGPRHARPQ RGLLTRPSLD AVRAWRAHVD
AAVADALPGL SPAALALVDL GIHHEQQHQE LLLTDIKHLF AQNPLGPAVW PRDHASASKV
AQFSAMKWIE GKAGIAAAGH DGDGFAFDCE GPRHDALLTP HALASRPVTN GEWQQFIDDG
GYRTPSLWLS DGWAWVQAEG IEAPAYWRDR RYFTLAGWQD IDPAAPVTHI GFFEADAFAS
WAGARLPTEV EWEAAAAVLD PNGGDQLDAA GPVQPAAAGG DTGLQQMFGS VWEWTGSAYR
PYPGFRAAPG AVGEYNGKFM SGQFVLRGGS CATPRGHMRA TYRNFFYPHQ RWQFTGVRLA
KDL
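
The protein sequence corresponds to the gene sequence translated codon by codon. A minimal sketch of that translation follows, assuming the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code. It does not model table 11's alternative start codons (this gene starts with ATG), and `translate` is a hypothetical helper, not part of any library.

```python
from itertools import product

# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three groups and last line of the gene sequence in this record.
print(translate("ATGGCCAGCC GCACCGACGC TTCGACCTGC"))  # MASRTDASTC
print(translate("AAGGATCTCT GA"))                     # KDL
```

The two outputs match the first ten and last three residues of the protein sequence above.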