Gene Sala_1013 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_1013 
Symbol 
ID4081701 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp1040727 
End bp1042274 
Gene Length1548 bp 
Protein Length515 aa 
Translation table11 
GC content62% 
IMG OID638009373 
Productsulfatase 
Protein accessionYP_616063 
Protein GI103486502 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.504761 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGATCG CCCTCGCGCT GCCGCTGGCC TTCGTCGCCG CGGCGGGCGC GCAGCCCCCT 
GCCCGTCCGA ACATCGTCTT CATCATGTCG GACGATCATG CCTATCAGGC GATCTCGGCC
TATGGGTCGG CGCTGTCGAA ACTGGCGCCC ACGCCGAACA TCGACCGGAT CGCCAAAAAT
GGCGCGATCT TCACGCAAAG CTTTGTCGGC AATTCGCTGT GCGGCCCCAG CCGCGCGACG
CTGCTCACCG GACGGCACAG CCACGCCCAC GGGTTCCGGC AGAATGGCAA CAGGTTCGAC
AACAGGGTCT GGGTCTGGCC GCGGGCGCTG TCGCAGGCGG GCTACGCCAC CGCGATGTTC
GGCAAATGGC ACCTCAACTA TTCGCCCGAA GGCATCGGCT TCGATGACTG GAAGGTGCTC
GACGACCAGG GCGAATATTA TAACCCCGAC ATCATCACGC CGAAGGGTCG CAGCATCGTG
GAGGGCTATG CGACCGACCT CACGACGCAA TACAGCCTCG ACTGGCTGAA GACCGGGCGC
GACAAGTCGA AGCCCTTTGC GATCCTGATC CACCACAAGG CGCCGCACCG CAATTTCATG
CCCGCGCTGC GCCATGTGCA GAAATATCAG GGGGTGACCT TTCCTGTCCC CGCCAGTTAT
TTCGACGATT ATGCGGGTCG CAAGGCCGCT GCGGCGCAGG AAATGACCAT CTATCGCGAC
ATGTATGAGG GGCATGACCT CAAGATGACG GTCGCGAAGG GGTCGGCCGA GCTTCGCTAC
AACCGCTGGC CCGGCGCCTT CGACCGGATG ACGCACACGC AACAGGCCGC ATGGGACGCG
CTGATGCAGG CCGACAACGA CCGCATGAAC GCCGCCAACC TTTCCGGCCA CGACCTCGCG
ATCTGGAAAT ATCAGCGTTA CATGCAGCAA TATCTGGGCA CGATCGCCGC GGTCGACGAA
GGCGTCGGCG CGGTGCTCGA TTATCTGGAG GACAGCGGCC TCGATCGGAA CACGATCGTC
GTCTATACCT CCGACCAGGG CTTTTACCTT GGCGAACATG GCTGGTTCGA CAAGCGCTTC
ATCTATGAAG AATCGATGCG CACGCCCTTC CTGATCCAGT ATCCGGGACA TATCCGCCCC
GGTACGCGCG TCGCCGCGCC GATCCAGAAT ATCGACTATG CCCCGACCTT CCTCGACTAT
GCCGGGGTGA AGGGACCGGC GACGATCCAG GGGCGGTCGG TGACGCCGCT GCTTGCCGGG
CGCACGCCGC CGGATTGGCG CAAGGACGTC TATTATCATT ATTATGAATT TCCGGGCTTT
CATGCCGTTC GCGCGCATTA CGGTGTACGC GGCGAACGCT ACAAGCTCGT GCGCTTTTAT
GGCGACGATC TTGACGCATG GGAGTTTTAC GACCTGAAAA CCGATCCGCG GGAGATGCAC
AACCGGATCG ACGATCCGGC GATGAAAGCG CCGATCGCCG CGATGAAAAA GCGGCTCGTC
GAACTGCGTC GTCAATATGG CGACGGCAGC GGACCCTCGA TTTCCTGA
 
Protein sequence
MPIALALPLA FVAAAGAQPP ARPNIVFIMS DDHAYQAISA YGSALSKLAP TPNIDRIAKN 
GAIFTQSFVG NSLCGPSRAT LLTGRHSHAH GFRQNGNRFD NRVWVWPRAL SQAGYATAMF
GKWHLNYSPE GIGFDDWKVL DDQGEYYNPD IITPKGRSIV EGYATDLTTQ YSLDWLKTGR
DKSKPFAILI HHKAPHRNFM PALRHVQKYQ GVTFPVPASY FDDYAGRKAA AAQEMTIYRD
MYEGHDLKMT VAKGSAELRY NRWPGAFDRM THTQQAAWDA LMQADNDRMN AANLSGHDLA
IWKYQRYMQQ YLGTIAAVDE GVGAVLDYLE DSGLDRNTIV VYTSDQGFYL GEHGWFDKRF
IYEESMRTPF LIQYPGHIRP GTRVAAPIQN IDYAPTFLDY AGVKGPATIQ GRSVTPLLAG
RTPPDWRKDV YYHYYEFPGF HAVRAHYGVR GERYKLVRFY GDDLDAWEFY DLKTDPREMH
NRIDDPAMKA PIAAMKKRLV ELRRQYGDGS GPSIS