Gene Sala_2387 details


Gene Information

Locus tag: Sala_2387
Symbol:
ID: 4080540
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 2516281
End bp: 2519160
Gene Length: 2880 bp
Protein Length: 959 aa
Translation table: 11
GC content: 65%
IMG OID: 638010767
Product: peptidase M16-like protein
Protein accession: YP_617429
Protein GI: 103487868
COG category: [R] General function prediction only
COG ID: [COG0612] Predicted Zn-dependent peptidases
TIGRFAM ID:
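
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that Gene Length counts the terminal stop codon; the function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2516281
end_bp = 2519160
gene_length_bp = 2880
protein_length_aa = 959


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in an unspliced CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_consistent, protein_consistent)
```

Under those assumptions both checks hold for this record: 2519160 - 2516281 + 1 = 2880 bp, and 2880 / 3 - 1 = 959 aa.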


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.183046
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCCGCT TTCACCGTTT CGCTGTCGTC CTTGCCCTGT CGACCTCGCT CGTCGCCGCC 
GCACCTGTGC TCGCGAAGGT CGCTGCGCCG GCGCCGACCG CCGAGTTGGT GAAGGCGGTC
GACATCCCTT ACGAGGCCTT CACGCTCGAC AATGGGCTGC GCGTGATCGT GCATGAGGAC
CGCAAGGCGC CCGTCGTCGC GGTGTCGGTC TGGTATCGCG TCGGGTCGAA GCACGAGCCG
AAGGGCAAGA CGGGCTTTGC GCACCTGTTC GAGCATCTGA TGTTCAACGG ATCCGAAAAC
GCCCCCGACG ATTTTTTCGA ACCGCTGCGC CAGGTCGGCG CGACCGATTT CAACGGCACC
ACCTTTCTCG ACCGCACCAA TTATTTCGAA ACGGTGCCGA CCGGAGCGCT CGACCTCGCG
CTGTTCCTCG AAAGCGACCG CATGGGGCAT CTGCTCGGCG CCGTGACGCA GGAAAAGCTC
GATAACCAGC GCGGCGTCGT CCAGAACGAA AAAAGACAGG GCGACAACAA TCCCTATGGC
CTGCTGCGCT ACGAGATTTT CGAGAATCTC TTTCCCAGGG GACACCCCTA TCACCACAGC
ACCATCGGTT CGATGGCCGA CCTCGACGCG GCGAGCCTTG ACGATGTGAA AAAATGGTTC
ACCGACAATT ATGGCCCGAA CAACGCCGTG CTGGTGCTGG CGGGCGACAT CGACTTAGCG
ACCGCCAAGG CCAAGGTCGA AAAATGGTTC GGCGACATTC CGCGCGGCCC CGATGTCAAG
GCGCCCGTTG TGTCGGTGCC GACGCTGCCC GCGCCGCTCG CGAAAGAGGT CAAGGACATG
ATCCCGACCA CGCGCCTCTA TCGCATGTGG ACGATGCCGG GGCTCAACGA TCCCGAAGCC
GTGCCGCTCC AGATGGCGAT GGCGGTGCTC GGCGGGCTGT CGTCGTCGCG GCTCGACAAT
GCGCTGGTGC GCAAGGATCC GGTCGCGGTC AGCGTCTCGG CCGGCGCCTC GCCTTTCGAG
GATGCAGGCA TCATCCTCGT CCAGGCCGAC GTCAAGCCGG GGGTCGATCC GGCGCATGTC
GGCAAGCGGC TCGATGAAGA GATCGCCGCC TTCCTCGCGA GCGGGCCGAC CGCCGACGAG
TTGCAGCGCG CGACCGCCAG CCATCTGGGC GGCACCATCT CGCAGCTCGA ATCGGTCGGC
GGTTTCGGCG GCAAGGCGGT GACGCTCGCC GAAGGCGCGC TCTATTCAAA CGATCCTGCC
TATTACAAGG TCGAGCTCGA CCGGCTCGCC CGCGCGACGC CTGAACAGGT GCGCGACGCC
GCGCGCAAAT GGCTGTCGCG ACCGGCCTTT TCCTTGACCT ACACGCCCGG CGAGCGCACC
GAGGGCGGCG AGAACCGCGG CGGCGCGGTC GTCGCGGCGA AATCGGCCAC GCCGGTCCAG
CCCGACCATT ACTGGAACCC CGCGCTCGGC GACGTCGGCC CCGACAGCGG CTTCAGGGGG
GCAAGCTCGA TTGCCGACCG CTCGCAATTT CCCGAGGTGT CGGGTCTGAA GGCGCTCGAT
TTCCCGGACA TCGAGCGCGC GAAGCTCAAG AACGGGATCG AGGTGGTTTT CGCGCGGCGC
AGCGCGGTCC CGACGGTCAA CGTCCAGGTC AGCTTCGACG CCGGCTATGC CGCCGATCCG
CGCAGCGCGC TCGGCACGCA GTCGCTGATG CTCAGCCTGA TGGACGAAGG CACGACGAGC
CTCGACTCGA TCGCCTTTGC CGAAGCCAAG GAGCGGCTCG GCGCGCAAAC CTATGGCTAT
GCCGACGCCG ACGAAACCGC GCTCGGCCTG TTCGCGCTCA AGCCGAATCT CAGCGCGTCG
CTGGCGCTGC TCGCCGATTA TGTGCGCAAT CCGGCGTTCG ACGCCAGGGA ACTGGAGCGC
GTGCGCGCGC AGCAGCTCAA CCGGCTGAAG GCCGAACTCA ACGAACCGCG CGCGATCGCG
CAGCGCGTGC TGAAACCCGC GCTCTACGGC GCGGACCACC CCTATGGCAT CCCGCCATCG
GGGCTCGGCA ATGAAAAGGC CGTGAGCGAA GCGACGCGCG ATCAGCTTGT CGCGTTCCAT
TCGGCGTGGA TTCGCCCCGA CAACGCCCGC ATCTTCGTCG TCGGTGACAC CACGCTTGCC
GAAGTGACGA AGGAACTCGA CCGGGCGTTC GGCGACTGGA GGGCGCCGGC GACGCCGAAG
CCGGGCAAGC ATTTCGAAAT AGCGGTGCCC AAGCCGCAGC CGCGCATCCT GCTCGTCGAC
CGGCCCAAGG CGCCGCAGTC GGTGATCGTC GCGGGCAAGG TGCTCGACGT CAAAGGCGGC
GACGAACTCG AAGTGCTGCG CGCGGCGAAC GATATTTTCG GCGGCGATTT CCTCTCGCGC
TTCAACATGA ACCTGCGCGA GACAAAGGGC TGGTCCTATG GCGTGCGCAC GCAGGTGACG
AATGAAAAGG ACCGGGTGAG CTGGATCGCG ACGGCGCCGG TGCAGGCCGA TCGAACCGGC
GATTCGATCA AGGAACTGCA AAGCGACCTC AAGGCCTTCC TTGGCGACAA GGGCGTCACG
AAGGAAGAGC TGCAGCGCAC GGTGAACGGC AGCGTGCGCG AACTGCCCGG CAGCTTCGAG
ACGTCGAACG ACGTGCTCGG CGGGCTGCGC GCCATCGCGA AATTCGATCG TCCCGACGAT
TATTACGAGA AACTGCCCGC AACCTATGAA GCGATGACAC CCGAAGCGGT CGACGCCGCC
GCGCGGAAAG CGCTCAGCGC CGACGATCTC ATCTATGTCG TCGTCGGCGA CGCCGCGGTG
GTGAAACCGC AGCTTGACGG ATTGGGTCTG CCCGTGGAAA CGGTGTCTCC CGCTAACTAA
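
The GC content reported for this gene is the percentage of G and C bases in the sequence. A minimal sketch of that calculation follows, assuming the sequence is pasted as text with the 10-base spacing used above. The helper name gc_percent is hypothetical, and only the first row of the gene is used here for brevity, so the printed value describes that row rather than the whole gene.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq.

    Whitespace and any other characters are ignored, so sequence rows
    can be pasted with their 10-base spacing intact.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First 60 bases of the gene sequence, copied from this record.
first_row = "ATGGCCCGCT TTCACCGTTT CGCTGTCGTC CTTGCCCTGT CGACCTCGCT CGTCGCCGCC"
print(round(gc_percent(first_row), 1))
```

Passing the full 2880 bp sequence to the same function gives the value to compare against the reported GC content of 65%.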
 
Protein sequence
MARFHRFAVV LALSTSLVAA APVLAKVAAP APTAELVKAV DIPYEAFTLD NGLRVIVHED 
RKAPVVAVSV WYRVGSKHEP KGKTGFAHLF EHLMFNGSEN APDDFFEPLR QVGATDFNGT
TFLDRTNYFE TVPTGALDLA LFLESDRMGH LLGAVTQEKL DNQRGVVQNE KRQGDNNPYG
LLRYEIFENL FPRGHPYHHS TIGSMADLDA ASLDDVKKWF TDNYGPNNAV LVLAGDIDLA
TAKAKVEKWF GDIPRGPDVK APVVSVPTLP APLAKEVKDM IPTTRLYRMW TMPGLNDPEA
VPLQMAMAVL GGLSSSRLDN ALVRKDPVAV SVSAGASPFE DAGIILVQAD VKPGVDPAHV
GKRLDEEIAA FLASGPTADE LQRATASHLG GTISQLESVG GFGGKAVTLA EGALYSNDPA
YYKVELDRLA RATPEQVRDA ARKWLSRPAF SLTYTPGERT EGGENRGGAV VAAKSATPVQ
PDHYWNPALG DVGPDSGFRG ASSIADRSQF PEVSGLKALD FPDIERAKLK NGIEVVFARR
SAVPTVNVQV SFDAGYAADP RSALGTQSLM LSLMDEGTTS LDSIAFAEAK ERLGAQTYGY
ADADETALGL FALKPNLSAS LALLADYVRN PAFDARELER VRAQQLNRLK AELNEPRAIA
QRVLKPALYG ADHPYGIPPS GLGNEKAVSE ATRDQLVAFH SAWIRPDNAR IFVVGDTTLA
EVTKELDRAF GDWRAPATPK PGKHFEIAVP KPQPRILLVD RPKAPQSVIV AGKVLDVKGG
DELEVLRAAN DIFGGDFLSR FNMNLRETKG WSYGVRTQVT NEKDRVSWIA TAPVQADRTG
DSIKELQSDL KAFLGDKGVT KEELQRTVNG SVRELPGSFE TSNDVLGGLR AIAKFDRPDD
YYEKLPATYE AMTPEAVDAA ARKALSADDL IYVVVGDAAV VKPQLDGLGL PVETVSPAN
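
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below illustrates this on the first and last codons of the gene only. It assumes the standard codon assignments, which table 11 shares with table 1 for amino acids (the tables differ in their permitted start codons, which does not matter here because the gene starts with ATG). The names CODON_TABLE and translate are hypothetical and do not come from any library.

```python
from itertools import product

# Codons enumerated in T, C, A, G order with the third base varying fastest,
# paired with the amino acid string used for the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate DNA codon by codon, ignoring whitespace.

    With to_stop=True, translation ends at the first stop codon and the
    stop is not included in the result; otherwise stops appear as '*'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last five codons of the gene, copied from this record.
print(translate("ATGGCCCGCT TTCACCGTTT C"))
print(translate("TCT CCCGCTAACTAA"))
```

The two calls print MARFHRF and SPAN, matching the start and end of the protein sequence above; the final TAA codon is the stop and is not part of the 959 aa product.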