Gene Sala_3171 details


Gene Information

Locus tag: Sala_3171
Symbol:
ID: 4082507
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 3324897
End bp: 3326300
Gene Length: 1404 bp
Protein Length: 467 aa
Translation table: 11
GC content: 66%
IMG OID: 638011556
Product: peptidase M48, Ste24p
Protein accession: YP_618207
Protein GI: 103488646
COG category: [R] General function prediction only
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
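
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the stated gene length) and that the CDS ends in a single untranslated stop codon. The function names are illustrative, not part of any database API.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 3324897
END_BP = 3326300


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1404
print(protein_length(length_bp))  # 467
```

Both values agree with the Gene Length and Protein Length fields in the record.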


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0504963
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.228205
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGCTT TCCCGCCGCA ACCCGCGTCC GGCGCGCATG GCTTGCGCTC CCTTTTTCAT 
ATCCTGCTCA CGCTGCTCGC GATGGTCGCC ATCGCGGTGC GCCCCGCGGC CGCCCAGTCG
ATCCTGCGCG ATGCCGAAAC CGAGGCGCTG TTTCAGGATA TGATGGACCC GCTGCTCGTC
GCCGCGGGAT TGCAGCCGGG ACAGGTGCGC GTCCACCTGC TCGGCGATCG CAGCATCAAC
GCCTTTGTCG CGGGCAGCCA GGACATCTAT GTCTTCAGCG GGCTGATCGA AGCCGCCGAC
AGCGCCGAAG AGGTGCAGGG CGTGCTCGCG CACGAACTGG GGCATGTGAT GGGCGGCCAC
GCCATCCGCA TCAACGATGG CGTGAGCGCC GCGACCAGCA TCTCGCTGCT CAGCCTGCTG
CTCGGCGCCG CGGCGATCGC GGCGGGCGGC GGCGAGGCGG GCATGGGCAT CATGATGGCG
GGGCAGCAGG CGGCGCTCGG CAAGTTCCTC GCCTTCAGCC GCGTCCAGGA ATCGACCGCT
GACGCCGCCG GCGCACAATA TCTGTCGAAG GCGGGGATCA GCGGGCGCGG CAGCCTGGCC
TTCTTCAAGA AGCTCCAGAA TCTGGAGTTC CGCTATGGCA TCAAGCAGGA CGACGACCAG
GCCTATGGCC GGACGCACCC GATGTCGGGC GACCGTATCC AGGCGCTGCG CGAAGTCTAT
GTCATCGACC CCGCATGGAA CAAGCCGGCT GACCCGGCGA TCGAAAAGCG GTTCCAGCGC
ATCAAGGCCA AATTGTTGGG CTATATGGCC GAGCCCGAAC GGACGCTGCG CAAATTTCCG
GAAAGCGACC GCAGTGTTCC GGCGCGCTAT GCGCGCGCCT ATGCCTGGCA CAAGAGCGCC
TATCCGCAAA AGGCGCTGGC CGAAGTCGAG GCGCTGCTCG AAGCCGACCC CGACGACCCC
TATTTCCTCG AACTCGAAGG GCAGATCCTG CTCGAATCGG GGCGCCCCGA CGAGGCGATC
CCCCCGCTGC GCCAGGCGGT GGCCAAATCG CGCTCGCAAC CCCTGATTGC CGCCACGCTG
GGCCATGCGC TGATCGCGAC CGAAGAGCCC GCCCACTATG CCGAGGCCGA AAAGGTCCTG
AAAACCGCGG TCGCGCTCGA CAACCAGAAT CCCTTTGCCT GGTATCAGCT TGGCATCGTC
TATGCCAACA AGGGCGATCA GGCCCGCGCC GCGCTCGCTT CGGCCGAACG CTATAGCCTC
GAGGGTAGAC AACCGGCGCT GGCGCTGCGT AATGCCGAAA TGGCGATGCA GGGCCTGCCG
CAGGGATCGC CCGACTGGAT TCGCGCACAG GATATTTCGC TGGTCGCTCG CGCCGAGGTG
GAACGCGAGC GTAAACGGCG TTAG
 
Protein sequence
MTAFPPQPAS GAHGLRSLFH ILLTLLAMVA IAVRPAAAQS ILRDAETEAL FQDMMDPLLV 
AAGLQPGQVR VHLLGDRSIN AFVAGSQDIY VFSGLIEAAD SAEEVQGVLA HELGHVMGGH
AIRINDGVSA ATSISLLSLL LGAAAIAAGG GEAGMGIMMA GQQAALGKFL AFSRVQESTA
DAAGAQYLSK AGISGRGSLA FFKKLQNLEF RYGIKQDDDQ AYGRTHPMSG DRIQALREVY
VIDPAWNKPA DPAIEKRFQR IKAKLLGYMA EPERTLRKFP ESDRSVPARY ARAYAWHKSA
YPQKALAEVE ALLEADPDDP YFLELEGQIL LESGRPDEAI PPLRQAVAKS RSQPLIAATL
GHALIATEEP AHYAEAEKVL KTAVALDNQN PFAWYQLGIV YANKGDQARA ALASAERYSL
EGRQPALALR NAEMAMQGLP QGSPDWIRAQ DISLVARAEV ERERKRR
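
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch rather than the database's own method. The helpers `clean`, `translate`, and `gc_fraction` are hypothetical names. The codon table is the standard one, whose amino-acid assignments translation table 11 shares (table 11 differs in its permitted start codons). Only the first and last lines of the gene sequence are used here, to keep the example short.

```python
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # codons starting with T
    "LLLLPPPPHHQQRRRR"  # codons starting with C
    "IIIMTTTTNNKKSSRR"  # codons starting with A
    "VVVVAAAADDEEGGGG"  # codons starting with G
)
# Map each of the 64 codons to a one-letter amino acid ('*' marks a stop).
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


first_line = "ATGACCGCTT TCCCGCCGCA ACCCGCGTCC GGCGCGCATG GCTTGCGCTC CCTTTTTCAT"
last_line = "GAACGCGAGC GTAAACGGCG TTAG"
print(translate(first_line))  # MTAFPPQPASGAHGLRSLFH
print(translate(last_line))   # ERERKRR
```

The two outputs match the first 20 and last 7 residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value to compare against the GC content field.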