Gene Sala_1966 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_1966 
Symbol 
ID4080515 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp2072900 
End bp2075836 
Gene Length2937 bp 
Protein Length978 aa 
Translation table11 
GC content67% 
IMG OID638010343 
Productpeptidase M16-like protein 
Protein accessionYP_617011 
Protein GI103487450 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTCTG ATCCAGCGCG CGCGCCCCTT CGTTTTGCCC GTTCCATGCT TGTCGGCGCC 
CTCGCCGCAC TGCTGCTGGC GCCGCTCGCC GCGCGCGCCC AGCACGAGGT CGCGACCGCG
AAGCAAGCGG GGAACAGCGA CTGGCTTTAT GTTGGCAGTG ACATTCCACG CGATACGGCG
TGGCAGTTCG GCATCCTCCC CAATGGCCTG CGCTATGCCG TGCGCAATAA CGGCGTGCCG
CCGGGGCAGG TGTCGATCCG CGTGCGGATG GATGTCGGTT CGATGTTCGA GACCGACGAT
GAGCGAGGCT ACGCCCATCT GCTCGAACAT CTGACCTTTC GCGGCTCGGA GCACATCCCC
GATGGCGAGG CAAAACGTAT CTGGCAGCGC TTCGGCGTAA CCTTCGGCAG CGATTCGAAC
GCCCAGACGA CGCCGACGCA GACCGTCTAC CAGCTCGACC TTCCCAGCGT GACGCCCGCG
AATCTTGACG AAAGCATGAA ATTGCTTGCG GGCATGATCC GTGCGCCGCG TATTTCGGAA
CTGGCGGTGG CGGCCGAGCG CGGAGTCGTC ATGGCCGAGC TGCGCGAATC GGACGGACCG
CAGAAACGCA TCGCCGACGC GACCAACGCG CATCTGTTCG CGGGGCAATT GCTTGGCGAT
CGCTCGCCAA TCGGTACAAC GGCCTCGCTG GGCAAGGCGA CCGCGGCGTC CGTCGGCGCC
TTTCACGATC GCTGGTATCG TCCCGAACGC GCGGTGGTCG TCATTGTCGG CGACGGCGAC
CCCGCCACTT TCGCGCGGCT GATCGCCAGA TATTATGGCG ACTGGAAAGG GGAGGGGACC
AATCCGCCCG ATCCCGATTT CGGAAAGCCC GATCCCGCGG CGCCCGCGGC GCTCGAAATT
GTCGAGCCGA ACCAGCCGCT CGCGCTCACG CTCGCCATGG TCCGGCCGTG GAAAAGGCGG
ATCGACACGG TCGAAAATAC GCGGCGGCTT TACCTCGAGT TCATCGCACA GGCGCTCGTC
AACCGGCGGC TCGAAAATCG CGCGCGCGCG GGCGCCAGCT ACCTCGTCGC CACCGTCGAA
CAGCAATATG TCAGTCGCAG CGCCGATGTG ACCGCGGCCT CGATCGTTCC GTTGAGCGAC
TGGAAGGCCG CGCTCGCCGA TGTCCGCGGG GTGATCGCCG ATGCGGTGCG CCGCCCGCCG
TCGCAGGCCG ACATCGACCG CGAGACGAAC GAGATCGAGG CCTTTCTGCT CAAAGAGTTG
GAAAATGCGC GCAACGAGCC CGGCGCGCGG CTTGCGGACG ACATGGTGCG CGCGGTCGAC
ATCAACGAAA CGGTGACGAG CCCGCAGGGG CAGGTCGATA TGTTCAGGGC GATCCGGGCG
TCGGCGACAC CGCAGGTGAT GCTCGACATC AGCCGCGCCA TCTTTTCCGC GCCGGTCACC
CGCGTCGTGC TGACCACACC GACATCGGCC GGCGGGAACG GCGCCGTCCT CGCCGCGCTG
AAGGCTCCCG TCGCCGCGCG CGACGAACGG CTTGCGGCGG TCGAGGCCGA TTTCGGGCAG
CTTCCGTCGC TTGGCAAGCC CGGCTCCATC GTCGCGACGG CGCCGCTGCC GGGCCTTCGC
GCCGAGCGGA TCGAGCTGTC GAACGGCGTC ACCGCGCTGG TGTCGAACAA CAAGATCGAA
CCGGGCAAGG TGCGCGTCAA TGTGCGCTTC GGTACCGGCA ATCGCAGCGT CGCCGCCGAT
GCGCCGAACC TGTTGTGGAC CGGCGACTAT GCGCTTGTCG CGAGTGGAAT CGGACCGTGG
GGCCAGAATG AGATCGACCA GCTCACCAAC GGGCGGCAGA TTCAGATGAA CTTCGCGATC
GACGACGATG CTTTCGAGCT GTCCGCCGAA AGCCGCCCTG CCGACCTTGC CGACCAGTTG
CGGCTCATGG CGGCCAAGCT GGCGCTGCCG CGCTGGGATG CGGCGCCGGT CGAGCGGCTG
CGGATCGGCT ATCTCACCGG CTATGAGCTC AATGACGCGA CCCCCAATGC GGTACTCGAA
CGCCATTTGC GCGGCTGGCT GACCGGCAAT GACGCGCGCT GGGCGGCGCC CGACCGCGCG
GCAATCGAGG CGCTGACGCC TGCGGCCTTC CGGGCCTTCT GGGAACCGAG GCTCGCCAGC
GGACCCATCG AGGTGCAGAT TTTCGGCGAC CTCGAAACGG TCGATTACCG GAAAATCCTG
ACCGAAAGTT TTGGGGCGCT GGCGCCGCGC GCGATTCTGG CGCCACCGGG TGGCCAGCGC
GTCGATTTTG CGCCGCACGT CACCGTTCCC GAAATCGCCT ATCACCGCGG CGAGCAGGGG
CAGGCGGCGG CGATGACGGC CTGGCCGACC GGCGGCGGCC GCACCAATCC GCGCGACGCG
CGCGCGCTGG AGGTGCTGGC CGCAATCTTC AACGACCGGC TGTTCGATCG GCTACGTGCT
GAACAGGGCG CCAGCTACGG CCCCGTCGTC GACAGCCACT GGCCAACGGG ATTCGACACC
GGCGGCTATC TGCTCGTCGG CAGTCTGCTC GCGCCAAAGG ACATCGACCG TTTTTACGGG
ATTGCGCGCG ACATCGCCGC CGACCTCGTC GCACGACCGG TCAGCGCCGA CGAATTGGCG
CGCAACGCCG GCCCGATCCG CGAGCAGGTG GCGCGAGCAT CGACAGGCAA TGTCTATTGG
ATGTTCCTGC TCGAAGGCGC GACGCGCGAT CCGCGCGTGA CTGCGGCGGC GCTGTCGATC
CAGGACGACT TGGCGGCCGT CACGGCCGCC GATGTGCAGC GGCTCGCGCG CCAGTATCTG
ACGCCGGACC GCCAATGGTC GCTTGCCATC CTGCCCGAGG GCATGACGCT CGCCGATGCG
TCGGCACTGA ATGGTGCGAT GTCGACCCGT CTCGATGCCG CGGTTGGCGG ACGTTAG
 
Protein sequence
MSSDPARAPL RFARSMLVGA LAALLLAPLA ARAQHEVATA KQAGNSDWLY VGSDIPRDTA 
WQFGILPNGL RYAVRNNGVP PGQVSIRVRM DVGSMFETDD ERGYAHLLEH LTFRGSEHIP
DGEAKRIWQR FGVTFGSDSN AQTTPTQTVY QLDLPSVTPA NLDESMKLLA GMIRAPRISE
LAVAAERGVV MAELRESDGP QKRIADATNA HLFAGQLLGD RSPIGTTASL GKATAASVGA
FHDRWYRPER AVVVIVGDGD PATFARLIAR YYGDWKGEGT NPPDPDFGKP DPAAPAALEI
VEPNQPLALT LAMVRPWKRR IDTVENTRRL YLEFIAQALV NRRLENRARA GASYLVATVE
QQYVSRSADV TAASIVPLSD WKAALADVRG VIADAVRRPP SQADIDRETN EIEAFLLKEL
ENARNEPGAR LADDMVRAVD INETVTSPQG QVDMFRAIRA SATPQVMLDI SRAIFSAPVT
RVVLTTPTSA GGNGAVLAAL KAPVAARDER LAAVEADFGQ LPSLGKPGSI VATAPLPGLR
AERIELSNGV TALVSNNKIE PGKVRVNVRF GTGNRSVAAD APNLLWTGDY ALVASGIGPW
GQNEIDQLTN GRQIQMNFAI DDDAFELSAE SRPADLADQL RLMAAKLALP RWDAAPVERL
RIGYLTGYEL NDATPNAVLE RHLRGWLTGN DARWAAPDRA AIEALTPAAF RAFWEPRLAS
GPIEVQIFGD LETVDYRKIL TESFGALAPR AILAPPGGQR VDFAPHVTVP EIAYHRGEQG
QAAAMTAWPT GGGRTNPRDA RALEVLAAIF NDRLFDRLRA EQGASYGPVV DSHWPTGFDT
GGYLLVGSLL APKDIDRFYG IARDIAADLV ARPVSADELA RNAGPIREQV ARASTGNVYW
MFLLEGATRD PRVTAAALSI QDDLAAVTAA DVQRLARQYL TPDRQWSLAI LPEGMTLADA
SALNGAMSTR LDAAVGGR