Gene Sala_2247 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_2247 
Symbol 
ID4080261 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp2359665 
End bp2361158 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content67% 
IMG OID638010625 
Productpeptidase S1C, Do 
Protein accessionYP_617289 
Protein GI103487728 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGTTACG TATATGGCAT CACTTCGGCC TTGCTGGCGG GTGGCACCGC GCTGGCGCTG 
GTCACCCAGT CGCCCGTCGG CGCGCAGGTC GCGCAGAACG AACAGAGCGA AATGGCACGC
GTCGTCCCGC GCGCGGGCGC GCCGGACAGC TTTGCCGACC TCGTCGAACA ATTGCAGCCT
GCGGTCGTCA ATATCTCGAC CAAGCAGGAA GTGACGCTCG GCGTACGCCT CAACCCCTTT
GCCGGCACGC GCGAGCCGAT CACGCAGGAG CAGCAGGGCG GCGGATCGGG TTTCCTCATT
TCGTCGGACG GCTATATCGT CACCAACAAC CACGTCATTT CGGGCGGACC GCGCGGCGAA
GCCGTGAACG AGGTTACTGT CACGCTGACC AACCAGCGCG AATATAAGGC GAAGATCGTC
GGTCGCGATG TGGCGTCGGA CCTTGCGCTG CTCAAGATCG ACGCCACCGG CCTGCCCTTC
GTCAAATTCG CGCAAGGCAG CCCGGCGCGC GTCGGCGACT GGGTGGTCGC GATCGGCAAC
CCGCTCGGGC TCGGCTCGAC AGTGACCGCG GGGATCATTT CGGCGGTGCA GCGCAACATC
GGGCAGGGCG GCGCCTATGA CCGCTATATC CAGACCGATA CCGCGATCAA CCGCGGCAAT
TCGGGCGGTC CGCTGTTCGA CCTGCAGGGC AATGTCGTCG GCATCAACAA TATGCTGATC
TCTCCCGTCG GCGCGAACAT CGGCGTCAAT TTCGCGATCC CCGCCGAAGC GGCGATCCCG
GTGATCGAGG CGCTGCGCGC GGGCGAGCGA CCGCAGCGCG GCTATCTGGG CATCGGCATC
GTTCCGGTGA CCGAAGACAT TGCGGCGGCG CTCGGCCTGC CCAAGGACCG CGGCGAGTTC
GTCCAGCGCG TCGAACCCGG CGAAGCGGGC GAAAAGGCCG GGCTGAAGCG CGGCGACGTG
GTGCTGAAAG TCAATGGCCG CGATGTCACG CCGCAGCAGA CTTTGTCCTA CATCGTCGCC
AACACCAAGC CCGGCACGCG CATCCCGCTC GAAATCGTGC GCGACGGGCG GACAATGACG
CTGAATGCCG TCGTTGGCAC GCGCCCGCCC GAAGAGCAGC TCGCCGGGGA CAATTTCGAC
CCCGAAGAGG AACAGACGAT GCCCGAGGAT CCATCGGGCG CGGCCGACGA GACGATCCAG
AACAGCCTGG GCATGGCGGT GCAGCCGTTG ACCCCGGCGA TCGCGCGCGC CGTCGGCATC
GACCCCGACA GCAAGGGGCT CGTCATCGCC GCGGTCGCGG GCAGCAGCGA CGCGGGCCGC
AAGGGGCTGC GCCGCGGCGA CGCGATCCTG AGCGCCAATC GCACGCCGGT CACCTCGGCG
GAGGCGCTGG CGAAGGTCAT CACCGACGCC AAAAAGGCGG GGCGCGACGC GGTGCTGCTG
GAAATCCTGC GGCGCGGCGG CCCGTCGGCC TTCATTGCGA TCCGCCTCAA ATAA
 
Protein sequence
MRYVYGITSA LLAGGTALAL VTQSPVGAQV AQNEQSEMAR VVPRAGAPDS FADLVEQLQP 
AVVNISTKQE VTLGVRLNPF AGTREPITQE QQGGGSGFLI SSDGYIVTNN HVISGGPRGE
AVNEVTVTLT NQREYKAKIV GRDVASDLAL LKIDATGLPF VKFAQGSPAR VGDWVVAIGN
PLGLGSTVTA GIISAVQRNI GQGGAYDRYI QTDTAINRGN SGGPLFDLQG NVVGINNMLI
SPVGANIGVN FAIPAEAAIP VIEALRAGER PQRGYLGIGI VPVTEDIAAA LGLPKDRGEF
VQRVEPGEAG EKAGLKRGDV VLKVNGRDVT PQQTLSYIVA NTKPGTRIPL EIVRDGRTMT
LNAVVGTRPP EEQLAGDNFD PEEEQTMPED PSGAADETIQ NSLGMAVQPL TPAIARAVGI
DPDSKGLVIA AVAGSSDAGR KGLRRGDAIL SANRTPVTSA EALAKVITDA KKAGRDAVLL
EILRRGGPSA FIAIRLK