Gene Sala_0351 details


Gene Information

Locus tag: Sala_0351
Symbol:
ID: 4081049
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 361358
End bp: 363181
Gene Length: 1824 bp
Protein Length: 607 aa
Translation table: 11
GC content: 70%
IMG OID: 638008710
Product: hemolysin activation/secretion protein-like protein
Protein accession: YP_615407
Protein GI: 103485846
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG2831] Hemolysin activation/secretion protein
TIGRFAM ID:
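
The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 361358
END_BP = 363181


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming the last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_len // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, "bp")                  # 1824 bp
    print(protein_length(length), "aa")  # 607 aa
```

Under these assumptions, 363181 - 361358 + 1 gives 1824 bp, which is 608 codons, or 607 residues once the stop codon is excluded.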


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.410508
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.212759
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGTGT CGGGGGATGA GGGACAGGAT CCGATTTTCG GGCTGCGACG CGGGGCGTTC 
CGCGCCGCGC TGACCGCGTC GACGGGGCTG ATCGCGGCCG CGCTGCCCGC CGCGGCGCTT
GGGCAGGCCA CGCCGCAGAT CCCCACGCGC GAGGAAATCC AGCGCCCGCC GCTCGCGCCC
GCGGCACCGC CCGGCGAGCG GATCGTGGCG GTCGACGAGG AGATCGAGCG CGCGCCCTGT
CCGCTGGCCG ACCCGCAATT TGCCCATATC CGCTTCACCC TTCGCGCGGT CGAGTTTTCG
CGTGTCGAGG GCATCGACCC GGCGATGCTG TCGCCAAGCT GGTCGGGCCG CACCGGGCAG
GAGCTGCCGA TCGCCGCGGT GTGCGACATC CGCGACCGCG CCGCGACCAT GCTGCGCGCC
AAAGGCTATC TGGCCGCCGT GCGCGTCCCT GCGCAGACGA TCGACGACGG GGTCGTGCGG
CTCGACATAT TGGCGGCGCG GATGGCGCGC GTCGAAGTGC GCGGCGATGC CGGCGCCAAC
GAGGCGCTGC TTCAGCGCTA CCTGTCGCGA CTCGACGACG CGCCGGTGTT CAACATCGCC
GATGCCGAGC GCTACCTGCT GCTCGCGCGC GACATTCCGG GGATGGATGC GCGGCTGACG
CTGCGCCCCG GCGGCACGCC GGGCGAAGTC ATCGGCGAAG TGACGGTGGT GCGCACCCCC
GTCACCTTCG ACTTCAACGC CCAGAATCTC GGTTCGCGCG ATGTCGGCCG CTGGGGCGCC
AATGCGCGCG CGCGCTTCGC CGGGCTGACC GGCATGGCCG ATCTCACGAC GCTGAGTTTC
TATTCGACCC CAGATTTCGA CGAGCAGACG GTCGTGCAGG TCGGCCACGA GCTGCGCGTC
GGCGGCGAGG GATTGCGGCT CGGCGCCAGC TATATCTATG CCTGGACGCG CCCCGACGTC
ACCGGACTGC CGATCAAGTC CGATACGCAG ATCCTTAGCC TGTTCGCCTC CTATCCGCTC
GTGCTGACAC AGGCACGGCG GCTGACGATC GGCGGCGGGC TCGACATTAT CGACCAGGAC
ATCGGCCTGT CGGGCATATC GCTCAACGAG GACCGGCTGC GCGTTTTGGG CCTGCGCGCC
GATGCGAGCT GGGTCGACCC CGGTTCGATC GCCGGGCGCG GCGGCTATAG CGCGGGCGAG
CCGCGCTGGT CGCTCGCCAC CTCGCTCGAA GCGCGGCAGG GTGTGGATTT CCTGGGCGCG
AGCGACGATT GCGGGCCCGG CGGCACCGCC TGTTTCCTGC CCGGCGCGGT ACCGCTGACG
CGCATCGAGG GGCAGCCCGA CGCCTTCCTC ATCCGCGCGC AGGCGCTCGC CGAATGGCGC
CCCGTCAAGC TGTTCACCCT GTCGGCGGCC CCGCGCGCCC AATGGGCAGC CGATCCCCTG
CTCGCCTATG AAGAGTTTTC GGGGGGCAAT TTTACCGTCG GGCGCGGGTT CGACCCCGGC
ACGGTGATCG GCGACAGCGG CGTCGCCGTG GCGCTGGAGG CGCGTTATGG CTCGTTCGTC
CCCGCCAATA CCAAGGCTTT CGCGATCCAG CCCTTCGCTT TCTTCGACGC GGCCTGGGTG
TGGAACGAAG ATGCGGCGTT CGACGGGCTC GATCCGCAAA AGCTTTATTC GGCGGGCGGC
GGCGTGCGTG TCGCCTATGG CGATGTCGCG CGGCTCGACG TGACGCTCGC GGTGCCGCTC
AATCGGGGCG GTTTCCTGAC CGAACGGCCC GACCCGCGGT TGCTCGTGTC GCTTACCACC
CAGTTCGGCG TCAGGGCGCG CTGA
 
Protein sequence
MAVSGDEGQD PIFGLRRGAF RAALTASTGL IAAALPAAAL GQATPQIPTR EEIQRPPLAP 
AAPPGERIVA VDEEIERAPC PLADPQFAHI RFTLRAVEFS RVEGIDPAML SPSWSGRTGQ
ELPIAAVCDI RDRAATMLRA KGYLAAVRVP AQTIDDGVVR LDILAARMAR VEVRGDAGAN
EALLQRYLSR LDDAPVFNIA DAERYLLLAR DIPGMDARLT LRPGGTPGEV IGEVTVVRTP
VTFDFNAQNL GSRDVGRWGA NARARFAGLT GMADLTTLSF YSTPDFDEQT VVQVGHELRV
GGEGLRLGAS YIYAWTRPDV TGLPIKSDTQ ILSLFASYPL VLTQARRLTI GGGLDIIDQD
IGLSGISLNE DRLRVLGLRA DASWVDPGSI AGRGGYSAGE PRWSLATSLE ARQGVDFLGA
SDDCGPGGTA CFLPGAVPLT RIEGQPDAFL IRAQALAEWR PVKLFTLSAA PRAQWAADPL
LAYEEFSGGN FTVGRGFDPG TVIGDSGVAV ALEARYGSFV PANTKAFAIQ PFAFFDAAWV
WNEDAAFDGL DPQKLYSAGG GVRVAYGDVA RLDVTLAVPL NRGGFLTERP DPRLLVSLTT
QFGVRAR
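
The sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how the gene sequence relates to the protein sequence and to the GC content field, the code below strips the whitespace, computes the G+C fraction, and translates codons up to the first stop. The codon-to-amino-acid mapping used here is the standard one, which translation table 11 shares; table 11 differs in which codons may serve as start codons, and that does not matter for this gene because it begins with ATG. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at a stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First two blocks of the gene sequence plus one more base.
    print(translate("ATGGCCGTGT CGGGGGATGA G"))  # MAVSGDE
    # Last line of the gene sequence, which ends in the TGA stop codon.
    print(translate("CAGTTCGGCG TCAGGGCGCG CTGA"))  # QFGVRAR
```

Pasting the full gene sequence into `translate` should reproduce the protein sequence shown above, and `gc_fraction` on the same input can be compared with the GC content field.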