Gene Sala_1988 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_1988 
Symbol 
ID4082153 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp2096682 
End bp2097803 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content66% 
IMG OID638010364 
ProductOmpA/MotB 
Protein accessionYP_617032 
Protein GI103487471 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2885] Outer membrane protein and related peptidoglycan-associated (lipo)proteins
[COG3637] Opacity protein and related surface antigens 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000257198 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0360376 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGAAGC TTGCCGTCGC TGTGGCGTTG GCCTCCACCA CCCTCGCGTC GCCGTCCATG 
GCGCGCGACG ATTCCTGGTA TGTCGGTGTT GGCGCGGGCG CAATGCTCGT CGAAGACATT
GATCTCGATA TCGGCACCTT CAACAATGCC GGGTCGCTCG ACCATCGCGC GGGCTATGAT
TTCGAAGGCA CCGTCGGTTA TGACTTCGGC GGGTTCCGTG CCGAAGTCGA AGTCGGCTTC
CGTGAAGCCG ACATCAAGTC GGGCCGTTTC GGCAACCCCG GCATCCCGCA GACGGCATCG
GGCGCGGGTA CGCTGTTCAC CGGCTCGACC GACCTGAACG GCGATTCGAA CGCGCTCAGC
TTCATGGTCA ACGGCATGCT CGACTTCGGC GACGACGACG GCCTGCAGGG CTTTGTCGGC
GGTGGCGCCG GTGTCGCCCG CGTGTCGGTC GAACCCGTCT TTGCCGGTCC GTTCCTCGAC
GATTCGGACA CGGGCTTTGC CTGGCAGGCG ATCGCGGGCG TCCGCGCGCC GCTCAGCAGC
AACTGGGACG TCGGCCTGAA GTATCGCTTC TTCAACGCCG ACAATGTCGA TCTGGTGGAT
CAGGCCGGTC GCGACGTTTC GACGCGCTTC CGCTCGCACT CGATCCTCGG CACGCTGACG
TACAACTTCG GCGGCGCTCC GGAGCCGGTG GCGCCTCCGC CGCCGCCTCC GCCGCCCCCG
CCGCCCCCGC CGCCCCCGCC GCCTCCGCCG CCGCCGCCGG TCGTGGAATG CGCGCCTGGG
CCGTACATCG TGTATTTCGA CTGGGATCAG TCGAACATCA CGCCGGAAGC GGCTTCGACG
CTCGACAATG CGATCAGCGC CTATAACCGT GGTTGCACGG GCACGCAGAT CATGCTCGCC
GGTCACGCCG ACCGTTCGGG TTCGGCCCGC TACAACGTCG GCCTGTCGGA ACGCCGCAAC
GATGCGGTTC GCAGCTATCT GACCGCTCGC GGTATCTCGG ATGGTTCGAT CAGCGCGCAG
GCGTTCGGCG AAACCCGTCC GGCCGTTGCG ACCGCCGACG GCGTCCGCAA CGACCAGAAC
CGTCGCGTGG AAATCACTTA CGGTCCGAAC TCGGGCATGT AA
 
Protein sequence
MRKLAVAVAL ASTTLASPSM ARDDSWYVGV GAGAMLVEDI DLDIGTFNNA GSLDHRAGYD 
FEGTVGYDFG GFRAEVEVGF READIKSGRF GNPGIPQTAS GAGTLFTGST DLNGDSNALS
FMVNGMLDFG DDDGLQGFVG GGAGVARVSV EPVFAGPFLD DSDTGFAWQA IAGVRAPLSS
NWDVGLKYRF FNADNVDLVD QAGRDVSTRF RSHSILGTLT YNFGGAPEPV APPPPPPPPP
PPPPPPPPPP PPPVVECAPG PYIVYFDWDQ SNITPEAAST LDNAISAYNR GCTGTQIMLA
GHADRSGSAR YNVGLSERRN DAVRSYLTAR GISDGSISAQ AFGETRPAVA TADGVRNDQN
RRVEITYGPN SGM