Gene Sala_2758 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_2758 
Symbol 
ID4080243 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp2907047 
End bp2908087 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content68% 
IMG OID638011141 
Productperiplasmic protein-like protein 
Protein accessionYP_617796 
Protein GI103488235 
COG category[S] Function unknown 
COG ID[COG3672] Predicted periplasmic protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.202545 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.860065 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCGTC ATGCCCCATC CGCCCGCGCC GCGCGGTCCC TGCTGCCGCT GATCGCGGCC 
ACCCTGGCGG TCGCGCCCGC AGCAGCGCAG GCGTCGACCA AGCTGGACAA TGTCAAGGCG
GCGGCACCGG CCAAGGTCGT CTGTGACGCG GTGGCCGCTT CCGCCGTGCC GCGCGCCCGC
GACCTGTCGC AGCTGATCCT GGACGGCGCG CCGAGCGCGC TCGATCGCAT CAGGATGCAG
CAGCAAGGGA TAAATCGGCC CGCAACGGTC AACACCATCC CCGATCGTCG CGCGCTCGAA
CCCGCAAGCC GTATGCCGCT TTCCTTCACG GCGTCGGCAC CCGTCGACTG CCGCAATGCG
CCGTCGCCGC CGGGCGTGAC GGCCGAATGG GATGCCGGGT CCGAACTCGG CACGCGCGCC
ATTCCGGTCA AGCGGACGCG CTTCGACGAT CGCTGGGACC GCGTGCACCG CGCCGCGCCC
GCCGCGCTGA TGCAGCGCCA GCTGCAGAGC GCCAATGCCC TGTCCGGGCT CAGCGAAACC
GAGCTGCTGG CGCGCGTCAA TCAATGGGTC AATCGCGAAA TCGCCTATGT CGGGGACGAT
CGCAATTACC GCCGCCGTGA TTTCTGGGCG ACCGCTGACG AGACGCTCGC GCGCGGCAGC
GGTGATTGCG AGGATTTTGC GATCCTGAAA ATGCAGATGC TGCGCGCCGC CGGGATCGAT
GCCAACCGGA TGAAGCTCGT TCTGCTGCGC GATCTCGCCG CCAACGCCGA TCACGCCTTC
CTGCTCGTCG ATACGGGTGG CGGCAAGCTG GTGCTCGATA ATGTGACCGA CCGCCTCTAT
GACGGCGCCC GACCGCAAGC GGTGCGCCCC GTGCTGTCGT TCAGCGCCGA CCGGCGGTGG
GTCCACGCCT ATCGCACCGC GGCGGAAACC CCGGCTGCAA CCATCGTTCC GGGGGCGCGC
AAGAGCATCA CCCTTGCGCT CGCCGATCAG CGTTCGGTCA AGGCCGTCCC GCTGACCTTC
AAAACGGGTT TGAGCAAATA G
 
Protein sequence
MRRHAPSARA ARSLLPLIAA TLAVAPAAAQ ASTKLDNVKA AAPAKVVCDA VAASAVPRAR 
DLSQLILDGA PSALDRIRMQ QQGINRPATV NTIPDRRALE PASRMPLSFT ASAPVDCRNA
PSPPGVTAEW DAGSELGTRA IPVKRTRFDD RWDRVHRAAP AALMQRQLQS ANALSGLSET
ELLARVNQWV NREIAYVGDD RNYRRRDFWA TADETLARGS GDCEDFAILK MQMLRAAGID
ANRMKLVLLR DLAANADHAF LLVDTGGGKL VLDNVTDRLY DGARPQAVRP VLSFSADRRW
VHAYRTAAET PAATIVPGAR KSITLALADQ RSVKAVPLTF KTGLSK