Gene Sala_1015 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_1015 
Symbol 
ID4081703 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp1044845 
End bp1047895 
Gene Length3051 bp 
Protein Length1016 aa 
Translation table11 
GC content60% 
IMG OID638009375 
ProductTonB-dependent receptor 
Protein accessionYP_616065 
Protein GI103486504 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.501659 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGTTTT CGCATGAAAT TCGGAATAAG GGCGCTCGCC GGTTGCGTGC GCGTTCGCTC 
GCGGCTGTGC TGGCATGGAG CAGCGCGGCT GCGGCCATGG CCGTCGCCAT CGCCACCCCC
GCCCATGCCC AGGTTTCGAA CGCTTCGCTG CGCGGCACGG TAAAGGCCGA AGGCGGCGTG
AGCCAGGTGA CCGCCATCAA CGTGAACACC GGTTTGACAC GCAGCGTCGC GGTCGGCGAA
AACGGCAGCT ATAATATCGC CTCGCTTCCC GTCGGCACCT ATCGTCTTGA ACTCACCACG
CCGGGCGGCG TGCGCCGCAC CGACGAATTC ACCCTGTCGG TCGGGCAGAG CGCCGTGCTC
GACTTCGATT TCTCGCAGCC CGACATCGCT TCGGATGACG GCGCGATCAT CGTCACCGGC
ACGCGCCTCC GTTCGATGGA AGGCGGCGAG GTGGGCACCA ATATCAGCCA GCGCCAGATC
GAGGTGCTGC CGCAGAACAA TCGCAACTTC CTCGCCTTTG CCGACCTGGC TCCGGGCGTG
CAGTTCGTGA CCGCTGGCAA CGATCAGTCG CGTCTGCAGG GCGGTGCGCA GAACAGCAGC
ACCGTGAACG TCTTTATCGA TGGCGTCGGG CAGAAGGATT TCGTGCTCAA GAACGGCATT
TCGGGTCAGG ATTCGACCCA GGGCAACCCC TTCCCGCAGC TTGCGGTCGG CGAATATCGC
GTTATTTCAT CGAACTACAA GGCCGAGTTT GATCAGGTAA GCTCGGTTGC GATCACCGCC
GTGACGCGGT CGGGGACCAA CGAGTTTCAT GGCGAAGCCT TCATCGACTA TACCGATCAG
GGATTGCGCG ACCGTCGCCC CAACGAACTC ACCGGGACCA AGATCAAGAC CAAGGATTTT
CAGTTCGGTG GTGCGTTGGG TGGGCCGATC ATCAAGGACA TGCTGCACTT CTTCGTCACT
TATGAAGGAA AGCGCCAGGA GAATCCGCGC GACATCCGCC CTGGCTTCAA CCTGCCGCTC
GACTTTTTCC CGGCCGAATA TCGGGGCGTT TTCGGTCCGA CGAATGCGAC GTTCAACGAA
GATCTCTATT TCGGCAAGCT CAGCTTTCAG CCGACGTCGA GCGATCTGAT CGAATTGTCG
GGACGGCACC GGCGCGAGAG CGGCGAGTTT CTGAGCAGCG GGATCAATGC GCGTGAGACG
ATCAGCGCGC AAAAAGTGAT CGAATATCGC GGCACGGCGC GCTGGGAGCA CACCGCCGAC
AACTGGATCA ACGATCTGAA ACTCACCTAT GAGGATGTGC GCTGGGCGCC GACCCCGGTC
GTTTTCGGCA ATGGCAGCCT GTTCGCTTAT GCCGCACCGA ATGAGAGCAA CCCCGCGGTC
ATCGACCGCG CCGATATTCT GCGCATCGGG GGCGGCGCCA ATTATCAGGA CAAGGGCCAG
AAAGGTTGGG GCATCCAGAA TGATTTCACC TGGACCGGAT TCGAAGGACA TACGATCAAG
TTCGGGGTCA AGTCGAAATG GGTCGAGCTG AACACGCTTC AACTCAACAA TTTCAACCCG
CTCTATACCT ATAACGTCGC CTTCAATCCC AACGGCGGCA CATTCAATGA CGAAATCCCC
TATCGGCTCC AGTTCGGGGC GCAGACCGGA ACCGGCAACC CCATCGTCAG CTCGGACAAC
TGGCAGTTCG GCATCTATCT TCAGGACGAC TGGGAGGTCA CCGATCGCCT GACCCTCAAT
CTTGGTGTCC GGTGGGATTA TGAGCGGACG CCGTCCTATC TCGATTTCGT CCATCCCGCC
GATGCGGTGA ACGCCGTTTC GCCCGCCAAT TACCCCAATC TGGTCAACGC CGACTATGAC
ATCAACGATT TCATTTCGAC CGGATCGGAG CGCAAGGCGT TCAAGGGTGC CTGGCAGCCG
CGGATCGGTT TCAGCTATGA GCTCGACGAC GACGCGCGCT TTGTGCTCTT CGGCGGCTTC
GGCCGCTCCT ATGACCGCAA CCAGTTCGAT TTCCTCCAGC AGGAAATCAG CGTCGGCTCC
TTTGCAACGC GCACCTTCAA CTTCAACACC GGCGATCCTT TTAACATCTG CGCACCGGGG
CCCACCTGCG TCACCTGGGA TCCCATCTAT CTGACCGAAG CCGGCCGCCA GCAGTTGCTG
GCGCAGGCGG GACCCGGCGG TGGCCGCGAA CTGCGTTTCA TAACCAACAA GCTGAAAGTT
CCCTATTCGG ATCAGTTCAG CCTGGGCCTC CGCGCCCGCG TGACGCCGCT CTTCGAAGCC
GAAGTCGGTT ACAGTCATGT CGAGAGCAAG GACGGTTTTG CCTACCTGCT CGGCAACCGG
CGCCCCGATG GCAGCTTCTT CCCGCCGGCA CCCGCCGCGC CGAGTTCGCC CTTCGGCTTT
GCGCCGCCGG GTTTCGGGTC GATCATCATC GGCACCAACG GGCTCGAAAC GCGCGCCGAC
ACGGGCTATC TCAAGCTGGT GAAAAACTAC ACGGCGGCAT CACCGTGGAG CCTTGCGGCG
ACCTATACCT ACACCGAGGC TGAGGAAAAC AGGAACTTCG GTGAGGTCTT CAGTCTCGAT
TTCCCGTCAA TCGAAGATTA TAATTTTGCG CGGTCGGCGG GGGTGCGCAA ACATCGCTTC
GTCGCCGCGG GGTCGGTCGA CCTGCCGATC GGGGTGACGC TGTCGGGCAA GTTCACTTTG
GCGTCGCCGC CCTATCTCAA GGCGTTCGTA AATACCGGCG GGGACAATCC GTCGCGCACG
GTCATCTCGA ACGAGGCGAA GGGTAATGGC GACCGCTGGG GCCTGCGCCA GTTCGACCTT
GCGATCATCA AATATATCCC GTTCCGCTTC ATCAGCGACG AATCGCGGCT TCGCCTGCGT
CTCGACATCA TCAACCTGTT CAACGACCGC AACTTTGTGG ATTACAACAA TAATCCGGCG
GACAATACGC GAACGCCGTC CAGCCCCACC ATCTATCGCG AGATTTCGGG GATCGGCGTT
GGCGGCAACC CGCCGCGTAC GGTAAAGGTC TCGGCCGGTT TCTCTTTCTG A
 
Protein sequence
MTFSHEIRNK GARRLRARSL AAVLAWSSAA AAMAVAIATP AHAQVSNASL RGTVKAEGGV 
SQVTAINVNT GLTRSVAVGE NGSYNIASLP VGTYRLELTT PGGVRRTDEF TLSVGQSAVL
DFDFSQPDIA SDDGAIIVTG TRLRSMEGGE VGTNISQRQI EVLPQNNRNF LAFADLAPGV
QFVTAGNDQS RLQGGAQNSS TVNVFIDGVG QKDFVLKNGI SGQDSTQGNP FPQLAVGEYR
VISSNYKAEF DQVSSVAITA VTRSGTNEFH GEAFIDYTDQ GLRDRRPNEL TGTKIKTKDF
QFGGALGGPI IKDMLHFFVT YEGKRQENPR DIRPGFNLPL DFFPAEYRGV FGPTNATFNE
DLYFGKLSFQ PTSSDLIELS GRHRRESGEF LSSGINARET ISAQKVIEYR GTARWEHTAD
NWINDLKLTY EDVRWAPTPV VFGNGSLFAY AAPNESNPAV IDRADILRIG GGANYQDKGQ
KGWGIQNDFT WTGFEGHTIK FGVKSKWVEL NTLQLNNFNP LYTYNVAFNP NGGTFNDEIP
YRLQFGAQTG TGNPIVSSDN WQFGIYLQDD WEVTDRLTLN LGVRWDYERT PSYLDFVHPA
DAVNAVSPAN YPNLVNADYD INDFISTGSE RKAFKGAWQP RIGFSYELDD DARFVLFGGF
GRSYDRNQFD FLQQEISVGS FATRTFNFNT GDPFNICAPG PTCVTWDPIY LTEAGRQQLL
AQAGPGGGRE LRFITNKLKV PYSDQFSLGL RARVTPLFEA EVGYSHVESK DGFAYLLGNR
RPDGSFFPPA PAAPSSPFGF APPGFGSIII GTNGLETRAD TGYLKLVKNY TAASPWSLAA
TYTYTEAEEN RNFGEVFSLD FPSIEDYNFA RSAGVRKHRF VAAGSVDLPI GVTLSGKFTL
ASPPYLKAFV NTGGDNPSRT VISNEAKGNG DRWGLRQFDL AIIKYIPFRF ISDESRLRLR
LDIINLFNDR NFVDYNNNPA DNTRTPSSPT IYREISGIGV GGNPPRTVKV SAGFSF