Gene Sala_3108 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_3108 
Symbol 
ID4082695 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp3256229 
End bp3259291 
Gene Length3063 bp 
Protein Length1020 aa 
Translation table11 
GC content64% 
IMG OID638011494 
ProductTonB-dependent receptor 
Protein accessionYP_618145 
Protein GI103488584 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.2566 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGAAAA CCCACTATTC CAAGCTGAAA CTGGGTGCCG CGCCGCTCGT GCTGAGCGTC 
GCCCTGGTTT CGGCTCCTGC CTATGCGCAG GACGCCGAGG AAGGCGCCAC CGGTTCGTCG
GAAATCGTCG TTACCGGTAC GCTGATCCGC AATCCGAACC TCGAACAGTC GACGCCGGTC
AACGTCACGA CCGCCGACAC CATCGAGCTG AAGCAGTCGA ACGTCGCCGA AGAGGTTCTG
CGCGAACTGC CCGGCGTGGT CGCGAACATC GGTTCGGCGG TCAACAACGG TAACAGCGGT
GCGTCGTACG TCGACCTTCG CGGTCTCGGT TCGATCCGCA ACATCGTGCT GCTGAACGGC
AACCGCGTTG CGCCGTCGGA CGTCAACGGC CGCGTCGACC TCAACAACAT CCCGCTCGCG
CTGATCGAGC GCGTCGATGC GCTCACCGGC GCGGCGGTGA CCACCTATGG CGCCGACGCG
ATCACCGGCG TCGTCAACTT CGTCACCAAG CGCGATTTCG CAGGTCTCGA AGTCACCGCG
TCGAACCAGA TCACCGAAGA GGGCGACGGC CACATCTTCC GCGTCGACGC GACGATCGGC
GCGAACTTCG ACGACGGTCG CGGCAACGCG GTGCTGAGCA TCGGTTACCA GCAGGCCGAC
CCCGTCTATC AGGGCGCACG TCCTTTCTCG AACGACACGC TCGACAGCTA TTCGAACCAG
TTCATTGGTT CGGGCACCTC GGTTCCGTCG CGTTTCTCGG GCACCCGTCC GCTCGATCCG
GTCACCGGCC AGCCGAGCAC CGATCCGACG GTGGCCAATG GCGGCCTGCG CCAGGTCAAC
GCGGCGGGTG CTGCGGTTCC GACCTTCGCA ACCTATAACT TCAATCCGTT CAACATCTTC
CAGACGCCGT TCGAGCGCTT CAACATCTAC GCTCAAGCCA ATTATGAAGT GTCGGACTCG
GTTGAAGTCT ATACGCGCGG CATGTTCTCG AAGAACACCG TTTCGACGAT CATCGCCCCC
TCGGGTTCGT TCGGCGGCAC GGTGACGGTC AACCTCAACA ATCCCTATCT GCCGTCGACC
CTGCGCAACC AGTTCTGTGC ATTCAACGTC GCCGCGGCAG GATCGGGCCT CTATACCCCG
CGCTTCACGC CCGCCGAATG TGCGGCTGCG GCCACCGCGA CGGGTCGTAC CGATCCGAAC
TATCGTGAAG TGACCGTCAC GCTGAACCGT CGCACGCCCG AAGTCGGTCC GCGTATCAGC
GACTATCAGA CGACCTTCTT CGACTATCGC GTCGGCGCCC GCGGCGGCAT CACCGACACG
ATCGACTGGA GCGTCGAGGG CGCCTATGGC GAATCGGAAA ATATCCAGAC GATTCAAAAC
TATACGCTGC AATCGCGCTT CCGCGAAGCC GCGCTGGCGA ACAACACGAC GACCTGTCAG
AGCGGCAACG CCAACTGCGT TCCGGTCAAT CTGTTCGGCC CCGAAGGCTC GATCACGCCC
GAAATGGCCG ATTATCTGTC GGAAAACAGC TCGACGACCA ACCGCACCTC GCTGGCGCAG
GTTCGTGCGA TCGTCTCGGG CGACCTCGGC TTCGCCTCGC CGGGCGCGGT TCAGCCGATC
GGCTTCGCGC TGGGCGGCGA ATATCGCAAA TATACCGCGC AGCAAGCGTC GGATCTGCTC
GCCAAGACGC CGGGTGAACT CGGCGGCGCC GGTGGTGCGG CCCCGGACAT CGACGGCGCC
TATGACGTCT ATGAAGCCTA TGCCGAAATC GTCGCGCCGC TGATCGAGGA CAAGCCTTTC
TTTGAAAGCC TGACCCTCGA AGCCGGCGTG CGCTATTCGG ATTACAGCAT CGAAGGCGCT
GGCGGCTATG ACACCTGGAC CTGGAAAGCC GGCGGCAGCT GGGAACCGGG CGCGGGTGTC
AAGTTCCGCG GCAACTACAG CCGCGCGGTC CGCGCGCCGA ACATCGGCGA GCTGTTCACG
CCGCAGACGG TTGGCCTCAC CAACCTCGGC ATCGACCCCT GCGCCGGTGC GGCGCCGACG
ACCGACGCCA ACCTGCGCGC GGTCTGTATC GCGCAGGGCG CTCCGGCGGG GACGATCGGT
TCGATCACCA ACCCGACCGC GGCGCAGGCC AATATCACGG TCGGCGGTAA CCTGAACCTG
CAGCCCGAAA AGGCGGACAC CTGGACGATT GGTCTGGTCT GGCAGCCGGA CTTCCTGCCG
CGCTTCAACA TGTCGATCGA CTATTACAAC ATCAAGATCA ACGACGTGTT GGGCACCCCG
CTGCCGGGCG ACATCATCGC CGCCTGTTTC GACAATGTCA CGGCGGCGAG CGCAAGCGAC
CCCGCCTGTA CGTCGATCCG CCGCAACCCG ATCACCGGCG GTCTCGACGG CGATCCGTCG
ACCACGCCCG GCCTGTTCGG CCTGACGAAC AACCAGGGCC GCCTGTTCAC CGACGGTATC
GACCTGTTGA TGAACTACAG CACCGACCTT GGCTTCGCCA GCCTCGACTG GTCGTTCGTC
GGCAACTGGA CGAACAGCTC GAAGTTCAAC GCCAATGTCG CCAACCCGGA TTCGCTCAAC
CGCGAATGCG TCGGCTACTA CAGCGTGAAC TGCTCGTTCA CCGGCTCGAT CCAGCCGGAG
TTCCAGTTCT CGAACCGCTT CACGCTGGGC TTCGACAAGG TTGACCTGTC GCTGCTGTGG
CGCTGGATCG ACCGGGTCGA TTTCGAACCC CAGCAGCTGC AGGACGACAT CGACGCCGCG
GTTGCTGCTG GCACCAGCCC GACCACCGGT TGTCCGGACC CGACGGGAAC CGACCCGAAC
GGCTGCGTCG TCGATCCGCA GTTCCGCTCG GTCAAGGCGC AGCACTATTT CGACCTGACG
GCGCGTTTCA ACGTCAGCGA GAATCTGGTG TTCACCGCGA CGGTGCAGAA CCTGCTCGAC
AACAAGCCGC CGCTGCTCGG CAACACGATC GGTTCGACCA CCTACAACAG CGGCAATACC
TATCCGTCGA CCTACGACGC GCTGGGCCGC CGCTATGCGG TGTCGGCGAA GCTGAAGTTC
TAG
 
Protein sequence
MKKTHYSKLK LGAAPLVLSV ALVSAPAYAQ DAEEGATGSS EIVVTGTLIR NPNLEQSTPV 
NVTTADTIEL KQSNVAEEVL RELPGVVANI GSAVNNGNSG ASYVDLRGLG SIRNIVLLNG
NRVAPSDVNG RVDLNNIPLA LIERVDALTG AAVTTYGADA ITGVVNFVTK RDFAGLEVTA
SNQITEEGDG HIFRVDATIG ANFDDGRGNA VLSIGYQQAD PVYQGARPFS NDTLDSYSNQ
FIGSGTSVPS RFSGTRPLDP VTGQPSTDPT VANGGLRQVN AAGAAVPTFA TYNFNPFNIF
QTPFERFNIY AQANYEVSDS VEVYTRGMFS KNTVSTIIAP SGSFGGTVTV NLNNPYLPST
LRNQFCAFNV AAAGSGLYTP RFTPAECAAA ATATGRTDPN YREVTVTLNR RTPEVGPRIS
DYQTTFFDYR VGARGGITDT IDWSVEGAYG ESENIQTIQN YTLQSRFREA ALANNTTTCQ
SGNANCVPVN LFGPEGSITP EMADYLSENS STTNRTSLAQ VRAIVSGDLG FASPGAVQPI
GFALGGEYRK YTAQQASDLL AKTPGELGGA GGAAPDIDGA YDVYEAYAEI VAPLIEDKPF
FESLTLEAGV RYSDYSIEGA GGYDTWTWKA GGSWEPGAGV KFRGNYSRAV RAPNIGELFT
PQTVGLTNLG IDPCAGAAPT TDANLRAVCI AQGAPAGTIG SITNPTAAQA NITVGGNLNL
QPEKADTWTI GLVWQPDFLP RFNMSIDYYN IKINDVLGTP LPGDIIAACF DNVTAASASD
PACTSIRRNP ITGGLDGDPS TTPGLFGLTN NQGRLFTDGI DLLMNYSTDL GFASLDWSFV
GNWTNSSKFN ANVANPDSLN RECVGYYSVN CSFTGSIQPE FQFSNRFTLG FDKVDLSLLW
RWIDRVDFEP QQLQDDIDAA VAAGTSPTTG CPDPTGTDPN GCVVDPQFRS VKAQHYFDLT
ARFNVSENLV FTATVQNLLD NKPPLLGNTI GSTTYNSGNT YPSTYDALGR RYAVSAKLKF