Gene Sala_0027 details


Gene Information

Locus tag: Sala_0027
Symbol:
ID: 4082214
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 22212
End bp: 24476
Gene Length: 2265 bp
Protein Length: 754 aa
Translation table: 11
GC content: 65%
IMG OID: 638008387
Product: TonB-dependent receptor
Protein accession: YP_615086
Protein GI: 103485525
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID:
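
The start, end, gene length and protein length values above are consistent with one another. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the record does not state either assumption explicitly, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for a CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon codes for one residue except the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(22212, 24476)
print(length, protein_length_aa(length))  # 2265 754
```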


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.634145
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0299657
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCCATC TTCCTTCCGC CCTTGCCATG AGCGTCGCCC TCGCTGCCGC CGCGGTCGCC 
CCCGCCGCCG CACAGGAGGC TTCGGTCGTC GCCGCCGAAG AAGGCAGCGG CCTTGGCGAC
ATCGTCGTCA CGGCGCAGCG CCGCGAGGAA AGCCTGCAGG ATGTGCCGGT GTCGGTCAGC
GTATTGTCGG GCGACACGCT GGGCGCGATC ACCTCGACCG GCGCCGACAT CCGCGCGCTC
GCGGGCCGCG TCCCCAGCCT CAACATCGAA AGCTCCTATG GCCGCTCCTT TCCGCGCTTC
TATATCCGCG GCCTCGGCAA CACCGACTTC GACCTCAACG CCTCGCAACC GGTCAGTCTC
GTTTACGACG AGGTCGTGCT CGAAAACCCG ATCCTGAAAG GCTTTCCGAT CTTCGACCTC
GACCGCGTCG AGGTGCTGCG CGGGCCGCAG GGCACGCTCT TCGGCCGCAA CACCCCCGCG
GGCATCGTCA AGTTCGACAC CGTCAAGCCC GGCAAGACCG GCGGCTATGC GCGCGCCAGC
TATGGCCGTT ACGGCACCAG CCAGGTCGAA GTCGCGGCGG GCGCCGCCGA CGACAATGGC
TTTTCGGTGC GCCTCTCGGG CCTCTATCAG CATCGCGACG ACTGGGTCGA CAATATCGCC
ACCACCGCAA AGGACGATCT CGGCGGCTAT GACGACATCG CCGCGCGGCT CCAGTTGCAA
TATGAAAACG GCCCCTTTAC CGGTCGCGTA ACCGGGCAGG TTCGCGTCTT CGACGGCTCG
GCGATCATCT TCCGCGCCAA TACGCAGCTG CCGGGCAGCA ACCGCCTTGT CGGCCTCGGC
GGCGCGGACA CCGCGTTCGA GCGCGACAAG GTGTGGCAGG ACGGAATCAA TTTCCAGAAG
CTCAACACCT ATAATGCCGC ATTGAACCTC GAATATGATT TCGGCGCGGT GACCGCCTAT
TCGATCACCT CCTACTGGAA CGGCAATTTC AAGAGCCGCG GCGACATCGA CGGCGGCTTC
GGCGCGGTAT TCCTGCCCGT ATCGGGTCCC GGACTGATTC CCTTCGCGGC GCAGAGCCAG
GACAATGTGC CCAGCCTCGA CCAGTTCACG CAGGAAATCC GCATCGCCTC GAACAACAGC
GGCGGGCTCG GTTACCAGTT CGGCGCCTTC TATTTCGACG AAGGGCTCGA CATCACCAGC
TTCGATTTCG GCGGTCCGAC CGACGCAGCC CCTGCGGCGA TCGCGGTGCA GCGGCAGGAC
AGCGAAGCCT ATGGCATCTT CGGTTCGGTC AATTACGCCT TCGAGGGCGG CCTGACGCTT
CAGGCCGGCG CGCGCTACAA CCACGACACG CGCGATTTCG TCGCGGCGCG CCCGGTCGAA
ACGCGCCCGA TCTTCGTCGT CAATCCGAAC ACGCCGGTCC CGCCCCAAAG CGCGAGGGTC
AAGGGCAAGC TGCTGACATG GGACGCCAGC GCGACCTGGG AAGCGTCGGA CGCCGTGACG
TTCTACGCCC GCGTTGCGCG CGGCTATCGC GCGCCGTCGG TGCAGGGCCG CCTGACCTTC
TCGCGCGTGA TTTCGACCGC CGATCAGGAA GAAACGATGT CGTATGAGGC GGGGATCAAG
ACCGCCTTCC TCGACGACCG CGTCCGCTTC AACCTCACCG GCTATTATTT CGATACCAAG
GATCTTCAGC TGACCGCGGT CGGCGGCACG GCGAACGTCG CCAGCCTGCT CAACGTCGAT
GCCAAGGGCC ATGGGATCGA GGCCGAATTG CAGGCGGCGC CTGCGCGCGG GCTGACCTTC
AGCGTGGGCG GCGCGTGGAA CGTCGCCGAG ATCGACGATG CCAACGCTTT TGTCGCGGGC
TGTGGTTCGG CGACGCCGTG CACGGTGCTC GACCCGCAGC GGCCGGGCAG CCCCGGCATC
TTCTCGATCG ACGGCAACCA GCTGCCGCAG TCGCCCAAGT GGACGCTGAA TGCGACCGCG
GGTTATGAAA TCCCCGTCGG CGACGGCGCC ATCTATGCCT TCACCGACTG GTATTACCGG
TCGAAGGTGC AGTTCTTCCT CTATCAGTCG GTCGAGTTCT CGGACGACAA GATGATCGAG
GGCGGGCTGC GCGTCGGCTA CCGCACCGAC CGTTTCGACG TCGCGGCCTT TGTGCGCAAC
ATCACCAACG ATGAATCGCC GACCGGCGGC ATCGATTTCA ACAACCTGAC GAGCTATGTC
AACGAACCCC GCATCTGGGG CGTCGAGGCG GGCGTGAAGT TCTAA
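
The gene sequence above is laid out in space-separated blocks of 10 bases. A minimal sketch of how such a listing could be joined into one string and its GC fraction computed is given below, for comparison with the GC content reported in the Gene Information section. The helper names `parse_blocks` and `gc_fraction` are hypothetical, and the rounding used by the database is not known.

```python
def parse_blocks(text: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the listing, copied from the record.
first_row = "ATGCGCCATC TTCCTTCCGC CCTTGCCATG AGCGTCGCCC TCGCTGCCGC CGCGGTCGCC"
seq = parse_blocks(first_row)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.1%}")
```

Passing the full listing to `parse_blocks` instead of the first row would give the figure for the whole gene.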
 
Protein sequence
MRHLPSALAM SVALAAAAVA PAAAQEASVV AAEEGSGLGD IVVTAQRREE SLQDVPVSVS 
VLSGDTLGAI TSTGADIRAL AGRVPSLNIE SSYGRSFPRF YIRGLGNTDF DLNASQPVSL
VYDEVVLENP ILKGFPIFDL DRVEVLRGPQ GTLFGRNTPA GIVKFDTVKP GKTGGYARAS
YGRYGTSQVE VAAGAADDNG FSVRLSGLYQ HRDDWVDNIA TTAKDDLGGY DDIAARLQLQ
YENGPFTGRV TGQVRVFDGS AIIFRANTQL PGSNRLVGLG GADTAFERDK VWQDGINFQK
LNTYNAALNL EYDFGAVTAY SITSYWNGNF KSRGDIDGGF GAVFLPVSGP GLIPFAAQSQ
DNVPSLDQFT QEIRIASNNS GGLGYQFGAF YFDEGLDITS FDFGGPTDAA PAAIAVQRQD
SEAYGIFGSV NYAFEGGLTL QAGARYNHDT RDFVAARPVE TRPIFVVNPN TPVPPQSARV
KGKLLTWDAS ATWEASDAVT FYARVARGYR APSVQGRLTF SRVISTADQE ETMSYEAGIK
TAFLDDRVRF NLTGYYFDTK DLQLTAVGGT ANVASLLNVD AKGHGIEAEL QAAPARGLTF
SVGGAWNVAE IDDANAFVAG CGSATPCTVL DPQRPGSPGI FSIDGNQLPQ SPKWTLNATA
GYEIPVGDGA IYAFTDWYYR SKVQFFLYQS VEFSDDKMIE GGLRVGYRTD RFDVAAFVRN
ITNDESPTGG IDFNNLTSYV NEPRIWGVEA GVKF
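
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the NCBI ordering of bases (T, C, A, G); translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may act as starts. The function name `translate` is hypothetical, and alternative start codons are not handled, which does not matter here because the gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' is stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
print(translate("ATGCGCCATC TTCCTTCCGC CCTTGCCATG"))  # MRHLPSALAM
```

The printed residues match the first block of the protein sequence listed above.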