Gene Sala_1010 details

Gene Information

Locus tag: Sala_1010
Symbol:
ID: 4081698
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 1036125
End bp: 1038434
Gene Length: 2310 bp
Protein Length: 769 aa
Translation table: 11
GC content: 60%
IMG OID: 638009370
Product: TonB-dependent receptor
Protein accession: YP_616060
Protein GI: 103486499
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID:
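
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical helpers introduced only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1036125, 1038434)
print(length)                     # 2310
print(protein_length_aa(length))  # 769
```

Both results match the Gene Length and Protein Length fields of the record.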


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.247668
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.797114
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGGGAG GGAGAGTGTT TATGAAAACG CATTTGATCG CGCTGTTTGC GGCATCCGCG 
CTTGTCAGCC CTGTTGTCGC CTTTGCCCAG GAAACGCCCG CCGCCGACGA TCAGGGGGGG
CTCGAGGAAA TCATCGTCAC CGCGCAAAAA CGCGCCGAGG GGCTGTCCGA CGTGCCGATC
TCGATCTCAG CGGTCAGCGG CAAGCAGGTC GAAAATTACG GGCAGACCAA TCTTGAACAA
ATCTCGTCCT CGGTCCCGAA CCTCAAGATC ACCCAGACGG CGATCGCCAA CCGTATCGCG
ATTCGCGGCA TCGCATCGGG CGACAACAAG GGGTTCGAAC AATCGGTCGC GATGTTCGTC
GACGGCGTCT ATTACGGTCG CGACCAGCTC TCGCGCCTGC CGCTCGTCGA CATGGAGCGC
GTCGAGGTGC TGCGCGGGCC GCAGCCGACC CTGTTCGGGA AAAATGCCAT TGCCGGCGCG
GTCAATATCA CGACGCGCGG CCCGACCGAC ACATTCGAAG GGTCGGTCAG CGGTCTTTAC
GAGTTCAACC ACAAGGAATT GCAGCTGACC GGCGTGCTGT CGGGACCGCT GAGCGACGGC
GTCGAGGCGC GCGTCGTGGG CTATCATCGC TCGATGGACG GCTATTTCTA TAATCAGGAG
CTCGATCGCG ACGAACCCAA TGTCGATGAG TATTATTTTC GCGGCAAGGT CGAACTGGAC
AAGGGCGGAC CGTTCGCGGC CGAACTGAAA CTGGAATATG CCGACTTTGA AATGAAGGGC
CAGCCGCGCG ACGTCTTCGG CGCGGTCGGC AATTACAACG CGGTTTTCCA GGGTCCCTTC
TTCGTCAGCA CCGACCCCGA TTTTGTCCGC GAGGACAATG GTTATGAAAG CCGAAACAAG
GTGTTCGGTG CCACACTTAA TGTCGACTTC GAAATCGGCG ATCACACGTT GACCGCCGTG
TCGGCCTTGC TCGATTACAA GACGCGCGAA ATTGTCGACG TCGATTTTTC GGGCATCAGC
TTCCTTGATG GCACCAATTT GCGTGAGGAT TACCGCCAGT TCAGCCAGGA ACTGCGGCTG
ACCTCGCCGG GTGGCGAGGT GTTCGACTAT ATCGCCGGGG TCTATTATCA GCACGCCAAA
CTCGACGTGC AGGATTTCAC CCTGTTCAAT CCGACCTTCC TCGCGCTCGG CGCGCCCTTC
AATGCGTTGG GCGACACCAG CAACGACCGC GATTACACGC AGAAATCCGA TCTGATCTCC
GCCTTTGCGC AAGGTGAATT ATCGGTCACC GACCAGCTGC GCATTACTGC GGGCGCGCGG
TTCAACCATG AGAAGAAGAG CGGCAGCCGC AGCCTGGCGA TCGTCCAGGG GCCGCTCAGC
ACGGCGCCCG CAGCGGTCGT TGCGGCCGTG TTCCGGGCGC TCAATATCGA AACGCACAGC
ATCTCCGGCA AGATCAGCGA AGACAGCTTC AATCCGATGG TCAACGTCCA GTATGACGCG
ACCGATGACC TGATGGTCTA TGCCTCCTAT GCCAAGGGCA CGAAGGCGGG CGGTTTCGAC
ATAAGGTCGA ACTCGCTGCC GACCTCGACG ACCGTGGCGC GGCCCGGCGC CTTTGTCTTC
GAGGACGAAA GCGCAGACAA TTTCGAGGCC GGCCTGAAAT ATAAAGGGCG CAATGTCGCT
TTCAACGTCT CGCTCTATCG CACCGATTAC AAGGATCTTC AGGTCAATAT CTTCGACGGC
ACGCTGAACT TCAACGTCCG CAACGCTGCT GAAGCGCGGA CGCAGGGGGT GGAGGCCGAT
TTCCGCGCGG CGCTCGCGCC GGGGCTGACG GTCAGCGGCG CGGTCGCCTA TCTCGACTTC
AAGTTCACCA ATTTCACCGA GGGCCAATGT TTTTATCTGC AGGTGCCCGG ACCGAATGGC
CTGTGCGATT ATTCCGGCAA GCGAAACGCC TTGAGCCCCG AATGGTCGGG CAATTTGAAC
CTCGACTACA CGACGCCGGT GACGAACGAG ATGAAGGTCG CGCTCAACAT CAACGCCGAT
TTCTCCTCTT CCTATATCGC CGCGGTAAAC CTCGATCCGC GTACGCATCA GGACGGATAT
GTAAAGCTGG GCGCGCGGTT TGCGCTGGCC GACGTTGATG ATCGCTGGGA GGTCGCGCTG
ATCGGCCGTA ACCTCACCAA CCAGCGGATA TTGCAGACGG CGAGTTCGAT GCCGCTCGCC
ACGACGATCA CGCGAGGGGC GGGCAATGCC TATAACGGCA TCGTCGACCG CCCGCGCACG
ATCGCGGTGC AACTGACGGG GCGCTTTTGA
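
The gene sequence above is printed in space-separated blocks of 10 bases, so it needs to be joined before any analysis. The following is a minimal sketch of how one might do that and compute a GC fraction to compare against the GC content field; `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any tool associated with this record.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by removing all whitespace."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGGTGGGAG GGAGAGTGTT TATGAAAACG CATTTGATCG CGCTGTTTGC GGCATCCGCG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 3))
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `gc_fraction` gives a value that can be compared with the reported GC content.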
 
Protein sequence
MVGGRVFMKT HLIALFAASA LVSPVVAFAQ ETPAADDQGG LEEIIVTAQK RAEGLSDVPI 
SISAVSGKQV ENYGQTNLEQ ISSSVPNLKI TQTAIANRIA IRGIASGDNK GFEQSVAMFV
DGVYYGRDQL SRLPLVDMER VEVLRGPQPT LFGKNAIAGA VNITTRGPTD TFEGSVSGLY
EFNHKELQLT GVLSGPLSDG VEARVVGYHR SMDGYFYNQE LDRDEPNVDE YYFRGKVELD
KGGPFAAELK LEYADFEMKG QPRDVFGAVG NYNAVFQGPF FVSTDPDFVR EDNGYESRNK
VFGATLNVDF EIGDHTLTAV SALLDYKTRE IVDVDFSGIS FLDGTNLRED YRQFSQELRL
TSPGGEVFDY IAGVYYQHAK LDVQDFTLFN PTFLALGAPF NALGDTSNDR DYTQKSDLIS
AFAQGELSVT DQLRITAGAR FNHEKKSGSR SLAIVQGPLS TAPAAVVAAV FRALNIETHS
ISGKISEDSF NPMVNVQYDA TDDLMVYASY AKGTKAGGFD IRSNSLPTST TVARPGAFVF
EDESADNFEA GLKYKGRNVA FNVSLYRTDY KDLQVNIFDG TLNFNVRNAA EARTQGVEAD
FRAALAPGLT VSGAVAYLDF KFTNFTEGQC FYLQVPGPNG LCDYSGKRNA LSPEWSGNLN
LDYTTPVTNE MKVALNINAD FSSSYIAAVN LDPRTHQDGY VKLGARFALA DVDDRWEVAL
IGRNLTNQRI LQTASSMPLA TTITRGAGNA YNGIVDRPRT IAVQLTGRF