Gene Sala_3041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_3041 
Symbol 
ID4083049 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp3188443 
End bp3191238 
Gene Length2796 bp 
Protein Length931 aa 
Translation table11 
GC content60% 
IMG OID638011427 
ProductTonB-dependent receptor 
Protein accessionYP_618078 
Protein GI103488517 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01782] TonB-dependent receptor 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000614421 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.287923 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAATTC AAGCAATGCG GAGCGTTCGC ACTTTGACCC GGCTGGCGTG CGGGGCCTCG 
CTTGCCGTGC TGGCGGCGTC GCCGGTGTGG GCGCAGGATG CTGCTGAAGA GGCGGTCGCC
AGCGAAGACG AGATCGTCGT CACCGGCTTT CGCGCCTCGC TCGACGAAGC ACTCAACCAG
AAACGCGATT CGATCTCGGC TGTCGACGTC ATCGTGGCGG AGGACATTGC GAAATTCCCG
GATCAGAATC TGGCTGAATC GCTGCAACGC ATTCCCGGCA TCTCCATCCA GCGCGACGCT
GGCGAAGGCC GCGCGATCAC GGTTCGCGGG CTTGGCGCAC AGTTCACGCG CGTTCGCCTG
AACGGCATGG AAACCATCGC GACGTCGACC GACGGCGCCG CGGCGAACCG CGACCGCGCG
TTCGATTTCA ACGTCTTTGC CTCCGAACTC TTCACCTCGC TGGTGGTCCA TAAGACCGCC
TCGGCCTCGC TCGACGAAGG CTCGCTGGGG GCAGTGGTCG ATCTCAACAC CGGCAATCCG
CTGGGCGGCA AGGAAGGGCT GACGCTCGTT GGCTCGGCGC AGGCGCGTTA CAACGACCTC
ACCGAAAATG TCGATCCGCG GCTGGCAGGC CTGATCGCCT GGACCGACGC CGATCGCACC
TTCGGCGTTT CGGCGTCGGT CGCCTGGTCC GATTATACGA CGGACGAACT GGGCAATAAC
AGCGTTCGCT GGGCGCAGGC GCCGTTCCGC AGTGTAGATG GTGTAACTTG CCTTTCGGGC
TCCAGTTTCG TCGCAAATCC CTCGGCCGGC TGTATCGAGG TTGCCGAAGC GTTCCACCCG
CGCATCCCGC GTTACGGCCT GGTGCGGCAT GAACGCGAAC GGCTGGGTGC GACCGCCTCG
ATCCAGTTCG AGCCAAGCGA GAATACGAAG ATTTCGATCG ACGGGCTATA TTCGCGGTTC
AAGGAAGTGC GCGACGAATA TTGGGGCGAA GTGCTGTTGC GCAGCGAAGA GCGCGGCATC
GATGTGTCCA ACTACACGAT CGATGAAGAC AACAATCTGA TCGCGGCGGA TCTCGATAAT
GTGTACATCC GCACCGAACG CTATTCGCGT GAGAGCGAGA CCGAATTCTA CCAATTGTCG
GGGCGCCTGG AACAGCGGCT GACCGACACG TTCAAGATCA ATCTGCTTGG CGGTTTTTCG
AAGTCGCAGG CCGATATCCC GATCGAAACG ACGCTGGCGT TCGACGACCG GGACGGCACC
GGGTATCGCT ATGATTATAC GAACATGAAG TTCCCGGTTC TCAGCTTCGG TCCGGGCATC
GAGGATCCTT CGTCGTTCGT GCTCGCCGAA TTCCGCGATC GCCCGTCGTT CCTGACCAAC
AAGTTCAAGA CGGTGTCGCT CGATTTCGAC TGGGACGTGG CCGACCGGTT CAAGCTGCTT
GGCGGCGGTT TCTATCGTCA GTTCGATTTC GATACGGTCG GCTTCCGCCG CGACAGCACC
TATTGTGCCG CCTTCACCTG CGCGCCCGGT ACCACCGGCC TGCCGGTAAC GGCTGACATT
GCCGAACTGT TCAAGCTGGG CAAAGCGGGG CAGCCATCGG GCAATACCAA CGCCTGGATC
GTGCCAGATC TCGATGCGGG GACGGCGCTG ATCGACCTCT ACAACCGTCC GGCCGTCCTT
CAGCAAGGTG AACAGCGCGC AGTCACCGAA AAAACCTATG GCGGCTGGTT TATGACCGAG
TTCGAAACCG AACTCGCCGG GATGCGGCTG ACCGGTAATG CCGGCGTGCG CTATGCCAAA
ACCGAGCAAT CGTCTTCGGG TTTCACAAGC GGAACATTCG TTACGGTCGA CCGGACCTAT
GACGACTGGT TGCCGTCGTT CAACCTGAAC CTGCATCCGA CCGAGAACAT CATTCTGCGC
GGGGCGATCG CGAAGGTGAT CACGCGCCCC ACGCTCGGCA ATCTGTCGCC GGGCGGTGCC
GTCGATCAGT TTAATTTCCG GATCACCTCG GGTAACCCGT TCCTCGATCC GTTCCGCGCG
ACGACTTTCG ATCTGGCCGC CGAATGGTAT TTCGCACCGG GAGCACTCGC TTCGGTGGCT
TTGTTCGCCA AGGATATCGA AAGCTTCCCC ATTTCTACTT CACTTCAGGG GACATATGCC
GACAGCGGCC TGCCGCTCTC GCTTCTGACG CCGGGAACCC CGGCTTATGA TGCTGTCGTG
GGCGGTTCGA ACCCCAACCG GCAGTTCGAG TTCCGTACGA CCGGAAACGG CCCCGGCGCG
AGCCTCAAGG GCATGGAATT GTCGCTCCAG CTACCATTCT CTGTGTTCTC GGACTCGTTG
CGTCACTTTG GCGTGCTCGG CAATGCGACG TTCGTAAAAA GCAATGTCGA CTACACGATT
GCCGGGCCCT TGGCATATGA TCCTGTCGAT AACCGGCTCG AGGCACAGCC AGCGGGCGTC
TATACCCAGC CGCTGCTCAA CCTTGCCAAG CGGGCGTGGA ATGCCACCGT CTATTACGAC
GACGGCAAAT TCTCGGCGCG TACGTCGGCG GCATGGCGCA GCGGCTATAA TGACGGCACC
AGCGGCAACG GCAATGTGTT CGAAGGTTAT GGCAGTTCGT TCAACGTCGA TGCGTCGATC
CGTTATGCGA TCACCGAAAA TATCGAACTG TCGATCGAGG GGACGAACCT TACGGACGAT
TATCGCTATC GCTTCACCGA CCTCGAAGCG AATCGCAACT ATGAGAACAA CCACTACGGC
CGCACCTTCC TGTTCGGTGC GCGCGTCAAG ATCTGA
 
Protein sequence
MSIQAMRSVR TLTRLACGAS LAVLAASPVW AQDAAEEAVA SEDEIVVTGF RASLDEALNQ 
KRDSISAVDV IVAEDIAKFP DQNLAESLQR IPGISIQRDA GEGRAITVRG LGAQFTRVRL
NGMETIATST DGAAANRDRA FDFNVFASEL FTSLVVHKTA SASLDEGSLG AVVDLNTGNP
LGGKEGLTLV GSAQARYNDL TENVDPRLAG LIAWTDADRT FGVSASVAWS DYTTDELGNN
SVRWAQAPFR SVDGVTCLSG SSFVANPSAG CIEVAEAFHP RIPRYGLVRH ERERLGATAS
IQFEPSENTK ISIDGLYSRF KEVRDEYWGE VLLRSEERGI DVSNYTIDED NNLIAADLDN
VYIRTERYSR ESETEFYQLS GRLEQRLTDT FKINLLGGFS KSQADIPIET TLAFDDRDGT
GYRYDYTNMK FPVLSFGPGI EDPSSFVLAE FRDRPSFLTN KFKTVSLDFD WDVADRFKLL
GGGFYRQFDF DTVGFRRDST YCAAFTCAPG TTGLPVTADI AELFKLGKAG QPSGNTNAWI
VPDLDAGTAL IDLYNRPAVL QQGEQRAVTE KTYGGWFMTE FETELAGMRL TGNAGVRYAK
TEQSSSGFTS GTFVTVDRTY DDWLPSFNLN LHPTENIILR GAIAKVITRP TLGNLSPGGA
VDQFNFRITS GNPFLDPFRA TTFDLAAEWY FAPGALASVA LFAKDIESFP ISTSLQGTYA
DSGLPLSLLT PGTPAYDAVV GGSNPNRQFE FRTTGNGPGA SLKGMELSLQ LPFSVFSDSL
RHFGVLGNAT FVKSNVDYTI AGPLAYDPVD NRLEAQPAGV YTQPLLNLAK RAWNATVYYD
DGKFSARTSA AWRSGYNDGT SGNGNVFEGY GSSFNVDASI RYAITENIEL SIEGTNLTDD
YRYRFTDLEA NRNYENNHYG RTFLFGARVK I