Gene Sala_1004 details


Gene Information

Locus tag: Sala_1004
Symbol:
ID: 4081692
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 1026974
End bp: 1028830
Gene Length: 1857 bp
Protein Length: 618 aa
Translation table: 11
GC content: 65%
IMG OID: 638009364
Product: TonB-dependent receptor
Protein accession: YP_616054
Protein GI: 103486493
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG4206] Outer membrane cobalamin receptor protein
TIGRFAM ID: [TIGR01779] TonB-dependent vitamin B12 receptor
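
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1026974
end_bp = 1028830
gene_length_bp = 1857
protein_length_aa = 618

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = span // 3
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("span is a multiple of 3:", span % 3 == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks should agree with the listed values.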


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0684927
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCAGAT ATTCGACCAT TTCCATGATC GCCCTTGCCG TTGCGACCCC TGTCGTTGCG 
CAGGAAACCA ACCCCGATGC CGACGACATC GTCGTTACCG CATCGGGCAT CGAACAGCCC
GGCGACGAGG TCGGTCAGGC GATCACGGTG ATCGACGCCG ACACGATCGA AACGCGGCAA
ACGATCGACG TCGTCGACCT GCTCGCAACG ACGCCCGGCA TCCGATTCAA TCGCACCGGC
ACCACCGGAT CGGTCACCGG CGTGTCGATC CGCGGCGCCG AGACGACGCA GACCCTGGTG
CTGATCGACG GGGTCAAGGT CAACGACCCC AGCGGCATCG GCGACGGTTA TGATTTCGGC
CATCTGCTCA CCGGCAATAT CCGCCGCATC GAGGTGCTGC GCGGATCAAA CTCGGTCGTC
CATGGCAGCC AGGCGATCGG CGGCGTGGTC AATGTGATGA CGGCGACGCC CGGCTCGGGT
TTCGCAGGCG GCGCATCGGC CGAATATGGC TATAGCGACA CCGTCAACGC CAGGGCCGAC
CTGTCGGGCA CCACCGGGCC GGTGTCGGGC AGCATCGGTG GCGCCTATTT CCGCACCGAC
GGCATTTCGT CGGCGGCGGT CGGCACCGAG CGCGACGGAT ATGAAAATAT CGCCGCCAAT
GCGCGGCTGA AGGTCGCGTT CAGCGACGCG CTGAGCCTTG ATCTGCGCGG CTATTACATC
AACGCCGACC TCGATTTCGA CAGCTTCTTC GGCGCTCCGG CCGACAGCGC CGACGTCGCC
AGGCTCGACC AATATGTCGG CTATGCCGGG CTGAACCTCG GCCTCTTCGA CGGTCGCTTC
ACCAGTCGCG CCGCGGTGAC ATGGATGCGC AACGAGCGCG ACTATTTCTT CGTGCGCGGC
ACCGCCCCCG ACTTTGGCTA TAAGGGCACG AACCTGCGCT TCGAATATCA GGGCGTCGTG
GCCCCCGCCG ACACCGCCAG GCTGGTGTTC GGTTACGAGC ATGAGCGCCC CGAATATGAC
TTCTTCGGCT TTGGTTCGAC CGACAGCCAG AAGGCCAATA TCGACAGCGT CTATGCACTC
GCCATCGTCC AGCCGCTCGC CGGCCTGTCG CTGACCGGCG GCGTTCGCCA CGACGACCAT
AACCAGTTCG GCGGCGCCAC GACCTTTGGC GCCAACGCCA ATTATTCGCC CAACGGCGGC
GCCACCAATG TCCGCTTGAG CTATGGCGAG GGGTTCAAGG CGCCGTCGCT CTTTCAGCTC
TACGACAGCT TCAGCGGCAA TGCGGCGCTG CGTCCCGAAC GCTCGAAAAG CTATGACGTC
GGCATCGACC AGAGCCTCGC CGACGGGCGC GCGCTGGTGT CGCTCACCGC CTTCCTGCGC
AACACGACCG ACCAGATCAA TTTCGACAAT GCGACCTTCA CCTATGGCAA TATCGACCGC
ACGCGCGCCA AGGGGATCGA GGCGACGCTG GCGCTGAAAC CCGTCGACGC ACTGAACGTC
ACGGCGTCGT ACAGCTATAT CGACGCGCGC GACCGGTCGG GGCGGCCCGC GTTCGACGGC
AAGCGCCTGC CGCGCCGCGC CGCGCATGCG GTCAGCCTGT CGGCGGACCA TGACTGGTCG
TTCGGGCTGT CGGCCGGCGC GACGGTCACG ATGGTCGGCG ACAGTTTCGA CAATGCGACG
AATATGGTGC GGCTCGACGG CTATGCGCTC GCCGGGGTGC GCGCGTCGTT CGCGGTTACC
GAGCGGATCG AAGTGTACGG CCGCGTCGAT AATCTGTTCG ACGCCGATTA TGCGACGGCG
TTCAACTTTG GCACCTATGG CCGCGCCGCC CATGGCGGGG TGCGGGCGCG ATTCTGA
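
The gene sequence is shown in space-separated blocks of ten bases, so it has to be cleaned before it can be measured. The sketch below is an illustration, not part of the record. It strips the block formatting and computes the percentage of G and C bases, which is one way to compare a sequence against the listed GC content. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used here only as sample input.
first_line = "ATGTTCAGAT ATTCGACCAT TTCCATGATC GCCCTTGCCG TTGCGACCCC TGTCGTTGCG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence text to `clean_sequence` in place of `first_line` would give the length and GC percentage for the whole gene.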
 
Protein sequence
MFRYSTISMI ALAVATPVVA QETNPDADDI VVTASGIEQP GDEVGQAITV IDADTIETRQ 
TIDVVDLLAT TPGIRFNRTG TTGSVTGVSI RGAETTQTLV LIDGVKVNDP SGIGDGYDFG
HLLTGNIRRI EVLRGSNSVV HGSQAIGGVV NVMTATPGSG FAGGASAEYG YSDTVNARAD
LSGTTGPVSG SIGGAYFRTD GISSAAVGTE RDGYENIAAN ARLKVAFSDA LSLDLRGYYI
NADLDFDSFF GAPADSADVA RLDQYVGYAG LNLGLFDGRF TSRAAVTWMR NERDYFFVRG
TAPDFGYKGT NLRFEYQGVV APADTARLVF GYEHERPEYD FFGFGSTDSQ KANIDSVYAL
AIVQPLAGLS LTGGVRHDDH NQFGGATTFG ANANYSPNGG ATNVRLSYGE GFKAPSLFQL
YDSFSGNAAL RPERSKSYDV GIDQSLADGR ALVSLTAFLR NTTDQINFDN ATFTYGNIDR
TRAKGIEATL ALKPVDALNV TASYSYIDAR DRSGRPAFDG KRLPRRAAHA VSLSADHDWS
FGLSAGATVT MVGDSFDNAT NMVRLDGYAL AGVRASFAVT ERIEVYGRVD NLFDADYATA
FNFGTYGRAA HGGVRARF