Gene Sala_0914 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_0914 
Symbol 
ID4083124 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp923955 
End bp926918 
Gene Length2964 bp 
Protein Length987 aa 
Translation table11 
GC content60% 
IMG OID638009275 
ProductTonB-dependent receptor 
Protein accessionYP_615965 
Protein GI103486404 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4771] Outer membrane receptor for ferrienterochelin and colicins 
TIGRFAM ID[TIGR01782] TonB-dependent receptor 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGATT CGCGGAAGGT CCGTTTCGAT CGCTTGGCAA TGGGCGTCTC GCTGCTTGCG 
ATCGGCATCG CATGTCAACC GGCCCTCGCC CAGGATGACG TCGTTGCGCA GAATGAGGAC
GACGCGATCG TCGTTACCGG CATCCGTGCA AGTCTCGAAC AGGCGCAAAA TATCAAGCGC
AATGCACAAG GCGTGGTCGA TGCCATTTCC GCCGAAGACA TCGGCAAGTT CCCCGACACC
AATCTGGCGG AGTCGCTGCA ACGCATCACC GGCGTATCGA TCGACAGGGC CAGCGGGGAA
GGGTCGACCG TCACGGTCCG CGGTTTCGGC CCCGAGTTCA ACCTTGTCGT GGTGAACGGC
CGCCAGATGC CGACATCGAC CCTCGGTGAC GGCTTCTCGG CACCATCGAC ACGCAGCTTC
GATTTCGGCA ACCTCGCCGC CGAAGGCGTC GCCGGGGTCG AGGTTTACAA GTCGGGCCGC
GCTTCGCTTC CCACCGGCGG TATCGGCTCG GTCATCAACA TCAAACTGCC ACGACCGCTG
GATCGCCCCG GCCTCAATGG CAGCATCGGT GTCAAGGGCG TTTACGACAC GTCGCAACTT
TATGGCACGG ACGTCACGCC CGAGATCTCG GGCATCATCA GCAAGACCTT CGCCGATGAC
CGCATCGGCA TCCTGCTCGC CGGTTCCTAT CAGAAGCGTA ATGGCGGCCA GGCGCAGTTC
ACTGTGGGCA ATTGGCGGCC TGGCTATACG GGTGCGGAAA ACAACTGGGG TACGCTTGCG
CAACCCGGAG ACCCGCGTTT CGCCAATATC GAAAACCGCC CCGGACCCAC CGACATTTAC
CAGGTTCCGC AGAATGCGGC GTATGATTTT TACGATTTCC AGCGCGAGCG GATCAACGGC
CAGGCGGTGC TGCAGTTCAA GCCGACCGAC AGCCTCGATG TGTCGCTCGA CTATATTTTT
GCGCAAAACA CCTTTGACAG CCGCCAGAGC AGCATCGGCG TGTGGTTCAA TCACAATGAC
ACGTCGAGCA GCTGGACCAA CGGCCCCGCC GCGGGCGCCA ATTTCTACGC GGAGAATTTC
GCTGCCGCCG AAGGCAAGGA TCTGGCGATC ACCGGCGCGG TCGGTTCGAA CCGCAATATC
AACCATTCGT TCGGCGGCAA CATCAAGTGG GAAGGGCCCT GGGGGCTGCG CCTCGAACTC
GACGGCCACC ATTCCACCGC GGAATCGAAG CCGACCTCGC CTTATGGCAG CGGGATTGCC
GTCGGTACGG CCATCTTTGG CGTTGCCAGC CAGAGAGTGG ACTTCACCAC CGACATGCCG
GTCATTTCGG TCGCCATGCA CCCCGGTTCC GAGATCGCGG CTGAGAATAT TCGTCCGGCG
GGCAATGCTT TCCGCAATGC CTATATGAAG GATACGATCG ATCAGGTCAG CTTGCGTGGC
GGCTTCGATT TCGATGCGTC ATTCATCGAA AGCCTCGATT TCGGCGCGAG CTACCTCGAG
AATGATGTGC GGACGGCATT CGGGGTGATC CAGAATGACA CGTGGGGCGG AACCTTGTCT
GCGGCCGATA CGCCGGATTC GCTGTTCACG CCGCGCGCAC TCTCCCCCGA TCTTTCCGGC
ATGAGCGGAT CGAGGGACCC GGCCATCATT CCGACCTATT TTCTGATCGA CACGGCAGGG
CTGATTTCAC TCCTCGACGA CAGGCTCGGG ATCTGTGACG CCGCGCCGGG CGATACTTGT
CTGGCGCCTT ATTCCACCGA CCGGCGCATC TTGGAAAAGT CGATTACGCC CTGGGTGCAG
TCCTTCCACA GCTTCGATCT GGGCGATGCC AGGGCGAATC TGCGTCTTGG ACTGCGCTAT
GAGAAGACCA AGGTTACGTC GAGGGCGCTC GCCGAGATCC CCACGGGCAC CGTCTGGGTC
GCGCAGAATG AAATCAGCCT GGTCAAAACG CCCGGTACCG ATTTCACGAC GCTGAAAGGC
GAGTATGATA ACTGGCTTCC TGCGATCGAT TTCGATCTGT CGCCCTTTGA AAATGTGAAG
CTGCGCGCGT CTTACAGCCA CACGATTACC CGCCCGGACT ATGCCAGCAT GCAAGGCGGC
ATTACGCTCG CGCAGCCGCT GCGCGTCGGC GCGGGCGGCA GCCAGGCAAG CGCCGGCAAT
CCGGCTCTGC TCCCCTACAA GTCCAAGAAT GTCGATTTGT CGGCCGAATG GTATTATGAT
CGGGCAAGCT ATGTGTCGGT AGGCTTCTTC AACAAGAAGG TCAGCAATTT CATCGCCAAC
GACACCACCC AGACGCCTTT GTTCGACTTG CCCGATCCGT CGCAGGGCGC GGCGGCCACG
GCAGCGCGCC AGGCGCTCGG GCCGAATGCG AGTTTCGACC AGATCGTCGC CTGGGTTCAG
GCCAATCGGC CAGCCGACTA TGTGGCCGGC GTCGGGACGG CGGGTGGCGT TGCGGGTCGT
GCCGGCGATC CGAACGTCAT CTTCACCTTC ACCCAGCCAT CGAACAGCGA TCAGCAGGCG
CGGCTGTGGG GCTGGGAGTT TGCGATCCAG CATAATTTCT GGGATACCGG CTTCGGCGCG
ATCCTCAACT ACACGGTGGT CAACAGCGAT ACGGGATTTG ACAACACATT GCCGTGGACG
ACCACGCAAT TCGCTGTTCC GGGCGCGAGC GACAGCGCGA ACGCCGTGCT TTATTACGAC
AAAAACGGCT TGCAGGCGCG TATCGCCTAC AACTGGCGGG ACCGGTTCCT GGCCGGCAAG
GATCAGGATC CCTATTACAT CAAATCCTAT GGCCAGTTCG ACGCCAGCCT GAGTTACGAG
TTCAAGAAAG GCCTCACCGC GTTCGTCGAA GGTATCAACA TTACCAACGC CGACCGGACA
GGCGTGCGCC GTAACGATCG CGCGATCTTC TTCGCGGCGC CGGGCTATGC GCGTTATTCG
GCGGGCATTC GATTCACTTT CTGA
 
Protein sequence
MNDSRKVRFD RLAMGVSLLA IGIACQPALA QDDVVAQNED DAIVVTGIRA SLEQAQNIKR 
NAQGVVDAIS AEDIGKFPDT NLAESLQRIT GVSIDRASGE GSTVTVRGFG PEFNLVVVNG
RQMPTSTLGD GFSAPSTRSF DFGNLAAEGV AGVEVYKSGR ASLPTGGIGS VINIKLPRPL
DRPGLNGSIG VKGVYDTSQL YGTDVTPEIS GIISKTFADD RIGILLAGSY QKRNGGQAQF
TVGNWRPGYT GAENNWGTLA QPGDPRFANI ENRPGPTDIY QVPQNAAYDF YDFQRERING
QAVLQFKPTD SLDVSLDYIF AQNTFDSRQS SIGVWFNHND TSSSWTNGPA AGANFYAENF
AAAEGKDLAI TGAVGSNRNI NHSFGGNIKW EGPWGLRLEL DGHHSTAESK PTSPYGSGIA
VGTAIFGVAS QRVDFTTDMP VISVAMHPGS EIAAENIRPA GNAFRNAYMK DTIDQVSLRG
GFDFDASFIE SLDFGASYLE NDVRTAFGVI QNDTWGGTLS AADTPDSLFT PRALSPDLSG
MSGSRDPAII PTYFLIDTAG LISLLDDRLG ICDAAPGDTC LAPYSTDRRI LEKSITPWVQ
SFHSFDLGDA RANLRLGLRY EKTKVTSRAL AEIPTGTVWV AQNEISLVKT PGTDFTTLKG
EYDNWLPAID FDLSPFENVK LRASYSHTIT RPDYASMQGG ITLAQPLRVG AGGSQASAGN
PALLPYKSKN VDLSAEWYYD RASYVSVGFF NKKVSNFIAN DTTQTPLFDL PDPSQGAAAT
AARQALGPNA SFDQIVAWVQ ANRPADYVAG VGTAGGVAGR AGDPNVIFTF TQPSNSDQQA
RLWGWEFAIQ HNFWDTGFGA ILNYTVVNSD TGFDNTLPWT TTQFAVPGAS DSANAVLYYD
KNGLQARIAY NWRDRFLAGK DQDPYYIKSY GQFDASLSYE FKKGLTAFVE GINITNADRT
GVRRNDRAIF FAAPGYARYS AGIRFTF