Gene Sala_3168 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_3168 
Symbol 
ID4082504 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp3318722 
End bp3321109 
Gene Length2388 bp 
Protein Length795 aa 
Translation table11 
GC content61% 
IMG OID638011553 
ProductTonB-dependent receptor 
Protein accessionYP_618204 
Protein GI103488643 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.43685 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.501895 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGAATA TGTCGAAAGT GCTGGTGCGG CGCAGCGCGA TGCTGCTCGG CGGCACCATC 
CTGGCCACGT CGGGCACGGC GTTGGCGCAG CAAGCAAATA CAGAGGGCGA CAATGACATC
GTCGTCACCG CGTCGAAGCG CGAGGAAAAT CTGCAGGACG TGCCGCTGGC GATCACCGCG
ATCGGCAACG AACGGCTCAG CGAATTGCAG GTCAAGGAGT TTCAGGACGT CGTCAAGTTC
CTGCCCTCCG TGACAATCCA GACCGCGGCG CCGGGGTTCA GCCAGGTTTA TTTCCGCGGT
GTGGCGTCGG GCGAGAATGC CAACCACTCG GCGTCGCTGC CGACGGTCGG CACCTATCTG
GACGAAATGC CGATCACGAC GATCCAGGGC GCGCTCGACA TCCATGCCTA TGACCTGGCG
CGCGTCGAGG CGCTCGCGGG GCCGCAGGGT ACGCTCTACG GCGCCAGCTC GATGGCGGGC
ACGATCAAGC TCGTCACCAA CCGGCCCGAC CCCAGCGGCA CCTACGGCTC GGTGGGGCTC
GAACTCAACA GCGTCGCGCA CGGCGACGTC GGCGGCGTCG CCGAAGGCTT CGTCAACGCC
CCGCTCGGCG AACGCGCGGC GCTGCGCCTC GTCGGCTGGT ATCGCCACGA TGCGGGCTAT
ATCGACAATA TCGCGGGCAG CCGCACCTAT CCGACCAGCG ACATCACCCA GGACAATGCC
GCGCTGGTCG AAAAGAATTA CAACGACGTC GATACCTATG GCGCGCGGCT TGCACTCGGC
ATCGACCTTG ACGACGATTG GACGATCCGC CCCACGCTGA TGGGGCAGGT GCAGAAAACC
AATGGCAGCT TTGCCCAGGA GCGGTCGACT GCGGTCAGCG ACGACCTTCA GACCGTGCAA
TATAATCCCG AAAACTCGAA GGACCGGTGG ATTCAGGCCG CGCTGACGAT CGAAGGCAAG
CTGGGCAATT GGGATCTGAC CGTCACGGGC GGCCACCTGC GCCGGAAAAC CGAGGTCGAA
AGCGACTATT CGGATTATGC CTATTTCTAT GATGCGCTCT ACGGCTATGG CGCCTATTTC
TATGATAACA ACGGCGACCT TATCAGCCCG AATCAATATA TCCAGGGCAT CGACCGGTAC
AAAAAGAGCT TTGTCGAGGC GCGCGTCGCC TCCCCGGCCG ACGCGCGCAT CCGCTTTATC
GGCGGACTGT TCTGGCAGCG TCAGTCGCAC AATATCGAAC AGAATTACAT TATCGACAAC
CTGACGGACC TGTTCACGGT CACGGGCACT GACAGCAATA TCTGGCTGAC CAAGCAATTG
CGTATCGACC GCGACTATGC GGCGTTCGGC GAGATCAGCT TCGACATCAC CGACAAATTG
ACGCTGACCG GCGGCGGCCG CGTGTACAAG TTCGACAACA GCCTGGTCGG CTTCTTCGGC
TATAATGCCA ATTTCTCCAG CCGAACCGGC GAAGCCGCCT GTTTTGCCGG TCCAATCATT
TCCGGGACGC CATGCACCAA TCTCGACAAG CGGACGAAGG ACAGCGACTT CATCCACAAG
CTCAATCTGA CCTACAAGTT CAGCGATGAC GCGCTTGTCT ATGCCACCTG GTCGCGCGGT
TTCCGCCCCG GCGGCATCAA CCGCCGCGGA ACGCTGCCGC CCTATGGCCC CGACACGCTC
GACAATTATG AGTTCGGGTG GAAAACGAAC TGGGGACCGG TGCGTTTCAA TGGCGCCATT
TATCAGGAAG ACTGGACCGA CATCCAGCTT TCCTTCCTCG GTCTCAACGG CCTCACCGAA
ATCCGCAACG CCGGGGTCGC GCGCATCCGG GGGATCGAAA TCGACCTCGG CTATCGCCGG
AACGGTTTTT CGATCAACGC GGGCATGAGC TATAATGACG CCGAAATCCG CCGCGATTTC
TGCCGCATCG CCAATGACAG CTTCGACTGC ACCCTGCCCG GATCGGACGG CGCGGACAAT
GCGCTCCTCG CGCCAAAGGG CACCAGCCTG CCGGTGACGC CCAAGTTCAA GGGCAATGTC
GTCGCGCGTT ACGAGTTTCC GGTCGGCGGC ATGGACGCGC ATGTGCAGTT TGCTGTGAAC
CATATCGGCA AGCGGCGCAG CGACCTCAGA ACCTTTGAGA ACAGCCTGAA GGGCTTTTTT
GATGCCTATA CCACCGCCGA CCTCAGCGTC GGCGTCAAGG GCGACGACTG GAAAGCGGAA
CTGTTCGCGA CCAACTTGTT CGACGAAAAT GGCGTCATCA ACTCGGGCGT CCAGTGTCTG
GAGACGACAT GCGGCGATCC CGACGGCATC AGCAGCACGG GCGGCGTCTT CTACGACACG
GTCATCCGCC CGCGGCTGAT CGGAATCAAG GTGAGCAAGG ACTTCTGA
 
Protein sequence
MRNMSKVLVR RSAMLLGGTI LATSGTALAQ QANTEGDNDI VVTASKREEN LQDVPLAITA 
IGNERLSELQ VKEFQDVVKF LPSVTIQTAA PGFSQVYFRG VASGENANHS ASLPTVGTYL
DEMPITTIQG ALDIHAYDLA RVEALAGPQG TLYGASSMAG TIKLVTNRPD PSGTYGSVGL
ELNSVAHGDV GGVAEGFVNA PLGERAALRL VGWYRHDAGY IDNIAGSRTY PTSDITQDNA
ALVEKNYNDV DTYGARLALG IDLDDDWTIR PTLMGQVQKT NGSFAQERST AVSDDLQTVQ
YNPENSKDRW IQAALTIEGK LGNWDLTVTG GHLRRKTEVE SDYSDYAYFY DALYGYGAYF
YDNNGDLISP NQYIQGIDRY KKSFVEARVA SPADARIRFI GGLFWQRQSH NIEQNYIIDN
LTDLFTVTGT DSNIWLTKQL RIDRDYAAFG EISFDITDKL TLTGGGRVYK FDNSLVGFFG
YNANFSSRTG EAACFAGPII SGTPCTNLDK RTKDSDFIHK LNLTYKFSDD ALVYATWSRG
FRPGGINRRG TLPPYGPDTL DNYEFGWKTN WGPVRFNGAI YQEDWTDIQL SFLGLNGLTE
IRNAGVARIR GIEIDLGYRR NGFSINAGMS YNDAEIRRDF CRIANDSFDC TLPGSDGADN
ALLAPKGTSL PVTPKFKGNV VARYEFPVGG MDAHVQFAVN HIGKRRSDLR TFENSLKGFF
DAYTTADLSV GVKGDDWKAE LFATNLFDEN GVINSGVQCL ETTCGDPDGI SSTGGVFYDT
VIRPRLIGIK VSKDF