Gene Sala_1109 details

Gene Information

Locus tag: Sala_1109
Symbol:
ID: 4082047
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 1141695
End bp: 1144910
Gene Length: 3216 bp
Protein Length: 1071 aa
Translation table: 11
GC content: 66%
IMG OID: 638009471
Product: autotransporter beta-domain-containing protein
Protein accession: YP_616159
Protein GI: 103486598
COG category:
COG ID:
TIGRFAM ID:
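
The coordinate and length fields above are related by simple arithmetic. The following minimal sketch shows one way to cross-check them. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1141695
END_BP = 1144910
GENE_LENGTH_BP = 3216
PROTEIN_LENGTH_AA = 1071


def span_length(start, end):
    """Length of an inclusive, 1-based coordinate range (assumed convention)."""
    return end - start + 1


def coding_length_aa(gene_length_bp):
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the final codon is assumed to be a stop codon


# Both comparisons hold for the values in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

A mismatch in either check would usually point to a different coordinate convention or to a spliced or partial gene, neither of which applies to this entry.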


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCAAGA CCCTGCTGGC CTCGACCTGC CTGGCCACCC TTCTCTCGAC CGCTGTTCAT 
GCCGAAACGA CGATCAGCAC TGCCACGACC GCGCCTGTGC GGACGTCGAC AATCAAGTCG
GGCGCGCCCG ACGACATCAA GATCACCTCC GCCGGGTCGA TCAAGCCGAC CGTCGCAGGA
CCCGCAGTCA CGATCGACAG CAATCACAAG GTCGTCAACG ACGGCACGAT CGAGTTCAGC
AATATCGATG GCGCAACAGG CATTCTTGCC GGCGCAGGGA CCACGGGCGG CATCACCAAC
AGCGCGTCCG GCAAGATCAC GATCGGCGAA ACCTACGCCG CGACCGACAT CGACAATGAC
GGCGACCTCG ACGGCCCGTT CGCGATCGGC ACAGGCCGCG TCGGCATCGC GACCGCTGGC
GCGTTTTCGG GCAACATCGT CAATTCGGGT GCGATCGCGA TCGAAGGCAA TGATTCGGCG
GGCATAAGGC TCGGCGGCCC GCTGACGGGC AATTTCACCA ACGATGGCAC GATCAGCGTG
CTCGGCGACA GGGCGCTCGG CGTCGGGCTT CAGGACGTCA CGGGCAATAT CCGTCTTGCC
GGAACGATTG CCGCCACCGG TGTCGATGCG ATCGCCGCGC GGCTCGGCGG CGACATCACG
GGCGCGCTCG TCGTGCAGGG CAATCTCACG GCAACGGGCT ATCGCTTCAC CGCGCCCCCG
GCCGACCCCT CGAAGCTCGA TGCCGACGAC CTGCTGATCG GCGGTCCCGC GCTGGCGGTC
GAAGGCGATG TGACGGGCGG CATCATTTTC GCGGTCCCTC CGAAGGATTC GAGCACAACC
GACAAGGATG AGGACGATGA CGGCATCGAC GACGACAAGG AAGGGTCGGC GGTCGTCCGC
TCCTTTGGAT CGGCGGCGGC CATCCGGATC GGTTCGGCCG ACCGCGATGT CGCGATCGGC
GCGCTCGCGG GCACCGGAAC CGGCCTCGGC GTCATCATCG ACGGCTCGGT GCTCGGCAAC
GGGCTTTACG CCGGCAAGGA TGGCAATGCG ATCCGGATCG GCGGGCTTGG CGGCGCCGTG
ACGATCGCTG GCGGCATCGG CATCGGCGCC ACCGGCAGCG TTGGGGCGCA GTCCAAAGAT
GCGGCGGCGA CGGCAATCCG CTTCGGCAGC GGTGCCTCGA CCCCCGAACT GCGTAACGCG
GGCAAGGTCG AGGCGTCAAC CGGCGGCAAT GCGTCGGGGG CAGTAGCGAC CGCGGTGATC
GTGGAAGCGG GCGCCGACGT CGCGCTGATC CGCAACAGCG GCACGATCGC GGCCAAGACC
GGCGGCGACA ATGGCACGGC GCGCGCGATC GTCGACCTGT CGGGCAATGT CGATATGGTC
GAGAATAGCG GCGCGATCAG CGCGAGCGGC GCGCTCGTTT CGTCGGATCG CAACATCGCG
ATCGACCTGT CGGCCAACAA CAATGGGGCA ACGATCAAGC AGACCGCCGT GGCCGCGGGT
ATCACCGCGC CGAGCATCGT CGGCGACATC CGTTTCGGCG GCGGCAACGA CGTGTTCGAC
ATCGCGGATG GTTCGGTGAA GGGCAACAGC ATCTTCGGCG CCGGCAACAA CAGGCTCGCG
CTGTCGGGCG ACGCGACCTA TGCCGGCAAC GCCCGCTTCG GCGCCGGCAA CGACACGATG
GCGCTGGCCG GAACCTCCAA GTTCACCGGG CTCGCCGACT TCGGCGGCGG CGCCGATGCG
CTAACGATCG GCGGCACTTC GGTCTTTACA GGCACTCTTG CCAATTCGTC GGGACTGGCC
GTGTCGGTCA ATGGTGGCAC CTTCGACGTG CGGGGCGCCG CGACGATCGC ATCGCTTGCC
GTAACCGACA AGGGCGTCCT TGGCGTCATG CTCGACACAG GCAGCACGGG CACCGCGCTG
CAAGTGACGG GGAATGCCAG CTTCGGCGCG GAATCGAAGC TCGCGCTTCA ATTGTCGAGC
ATCGAAGAGG CCGAGGGCGA GCACGTCGTG CTCACCGCCG GATCGATCAC TGGCGCCAAC
AATCTGACCG CCTCGCAGAC ACTTCTTCCC TTTCTTTACA AGGGCACACT CACCTCGACT
GCCACCCGGC TGATCGTCGA TGTCGCGCGC AAGAGCACGA GCGAGCTCGG TCTCAACCGC
TCGGAAGCCG GCGCTTTCGA CGCTGTGCTC GATGCCGTCG TCGCCGAACA GAAGATCGAG
GACGTGTTCC TCGGCATCAC CGATGGCGAT CAGTTCCGCA GCCAGCTTCA GCAGATGCTG
CCGGAACACG AAGGCGGAGT CTTTGAAACC GTCACCTCGG GCTCGCGCGC GCTCGCACGC
CACCTCCTCG ACCCCAATGC GCCGTATCAG GACGAAGGCA AATGGGGCTA TTGGGTGAAC
CAGGCCGTCT GGGGCACGTC GAAGGGAATC GGCAACACCG CGAGCTATGA CGTCAGCGGC
TGGGGCATCT CGCTCGGCGC GGAGATCGAG AGCGACGTCG GCAATTTCGG CGGCTCGATC
GCCTTCCTCA GCGGCAAGGA CAGCAACGGC AGCAACGCCA ATGAAGTCAG CACGAGCCAG
TTCGAAGGTG CGCTGCACTG GCGCCTGCGC TCGGACGGTT TCATGGCCAA CGCCCGCGTG
TCGGGCGCGC CGGTAAAGCT GAAGGGCACA CGCATCTTCC GCGCCGAAGC GGGCGCCGAG
GACATTGAAG AGACGATGAA GGGCAAATGG GACGCGACCC TGTGGTCGGC GTCGGGCTCG
GTTGCCTATG ACACGCGCCT CGGCGGGCTG ACGCTGCGTC CGATGGTCGC GGTGGATTAT
TACAAGCTCC AGGAAGACGG TTATCAGGAA ACGGGCGGCG GCGATGCGCT CGACCTCACC
GTGCTCGATC GCGACAGCGA CGAACTCGCG GTAACGGGCA CGGTAACGCT CGGCCTCGAG
TTCGGCGGCG CCGACGAATA TGACGGCTGG ACGCGCTTCG AACTCGAAGG CGGGCGCCGG
CAGATCGTCA GCGGCACGCT GGGCGCGACG ACGGCTTCGT TCAAGGACGG CACGCCCTTC
ACCCTGATCC CCGATGATCG CACGAGCGGC TGGGTCGGCC GCCTCCGCGG CATCGCCGGC
AATTCGGCCT TTCAGGTCGC CGGCGAAGTG TCCGCGGAAG AACAGCAAAG CCATGTCGGC
TGGGCGTTCC GTGCGAGCCT GCGGGTCGGT CTCTAG
 
Protein sequence
MRKTLLASTC LATLLSTAVH AETTISTATT APVRTSTIKS GAPDDIKITS AGSIKPTVAG 
PAVTIDSNHK VVNDGTIEFS NIDGATGILA GAGTTGGITN SASGKITIGE TYAATDIDND
GDLDGPFAIG TGRVGIATAG AFSGNIVNSG AIAIEGNDSA GIRLGGPLTG NFTNDGTISV
LGDRALGVGL QDVTGNIRLA GTIAATGVDA IAARLGGDIT GALVVQGNLT ATGYRFTAPP
ADPSKLDADD LLIGGPALAV EGDVTGGIIF AVPPKDSSTT DKDEDDDGID DDKEGSAVVR
SFGSAAAIRI GSADRDVAIG ALAGTGTGLG VIIDGSVLGN GLYAGKDGNA IRIGGLGGAV
TIAGGIGIGA TGSVGAQSKD AAATAIRFGS GASTPELRNA GKVEASTGGN ASGAVATAVI
VEAGADVALI RNSGTIAAKT GGDNGTARAI VDLSGNVDMV ENSGAISASG ALVSSDRNIA
IDLSANNNGA TIKQTAVAAG ITAPSIVGDI RFGGGNDVFD IADGSVKGNS IFGAGNNRLA
LSGDATYAGN ARFGAGNDTM ALAGTSKFTG LADFGGGADA LTIGGTSVFT GTLANSSGLA
VSVNGGTFDV RGAATIASLA VTDKGVLGVM LDTGSTGTAL QVTGNASFGA ESKLALQLSS
IEEAEGEHVV LTAGSITGAN NLTASQTLLP FLYKGTLTST ATRLIVDVAR KSTSELGLNR
SEAGAFDAVL DAVVAEQKIE DVFLGITDGD QFRSQLQQML PEHEGGVFET VTSGSRALAR
HLLDPNAPYQ DEGKWGYWVN QAVWGTSKGI GNTASYDVSG WGISLGAEIE SDVGNFGGSI
AFLSGKDSNG SNANEVSTSQ FEGALHWRLR SDGFMANARV SGAPVKLKGT RIFRAEAGAE
DIEETMKGKW DATLWSASGS VAYDTRLGGL TLRPMVAVDY YKLQEDGYQE TGGGDALDLT
VLDRDSDELA VTGTVTLGLE FGGADEYDGW TRFELEGGRR QIVSGTLGAT TASFKDGTPF
TLIPDDRTSG WVGRLRGIAG NSAFQVAGEV SAEEQQSHVG WAFRASLRVG L
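
Both sequences above are printed in space-separated blocks of ten characters. The sketch below strips that grouping, computes a GC fraction, and translates DNA using the standard amino-acid assignments, which translation table 11 shares with table 1 (the two differ only in permitted start codons). The helper names are hypothetical. Only the first row of each sequence is embedded, so the check covers the first 20 residues rather than the whole gene.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout of genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a DNA sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(dna):
    """Translate complete codons until the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence and the first two blocks of the protein sequence.
first_dna_row = "ATGCGCAAGA CCCTGCTGGC CTCGACCTGC CTGGCCACCC TTCTCTCGAC CGCTGTTCAT"
protein_start = "MRKTLLASTC LATLLSTAVH"

print(translate(first_dna_row) == clean(protein_start))
```

Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field, and passing it to `translate` would give the full protein to compare with the listing.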