Gene Sala_2744 details


Gene Information

Locus tag: Sala_2744
Symbol:
ID: 4080229
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 2892898
End bp: 2896083
Gene Length: 3186 bp
Protein Length: 1061 aa
Translation table: 11
GC content: 61%
IMG OID: 638011127
Product: putative outer membrane protein with a TonB box
Protein accession: YP_617782
Protein GI: 103488221
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG4771] Outer membrane receptor for ferrienterochelin and colicins
TIGRFAM ID:
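
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the usual 1-based, inclusive coordinate convention and assumes that the gene length includes the stop codon while the protein length does not; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2892898
end_bp = 2896083

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 3186 1061
```

Under these assumptions the result agrees with the reported Gene Length (3186 bp) and Protein Length (1061 aa).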


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.111846
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACTTC GTTATTCGAT CGCGGCGAGC CTGATGGCCA TCAGCGCCGC GACTGTCGTT 
GCCGCGCCGG TACAGGCGCA GGAAACCAGT TCTTCCGTTC GCGGCACCGT CGAGTCGGCC
AGCGGCCCGG TTTCGGGCGC CAGCGTCACC ATCACTCATG TTCCGTCGGG TACGGTCGCT
CGTTCAACGA CCGACGCATC GGGGAACTTC AGCGCCAACG GCCTTCGCGT CGGCGGCCCC
TTTACCGTCG AAGTGACCGC CGAAGACTAT GAGTCGGCGC AGGTCACCGA CCTCTTTCTC
CAGGCCGGTC AACCCTATCG CCTGCCAATC GTTCTCGAAG ATGCCGCGAT CGTGGTTACC
GCCTCGTCGG TCGGTGGCGC GCTCGAACAG TCGAACGGCC CGATTACGGC ACTGGGTCGC
GAGGCCATCG AAGGCGTCGC CTCGATCAAC CGCGATATCC GCGACCTTGC GCGCCGCGAT
CCGTTGGTCA CCATGGACCT GACCAACGCA CGCACGATCG AAGTGGCTGG CAACAATGGT
CGTCTCAACC GCTTCTCAGT TGACGGCGTG CAGTTCAGCG ACGACTTCGG TCTCAACAAC
GGTGGCCTGC CGACGTCGCG TGGCCCTGTG CCGTTCGACG CAATCGAACA ATTTTCGGTC
AAGGTTGCGC CCTTCGATAT CGCCGAGGGC GACTTTCAGG GCGGCGCAAT CAATGTGGTG
CTTCGCTCCG GCGGCAACAA GTTCCACGGC GGCGCTTTCT TCACCTATAC CGACGACAGC
CTGACCGGCA ACCGTACGCG CGGCCGCGAT ATCTCGTTGG ACTTCGACAG CAAACAATAT
GGCGCCAATA TCAGCGGGCC GATCATCAAG GACAAGCTGT TCTTCATGTT CGCTTATGAA
AAGACCAAGG AAACCGATCC TTTCGACGAC GGTTTCGGCC CCGGCTTTGC CAATCAGGTT
CCCGGCCTGA CACAGGCGGT GATCGACCAG GTCAGCAGCG TCGCACAATC GCGCTACGGA
TACGACACGC TCGGCCTGTC GCCCAACGCG GTCGAGGAAG ACGAGAAGAT TATCGGCAAG
CTCGACTGGA ATATCAGCGA CACCCAGCGT GCCTCGCTGA CCTATATTCG CAACGTGGGC
ACGCAGCAGT TCCAGCAGAA TACGTTCCTG ACGTCGCCCT TTGCGCTCGG TTTCCAGTCG
AACGGCTATG AGCTGGCCGA AGAAGTCAAT ACGGGCATCT TCGAGTTGAA CTCGACCTGG
TCGGACCAGT TCTCGACCAC CTTCCGCGCC TCGTATCGTG ATTATAACCG CGACCAGACC
CCGTTCGGAG GCCGCGACTT TCCTCAGATG GAAGTCTGCA CCGACGCGAC CTCGGCCGGG
TCGGCCACCA GCTGTAACGG CACGCGTCTG TTTTTCGGTC CCGATGTGAG CCGTCATTCG
AACGACCTGA ACACGGAAAA CCTGTCGTTC GACTTCACCG CGCGCCTCGA CGCCGGCGAT
CATTCGCTGC GCTTCATGGC CGGCTATACC GATGTGAGCG TGTTCAACCT GTTCCTGCAA
CGTTCGCTCG GTGACTTCTA TTTCGACAGC CTCGCCGATT TTGCGGCGGG CAACGCCAAT
CGCCTGCGCT ATGCGAACGC GGTGCCCAGC CTGGACCCGA ACGATGCCGC GGCCAGTTTC
GGCACGCAAA ACTACACGTT CGGCATTCAG GACGACTGGC AGGCTACCCC CGACCTGACG
CTGTCGTTCG GGATACGCTA TGACCTGTTC GGCAACGACG CCAAGCCGCC GCTCAACCCG
AATTTCCTCG CTCGTTACGG TTTTCACAAC CGCGAGACCT TCAACGGCCG CGATGTGATC
CAGCCACGCT TCGGCTTCAA CTGGCAGGCG ACCGACAAAC TGATCGTTCG CGGCGGTGTC
GGCATCTTCT CGGGCGGCAC GCCCGACGTT TTCCTGTCGA ACAGCTTCTC GAACACGGGC
CTCCTGACCA ATCTTGTCGA TATCAACCGC TCAAACTGCG CCGCTTCCGC AACCTGCGAC
GCGCTGAACG GAATCGACGA CGGGACGATC CCGGCAAGCG TCAACGACTT CCTGGCGCGC
AACACCGGGT CGCTTGCGCT GGCCCCGACC GACAATATCG ATCCCAAGCT GAAGATCGCG
CGCAAGTGGA AGGCTTCGCT GCAGGCCGAT TATGAAATCG GTGACGGCTG GTTCGTGGGC
GGGCAATTCC TGTACGACAA GAACATCTAT GGCTATACTT GGACAGATTT GCGCTCGGTG
CCGATCGGCA CGCTGCCCGA TGGCCGCACC CGCTATGGCC CGTTTGGCGG CGTCGCAACG
ACCAACCGCG ACCTGCAACT GACCAACAGC GAACGCGGCC GCGCCTTCTT CGCGACGGCA
CGTTTCTCGA AGGCGTTCGA CTTCGGCCTG ACACTCGATG GCAGCTACAC CTATTCGAAC
GTGAAGGACG AAGGGGCGCT GACCTCGTCG ACCGCCTCGT CGAATTACGG CAACAACGCC
TTCGTCGATC CCAACCGCGC CGCCTATGGC CGCTCGATCT ACGAATACAC CCACCAGTGG
AAGGGCGGCA TCGACTTCAA ACGCGAGTTC TTCGGCGACA ACGAAACGCG GATCAGCCTG
TTCGGCGAAT ATCGTTCGGG CCGTCCGTAC AGCGTCACCA TGCTCGACAA CAGCGGTGGT
CGCGGGGCTG TGTTCGGCAC AGTCGGCAAC CTCGGCAACA TGCTGCTATA CGTTCCGACG
GCCGGAGGCG ACCCGAGGGT TACGTTTGAT TCGGCCGCCA GCGAAGCTGC GTTCAACACG
CTGATTTCGG AACTCGGCCT CGAAAAATAT CGCGGTCGGA TAGTGAAGAA GAACAGCCAG
ACTTCGCCCG ACTTCTTCAA GGTCGATCTG CATGTCAGCC AGGAAATCCC TGCCTTCGTC
GGCGATGCGA AGTTCAAGCT GTTTGCCGAT GTTGAAAATG TGCTGAACCT GATCGACAGC
GATTGGGGTT CGCTTCGTCA GGTTTCCTTC CCGTATAATG CCGCAATCGT CCGCGTGGCC
TGCGCGGCGA CCAGCGGGAC CAACTGCACC CAGTATCAGT ACAGCAACGT GCGGGCACCC
AACCAAGTGC TCACCAGCCG CGTGTCGCTG TACGGCGTCC GCGTCGGCGT GCGGGTCAAC
TTCTAG
 
Protein sequence
MKLRYSIAAS LMAISAATVV AAPVQAQETS SSVRGTVESA SGPVSGASVT ITHVPSGTVA 
RSTTDASGNF SANGLRVGGP FTVEVTAEDY ESAQVTDLFL QAGQPYRLPI VLEDAAIVVT
ASSVGGALEQ SNGPITALGR EAIEGVASIN RDIRDLARRD PLVTMDLTNA RTIEVAGNNG
RLNRFSVDGV QFSDDFGLNN GGLPTSRGPV PFDAIEQFSV KVAPFDIAEG DFQGGAINVV
LRSGGNKFHG GAFFTYTDDS LTGNRTRGRD ISLDFDSKQY GANISGPIIK DKLFFMFAYE
KTKETDPFDD GFGPGFANQV PGLTQAVIDQ VSSVAQSRYG YDTLGLSPNA VEEDEKIIGK
LDWNISDTQR ASLTYIRNVG TQQFQQNTFL TSPFALGFQS NGYELAEEVN TGIFELNSTW
SDQFSTTFRA SYRDYNRDQT PFGGRDFPQM EVCTDATSAG SATSCNGTRL FFGPDVSRHS
NDLNTENLSF DFTARLDAGD HSLRFMAGYT DVSVFNLFLQ RSLGDFYFDS LADFAAGNAN
RLRYANAVPS LDPNDAAASF GTQNYTFGIQ DDWQATPDLT LSFGIRYDLF GNDAKPPLNP
NFLARYGFHN RETFNGRDVI QPRFGFNWQA TDKLIVRGGV GIFSGGTPDV FLSNSFSNTG
LLTNLVDINR SNCAASATCD ALNGIDDGTI PASVNDFLAR NTGSLALAPT DNIDPKLKIA
RKWKASLQAD YEIGDGWFVG GQFLYDKNIY GYTWTDLRSV PIGTLPDGRT RYGPFGGVAT
TNRDLQLTNS ERGRAFFATA RFSKAFDFGL TLDGSYTYSN VKDEGALTSS TASSNYGNNA
FVDPNRAAYG RSIYEYTHQW KGGIDFKREF FGDNETRISL FGEYRSGRPY SVTMLDNSGG
RGAVFGTVGN LGNMLLYVPT AGGDPRVTFD SAASEAAFNT LISELGLEKY RGRIVKKNSQ
TSPDFFKVDL HVSQEIPAFV GDAKFKLFAD VENVLNLIDS DWGSLRQVSF PYNAAIVRVA
CAATSGTNCT QYQYSNVRAP NQVLTSRVSL YGVRVGVRVN F
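
The gene and protein sequences can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not the database's own pipeline: the helper names `clean` and `translate` are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ only in permitted start codons). Only the first line of each sequence is used here for brevity.

```python
# Codon table in NCBI order (bases T, C, A, G). Translation table 11 uses the
# same amino-acid assignments as the standard code; only start codons differ.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group residues in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and the matching start of the protein
# sequence, both copied from the record.
gene_start = "ATGAAACTTC GTTATTCGAT CGCGGCGAGC CTGATGGCCA TCAGCGCCGC GACTGTCGTT"
protein_start = "MKLRYSIAAS LMAISAATVV"

print(translate(gene_start) == clean(protein_start))  # True
```

Passing the full gene sequence to `translate` in the same way allows the complete protein sequence to be compared.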