Gene Sala_2642 details

Gene Information

Locus tag: Sala_2642
Symbol:
ID: 4081772
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 2784862
End bp: 2787996
Gene Length: 3135 bp
Protein Length: 1044 aa
Translation table: 11
GC content: 64%
IMG OID: 638011018
Product: type III restriction enzyme, res subunit
Protein accession: YP_617680
Protein GI: 103488119
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG0553] Superfamily II DNA/RNA helicases, SNF2 family
TIGRFAM ID:
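
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon; the function names are hypothetical and not part of any database tool.

```python
# Values copied from the Gene Information fields above.
START_BP = 2784862
END_BP = 2787996
GENE_LENGTH_BP = 3135
PROTEIN_LENGTH_AA = 1044


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold for this record: the span covers 3135 bp, and 3135 bp corresponds to 1045 codons, one of which is the stop codon.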


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCGTCG AATTGCCCGG GCAAAAGGGC GTCGGGAAGC TGCAATCTGC GGCCGAGGGC 
CGCTGCGTCG TCTCGATCTT CCACTCGATC CAGCGCAGCG AAGATATCGA GTGCGAGGTG
AGCGCGGTCG TCCGCGCCTA TTTGAGCCCG CAGACCCGGG CCTATGTTCT CGACGACGGG
CGTATGCGGG TCGGTCGCAT CAGCGACTAT CTTCAGCAGG AAAATGGCCT TGTCACTTAT
GAGGTCCGCT TCCCCAACGG GAAGCAAAAA GACTTCAGCG AGATCGACCT ATTCGTCCGG
CCGTGGAATG CGCCCGAGGA TCCGGCCGAA ATGCTCGCGG CCGGCGCCGC CGAAAGCCAG
TATCTGCACG ACCGCCGGCA AGCCGCGCTG ACGCCGCTGC GATCGCTGAC CAGTGCGGCT
CAAGGTCTGA CGTCGCTGCT CTCGGCCGGG ATCGACTTCG TGCCGCACCA AGTCGCAGCG
GTGCGCCGCG TGCTCAGTGA TCCGATCCAG CGCTATCTCC TCGCCGACGA GGTCGGGCTG
GGCAAGACAA TCGAGGCCGG GTTGATCATC CGCCAGCATC TGATCGACAA TCCCGACACC
GAGGTACTGA TCGCGACCCC GTCGGCGCTG TGCGAGCAGT GGCGGCGCGA ACTTTCCGAA
AAGCTCCGCC TCGACCAGTT CGGCGAGCCC TTCGAATGTT GCTCGCATGC CGAAATCGAG
CGGGTCGCGC GAACACCCGA TGTTCTGGTG GTCGACGAAG CGCATCATCT GGTTGGACTC
GAGGACGGCC CGCTTTCAGC ATCGGCGGCT CGGTTGCGCG AGCTCGCCCG AGACGTGCCG
GTGTTGTTGC TGCTGTCCGC AACGCCGCCG CTTGGCGAGG AGGCTCGGTT CCTGGCACTG
CTCAATTTGC TCGATCCGCT GACCCATCCG CTCGACGATC TTGACGGTTT TCGCCTGAAG
CTCGAACAGC GCCGGGCGAT CGGCCGCTTG CTCCTTAGTC TCGACCCGGA CGCATCGGGC
ATTGTGCTGC GCCAGCGCGG CGCTGAACTA GTCCGTAGCT TTCCCGACGA TCCTGTAGTC
CAAGAGTTGG CGCCGCAGTT GATTGAGGCA ACGCGAGAGG CACCCGAACG ACTCGTCGAC
CTATGCGGCG CGCTCAAGGA GCATATCGCC GACAGCTATC GCATCCACCA GCGTCTGATC
CGCTCGCGTC GCGCCGATGC CCAGGGATGG GAGTTTCGAC CACGCGGACC TGATGGCGGC
GCCATGACCC ATGTGCGCAT CGAAGGCGAT CCGAGCGACA ACCTGGCGGT GCTGCTTGGC
ACGCTTGAAG ACTGGCGGTC GGCTGCGGTC GACTCGCTAG TCGATGGCGA TCCGGGGGTG
CCGCTTCTCG CCGCGCGCTA CCGCAATCTT CTCGGCGCGG TGTCCGAAGG CGCTGACGCT
CTTCGCGCCT GGCTGACTGG TGTCATGCCG ATCTTCGCCG GCGAGGCGGA AATCCTCGAC
GCGCTGCGCG AGCAAGCCCA GGACTACGAC GATGCCGACC GCATCGAAAC GATGGTCGAA
AGCGTGCGGA GGCTAATCAA GACGCTGCGG GCCGACGTCG ATCAGCCCAA GATCGTTGTG
TTCGCGACGG CTGAGGCGCT GGCAACCGCG TTCCATCGAG CGCTGGAGGA GGCGCTCGGC
GACACACCGT GCTTTCTACT CGTCTCGGGA GGTGACGCGG GCAATGACAA GGAGGACGGC
GTCGCGGCGT TCGCGCAAGC GCCCGATGCT GCGGTTCTCA TCGGCGACCG GACGGCCGAA
GAAGGTCTCA ATCTCAGTTT CTCCGATGCG ATCGTGCATC TCGACCTGCC GCTGAGCGCT
GCCCGGATCG AGCAGCGCAT CGGCCGCCTC GACCGCTATG GCCGGCGACA GGGGATCATC
CGTCACCGGA TACTCTTGCC TTCGGACGAA GACGAAAGCC CCTTTGCAGC CTGGCAGGCG
TTGCTCGCCG ACGGCCTGTC GATTTTCCAC CGCTCGATCA GCGATGTGCA ATTCCTGCTG
GATGATTTCG AGACCCGCGC GCTCGACACC CTGCTGATGA CCGGGCCCAA TGCTCTAATC
GCGCTGGCCG AGGAGATCAA AGGCCAGATC GCCGACGAAC GAAAGTCGCA GGACGAACAA
TATGCCCTCG ACCGGATCGC GCTTGCAGAG GAACCGGTCG AAACATTCAT CCAGGCTCTG
GATGACGCCG AAGCTGATGA GGCGGCGCTT GAGGCTGGCG TCGATCAATG GCTGATCGAC
ACACTGCAAC TCAAGAAGCG GCCCTACGCC TGGCCCGAGG AAGATCCGTT CAAGCTCGGT
ATAACCAAGC AGACCTTGAT CCCGCGCTTG CCGTGGCAAC AGCAATTGGA ACTCGACGAC
AGCCAGCCGC TGAGCTGGAA GCGCCGGATC GCGACACGCC GGACCGCCGT GACCTTGCTG
CGCCCGGGTA CGCCGCACAT TGACGTGCTG GCGCGCTTCA CGCGCTGGGA CGATCGCGGA
ACCGCGTTCG TCACTTATCG GCCGGTCGCG GACTGGCTTG GCGACGCCTG GATCGGATTC
AAGCTCTGCT TCACGATCGA GCCCAGTCTC GACATCGCCG ACCTGCTCGC GCCGTCGCGC
GCCGAACTTG CCACGTTACG CCGCGCGCAG CGCTATTTTG CGCAAAGCGA GCAGACCCTG
TTCATCGACA TCAATGGTGA AATGGTCATC GATCCTGCGC TGCTCGCAAT CCTGTCCAAG
CCGTATAACG GGCATGGCAA AGGCCTGAGC GCCGACATCA ATCTCGGAAG CCGGCCGCAT
ATCCTCGCCG ATTATATTGA CCCCGCTGTG TTTCCGGCAG TGTGCCGTCG CGTGCGGAAC
GGCGCGCGGC AAACCTTGTC TGAGCAGCCC GCCGTGCTCG ATGCGATCAC CGCGGCGACG
ACACTGGCGG CAAGCGATCT ACAGCGGCGC CGAAACCGGC TCCAGCGCCG CCAATCGGCC
GGCGACTCGA TGGCACGCGA GGATATCGCT CTGATCGAGT CAATCCTGCC CAGCATCGCA
TCACCCGCCA TCCGGCTCGA CGCCATGGGA TGTTTCATCC TGGGTGGGGC GCTCACGAGG
TCCGCTCATG GTTGA
 
Protein sequence
MLVELPGQKG VGKLQSAAEG RCVVSIFHSI QRSEDIECEV SAVVRAYLSP QTRAYVLDDG 
RMRVGRISDY LQQENGLVTY EVRFPNGKQK DFSEIDLFVR PWNAPEDPAE MLAAGAAESQ
YLHDRRQAAL TPLRSLTSAA QGLTSLLSAG IDFVPHQVAA VRRVLSDPIQ RYLLADEVGL
GKTIEAGLII RQHLIDNPDT EVLIATPSAL CEQWRRELSE KLRLDQFGEP FECCSHAEIE
RVARTPDVLV VDEAHHLVGL EDGPLSASAA RLRELARDVP VLLLLSATPP LGEEARFLAL
LNLLDPLTHP LDDLDGFRLK LEQRRAIGRL LLSLDPDASG IVLRQRGAEL VRSFPDDPVV
QELAPQLIEA TREAPERLVD LCGALKEHIA DSYRIHQRLI RSRRADAQGW EFRPRGPDGG
AMTHVRIEGD PSDNLAVLLG TLEDWRSAAV DSLVDGDPGV PLLAARYRNL LGAVSEGADA
LRAWLTGVMP IFAGEAEILD ALREQAQDYD DADRIETMVE SVRRLIKTLR ADVDQPKIVV
FATAEALATA FHRALEEALG DTPCFLLVSG GDAGNDKEDG VAAFAQAPDA AVLIGDRTAE
EGLNLSFSDA IVHLDLPLSA ARIEQRIGRL DRYGRRQGII RHRILLPSDE DESPFAAWQA
LLADGLSIFH RSISDVQFLL DDFETRALDT LLMTGPNALI ALAEEIKGQI ADERKSQDEQ
YALDRIALAE EPVETFIQAL DDAEADEAAL EAGVDQWLID TLQLKKRPYA WPEEDPFKLG
ITKQTLIPRL PWQQQLELDD SQPLSWKRRI ATRRTAVTLL RPGTPHIDVL ARFTRWDDRG
TAFVTYRPVA DWLGDAWIGF KLCFTIEPSL DIADLLAPSR AELATLRRAQ RYFAQSEQTL
FIDINGEMVI DPALLAILSK PYNGHGKGLS ADINLGSRPH ILADYIDPAV FPAVCRRVRN
GARQTLSEQP AVLDAITAAT TLAASDLQRR RNRLQRRQSA GDSMAREDIA LIESILPSIA
SPAIRLDAMG CFILGGALTR SAHG
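
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of how a pasted sequence could be cleaned and its GC fraction computed for comparison with the GC content field; the function names `clean_sequence` and `gc_fraction` are hypothetical helpers, not part of any library.

```python
def clean_sequence(blocked: str) -> str:
    """Join a blocked sequence by dropping all whitespace and upper-casing."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


if __name__ == "__main__":
    # First line of the gene sequence above, used only as sample input.
    sample = "ATGCTCGTCG AATTGCCCGG GCAAAAGGGC GTCGGGAAGC TGCAATCTGC GGCCGAGGGC"
    dna = clean_sequence(sample)
    print(len(dna), round(gc_fraction(dna) * 100))
```

Passing the full gene sequence through `clean_sequence` should give a string whose length matches the listed gene length, and the rounded GC percentage can then be compared with the listed value.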