Gene Sala_3014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_3014 
Symbol 
ID4082846 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp3155863 
End bp3157629 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content72% 
IMG OID638011401 
Productsingle-stranded-DNA-specific exonuclease RecJ 
Protein accessionYP_618052 
Protein GI103488491 
COG category[L] Replication, recombination and repair 
COG ID[COG0608] Single-stranded DNA-specific exonuclease 
TIGRFAM ID[TIGR00644] single-stranded-DNA-specific exonuclease RecJ 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.858016 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAGG CGGCACTGGG AATCACCTGC TCGATCGCGG GCCAGTCGTG GCGCTGGCGG 
CGTGCGAGCG CCGACATGGC GGCGGAAAAT CTCGCGCCCG ACGACCTTGT CACGCAATTG
CTGCTCGCGC GCGGGGTGGC GCGCGACGAT CTCGATCGCC AGCGCACCCC GACCCTGCGC
GGCTTCATGC CCGACCCGTC GCTGTTCCGC GACATGGATG CCGCCGCGGC GCGGCTCGCC
GATGCCGTCG AACGCCGGGA GGCGGTGACG ATCTTCGGCG ATTATGACGT CGACGGGGCG
ACATCGGCCG CGCTGCTCGT CCGCCTGCTG CGGGCGCTCG GCACAACTGT CGGCGCCTAT
ATCCCCGACC GTCTGATGGA AGGCTATGGC CCGTCGGGCG CCGCGCTGGT GAAGATCGGC
GAGGCGGGGT CGAAACTTAT CGTCACCGTC GATTGCGGCG CACAGGCGTT CGAGGCGATC
GCCGAAGCCA ATGCCGCCGG CGTCGAGGTG ATCGTCGTCG ACCATCACCA ATGCGCGACC
AGCCTGCCCG CCGCGCTTGC GCTGGTGAAC CCCAACCGGC TCGACGAGTC GCCCGACGCA
GCGGTCCACG GCAATCTCGC CGCGGTCGGC GTCGCCTTCC TGCTCGGCGC GGCGCTGCTT
CGCACATTGC GCGCGCGCGG CTTTTTTGCG GGCCGCGAGG AACCCGCGCT GATCGAGCTT
CTGGACCTTG TCGCACTCGG CACCGTCGCC GACGTCGCGC GGCTCACCGG CTTCAATCGC
GCGCTGGTGA CGCAGGGGCT GAAAGTGATG GCGCGGCGCG GCAATATCGG CCTCGCCGCG
CTGATGGACG CGGCGCGGCT GACCAGGCCG CCGGGCGCAA GCGACATGGG CTTCGCGCTC
GGTCCGCGCA TCAACGCGGG CGGGCGCGTC GGCAAGTCGG ACCTCGGCGT GCGCCTGCTC
ACGACCGACG ATCCGCAGGA AGCCGCCGAT ATCGCGCAGC AACTGTGCCG CCTCAACGAG
GAGCGCCGCA CGATCGAGGC GGCGGTGCTC GACGAGGCGC TGGCGGCGAG CGCCGCGTGC
GGCAACGCGC CCGTCGCGAT TGTCGCCGGC GAAGGCTGGC ACCCCGGCGT GATCGGCATC
GTCGCCGGGC GGCTCAAGGA ACGGTTGCAC CGCCCCGCGA TCGTGATCGC GGTGGACGCG
GACGGTATCG GCAAGGGATC GGGGCGCTCG ATTTCGGGCG TCGACCTGGG CGCAGCCATT
CTCGCCGCCA AGGAAACGGG CTTGCTCGTC GCCGGCGGCG GCCATGCGAT GGCGGCGGGA
CTCACGGTGG CCGCCGACCG GGTCGATGCG CTTGGCGCCT TTTTGAGCGA CCGCCTCGCC
GCCGATGTCG AGCGCGCGAG CGGCGAGCGT GCGCTGCTGA TCGACGCCGT GCTCGCGCCG
CGCGGGATCT CGCCGCTCTG GTGCGACGCG ATCGAAAGCG CTGGCCCCTA TGGCGCCGGT
TGGCCCGCGC CGCGCGTCGC GACGGGGCCG GTGCGAATCG TCGAATCGGG GATCGTCGGC
ACCGATCATG TCCGCCTGAT CGTCGCGGGC GACGATGGCG CGCGGTTCAA GGCGGTCGCC
TTTCGCAGCG CCGAAACGGT GCTGGGCCAG ACCCTGCTCG GCGCGCGGGG ACGCAAGCTG
TGGCTCGCGG GCCGGGCAAA ACGCGACGAC TGGGGCAGCC GCCCCGCCGC CGAGCTGCAC
CTCGAGGATG CTGCCTGGGC CGACTGA
 
Protein sequence
MSEAALGITC SIAGQSWRWR RASADMAAEN LAPDDLVTQL LLARGVARDD LDRQRTPTLR 
GFMPDPSLFR DMDAAAARLA DAVERREAVT IFGDYDVDGA TSAALLVRLL RALGTTVGAY
IPDRLMEGYG PSGAALVKIG EAGSKLIVTV DCGAQAFEAI AEANAAGVEV IVVDHHQCAT
SLPAALALVN PNRLDESPDA AVHGNLAAVG VAFLLGAALL RTLRARGFFA GREEPALIEL
LDLVALGTVA DVARLTGFNR ALVTQGLKVM ARRGNIGLAA LMDAARLTRP PGASDMGFAL
GPRINAGGRV GKSDLGVRLL TTDDPQEAAD IAQQLCRLNE ERRTIEAAVL DEALAASAAC
GNAPVAIVAG EGWHPGVIGI VAGRLKERLH RPAIVIAVDA DGIGKGSGRS ISGVDLGAAI
LAAKETGLLV AGGGHAMAAG LTVAADRVDA LGAFLSDRLA ADVERASGER ALLIDAVLAP
RGISPLWCDA IESAGPYGAG WPAPRVATGP VRIVESGIVG TDHVRLIVAG DDGARFKAVA
FRSAETVLGQ TLLGARGRKL WLAGRAKRDD WGSRPAAELH LEDAAWAD