Gene Sala_0158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_0158 
Symbol 
ID4082916 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp155860 
End bp158835 
Gene Length2976 bp 
Protein Length991 aa 
Translation table11 
GC content74% 
IMG OID638008517 
Producthelicase 
Protein accessionYP_615215 
Protein GI103485654 
COG category[L] Replication, recombination and repair 
COG ID[COG3893] Inactivated superfamily I helicase 
TIGRFAM ID[TIGR02786] double-strand break repair protein AddB, alphaproteobacterial type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.233932 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.795485 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACAC GCCGCCCCAC CCTCTTTTCC ATTCCCGTCC AGCGGGCGTT CGCCGACGCG 
CTCGCCGCCG GACTGATCGA CCGCTATGCG GACGGGGTGC TGGGGCTGAC CGAGGGGATC
GTGCTGCTGC CGAGCAATCG CGCGCGCAGC GCGGTTCAGG CCGCCTTCGT GCGCGCGGGC
GGCGCAGGAC TGCTGATGCC GCGGCTTGCG GTGATCGGCG ACGCCGATCT CGACGAAAAT
GTCGCGCTCG CGCTCGATGC GGTCGACATG GACGCGCCAC TGCCGCCGGC GATCGAGCCG
CTGCGGCGGC GCCTGCTGCT CGCCGAACTG ATCGAGCGAT ATACGCCCGC GGGAGAGCCA
CCGGTGACGG GTGCCGCCGC CTTTCAGCTC GCGGGCGGCC TCGCGCGCGT CATCGACCAG
CTGCATTATG AGGAAATCCC GGCCGCGGCA CTCGTCGATC TCGACCTCGG CGCCTTTGCC
GACCATTGGC GCGCGTCGCT GGCGCGGCTG CGCCTGCTCG TCGACCATTG GCCGGATGTG
CTCGCGGCGA CGGGCCACAT CGACCGCGCG GCGCGCCGCA ACGCGCTGCT CGACCGCGTG
GCCGCCGCCT GGCGGACGGC GGCGCCCGCG CGCTTCGTCG TCGCAGCGGG CATTACCACC
GCGGCGCCCG CGATCGCGCG ACTGCTGCGC ACCGTCGCCG ATATGGAACG CGGCATGGTC
GTGCTTCCCG GCCTCGACAC CGCGATGGCG ACGGCGGAAT GGGAGGCGCT GGGCCCGACC
GGACCTGACC CCGACCGTTC GGCGCGCCCG CTGGAAACAC ATCCCCAATA TCATCTGAAA
CTGTTGCTCG ATCGCATGGG CATCGCGCGC GAAGAGGTTG CCGAATGGGA GGCGCGCTCG
CCCTTCGACG GCCCGGAGGC GCGCTCGCGC TTCGTGTCGT TGCTCTTCGC GCCCGCCGAC
TATACCGCGC GGTGGCAGGC GGCGGGCGAT CTTGAGGCCG CGACCGCCGG GATCGCGGGC
GCGACCTTTG CCGACGATGC GCAGGAGGCC CAGGGGATCG CGCTGCTGAT GCGCCATGCG
GTCGAGACGC CGGGGCGCAC GGCAGCGCTG GTCACGCCCG ACCGCGCGCT CGCCGAGCGC
ATCGCCGCCG CGCTTGCGCG CTGGGGGATC GCCGTCGACG ACAGCGCGGG CCAGCCGCTG
GCGCGCACGC CGCCGGGCGC GCTTGCAATG CTGCTCGCGG AACTGGCGGC CGATTTCAAC
CCGGTCGCGC TGATCGCGCT GCTGGCGCAC CCGCTCGTCC GCAGGGGCGA GCCGCGCACC
GCCTGGCTCG ATGCGGTGCG CCAGCTCGAC CTGCTGCTGC GCGAACCGGG GCTTGCGCCC
GGCTGGGCGG GCGTGTCGGC GCGGATCGCC GCGCTCGCTG CCGATCGCGA CGCGCGCGGC
CATGCGCTCG CGGTCGAACT GGCGCCATGG TGGCACCAGT TGGGGGAGGC GTTCGTCCCC
CTGCTCGCGC CCTTCGCCGG GCCGCCGGTG GCGCCCGACC GCCTGCTCGC CGCGCTTCGG
CAGGGGCTCG AATGGCTGAC CGGCGAGGCG GCGTGGACAG GACCGGCGGG ACGGATGCTC
GCCGAGCTGT TCGACCGCTG GACACTCGCG CGCGGCGCGG GCCCGGCGCG CGTCGCCCCC
GCCGATTTCC CGGCGATGCT CGGCCAGTTG CTCGGCGAAG CCAGCGTCCG GCCGCCCTAT
GGCGGACACC CGCGGCTGTT CATCTGGGGG CTGCTAGAGG CACGGTTGCA ACGCGCCGAC
CTGATGATCC TCGGCGGCCT CGACGAAGGG CGCTGGCCCG CGGCGACGCA GCCCGACCCT
TGGCTCGCGC CCGGAATCCG CCGCCTGATC GGCCTGCCCG CCACCGAACG GCAGCAGGGG
ATGGCCGCGC ACGACTTCGC CGGCGCGCTG GGCGCGGGCG AGATCGTCGT AACGCGCGCC
CTGCGCAGCG GCGGCGACCC GGCGGTGGCC TCGCGCTTCT GGCTGCGGCT GGCGGCGCTG
GCGGGTGATC TTCCCCAGCC CGCGTTGGAC GGCGTACCGC TGACCGATCT CGCCGCGCGC
ATCGACTTAC CGGCCGACGA CATCGAGCCC GCCGGCAAAC CCTGCCCCGC GCCGCCCGCC
GACCGGCGCC CGCGCCGGAT CAGCGTCACC GCGGTCGACC GGCTCGCGCG CGATCCCTTT
GCCTATTATG CGAACCAGAT TCTCGGCCTG TCGCCGCTCG CGCCGCTGAG CGCCGCGCCC
GATCCGCGCT GGCGCGGGAC GCGGGTGCAC GCGCTGTTCG AACGGTGGGT GCGCGCGGGA
GCGAACGCGG CGGCGTTCGA GGCCGAACTG GCGGCGCTGC GCGACGACCC GGCGCTCGAC
GCGATCGCGC GCGCTTTCTG GCTGCCGCGC ATCGAACCTG CGCTGCGCTG GGCGGCGCGA
CAGCTGATCG AGGCCGAAGG GCGCACCGCG CTGAGGGCCG AGGCGTGGGG TGAGATCGCG
CTCGACGGCA TCACCCTTAC CGGCAAGGCC GACCGCGTCG ACCGGCTGCG CGACGGACGC
CTCGCGATCG TCGATTACAA GACGGGCGGG GCGCCGAACG CGAAGGCGGC GTTCGACAAG
CTCGACAATC AGCTCGGCCT GCTCGGCCTG ATTGCCCGGC GGGGCGGGCT GGCGGGCGTC
GACGCGGCCG AAATCGCTGC CCTGGAGTAT TGGAGTCTGC GCCCCGACCG CAAAGCGGGC
GGCGCAGGCA AGATTTCGTC GACCTATGGC CCGCGCAGCG ACCTGAAAAG CGCCGCGGAA
GCGGTCGATC ACGCCGCCGA CGCGCTCGCC GGCCTGGCCG CGCGCTATCT GTTCGGCGAC
GCGCCCTTCG CGCCGGGCGA CAGCGCGACC TATGGCGATT ACGACCAGCT GATGCGCCGC
GACGAATGGT TCGGGCGCGG CGAGGAGGGC GCATGA
 
Protein sequence
MATRRPTLFS IPVQRAFADA LAAGLIDRYA DGVLGLTEGI VLLPSNRARS AVQAAFVRAG 
GAGLLMPRLA VIGDADLDEN VALALDAVDM DAPLPPAIEP LRRRLLLAEL IERYTPAGEP
PVTGAAAFQL AGGLARVIDQ LHYEEIPAAA LVDLDLGAFA DHWRASLARL RLLVDHWPDV
LAATGHIDRA ARRNALLDRV AAAWRTAAPA RFVVAAGITT AAPAIARLLR TVADMERGMV
VLPGLDTAMA TAEWEALGPT GPDPDRSARP LETHPQYHLK LLLDRMGIAR EEVAEWEARS
PFDGPEARSR FVSLLFAPAD YTARWQAAGD LEAATAGIAG ATFADDAQEA QGIALLMRHA
VETPGRTAAL VTPDRALAER IAAALARWGI AVDDSAGQPL ARTPPGALAM LLAELAADFN
PVALIALLAH PLVRRGEPRT AWLDAVRQLD LLLREPGLAP GWAGVSARIA ALAADRDARG
HALAVELAPW WHQLGEAFVP LLAPFAGPPV APDRLLAALR QGLEWLTGEA AWTGPAGRML
AELFDRWTLA RGAGPARVAP ADFPAMLGQL LGEASVRPPY GGHPRLFIWG LLEARLQRAD
LMILGGLDEG RWPAATQPDP WLAPGIRRLI GLPATERQQG MAAHDFAGAL GAGEIVVTRA
LRSGGDPAVA SRFWLRLAAL AGDLPQPALD GVPLTDLAAR IDLPADDIEP AGKPCPAPPA
DRRPRRISVT AVDRLARDPF AYYANQILGL SPLAPLSAAP DPRWRGTRVH ALFERWVRAG
ANAAAFEAEL AALRDDPALD AIARAFWLPR IEPALRWAAR QLIEAEGRTA LRAEAWGEIA
LDGITLTGKA DRVDRLRDGR LAIVDYKTGG APNAKAAFDK LDNQLGLLGL IARRGGLAGV
DAAEIAALEY WSLRPDRKAG GAGKISSTYG PRSDLKSAAE AVDHAADALA GLAARYLFGD
APFAPGDSAT YGDYDQLMRR DEWFGRGEEG A