Gene Sare_3088 details

Gene Information

Locus tag: Sare_3088
Symbol:
ID: 5706823
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 3506212
End bp: 3509082
Gene Length: 2871 bp
Protein Length: 956 aa
Translation table: 11
GC content: 70%
IMG OID: 641272524
Product: helicase domain-containing protein
Protein accession: YP_001537892
Protein GI: 159038639
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG0553] Superfamily II DNA/RNA helicases, SNF2 family
TIGRFAM ID:
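
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are chosen for this example and are not part of the record.

```python
# Values copied from the record above.
start_bp = 3506212
end_bp = 3509082
gene_length_bp = 2871
protein_length_aa = 956

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last codon is assumed to be the stop.
codons, remainder = divmod(span_bp, 3)
coding_codons = codons - 1

print(span_bp, remainder, coding_codons)  # 2871 0 956
```

Under these assumptions the computed span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.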


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.01898
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACGATG TCCGTTCTGA GTCGGCACTC GACGTGGCGC CAGGGTCGGT GGTAGTCATC 
CGCGACGAGG AATGGCTGGT GCAGACCGCG GAAGAAACCG CGGACGGGCT CCTCGTGCAC
GTCCGCGGAC TGGGCGAACT CGTTCGTGAC GTGACCGCCA GCTTCTGCGC CTCCCTCGAC
CACATCGCAC CGCTCGATCC AGCAGCCGCG CGCGTCGTGG CGGACGACAG CCCGCGCTAC
CGCAAGGCCC GCCTCTGGCT GGAATCCACC TACCGAAAGA CCGCCGTTCC GCTCGCCGAT
CGCTCGCTCA CCGTTGCCAC ACAGGCCCTC GCGACACCGC TGAGATACCA ACAGACCGCC
GTGCGCAAGG CGCTCGACCC GGACAACCTG CGTCCCCGGA TCCTCCTGGC CGACGCGGTC
GGGCTCGGCA AGACCCTGGA AATCGGCATG ATCCTGTCGG AGCTGGTACG CCGGGGCCGA
GGCGAGCGGA TCCTCATCGT GACCCCGCGC CACGTACTCG AACAGACGCA GCAGGAATTG
TGGTCGCGCT TCGCGCTGCC GTTCGTCCGC CTCGACTCGG CCGGCATCCA ACGGGTCCGC
CAGAAGCTCC CGGCGAACCG TAACCCGTTC ACGTATTTCC GGCGGGCGAT CATCTCGATC
GACACCCTCA AGTCGGAGCG GTACGCGGCC AGCCTGAAGA AGCAGCGTTG GGACGCGGTG
GTCATCGACG AGTCGCACAA CCTCGCCAAC GCGGCCACGC AGAACAACCG GCTGGCCCGC
ACGCTTGCCC GCAATACCGA AACGATGATC CTCGCGTCGG CGACCCCGCA CAACGGCCGC
AAGGAGTCCT TCGCCGAGTT GGTCCGGCTG CTCGACCCGT CCGCGATCCC ACCCGGCGGC
GACCTGGTGG CCGCCGAGAT CGAACGGCTG GTGATCCGCC GCCACCGGCA CAGCGACGAG
GTGGCCAGCG CCGTCGGCGC CGACTGGGCG GAACGACAGG AGCCGGAGAA CGTGCTCGTC
GCCGCGAGCC GCGCGGAGAA CGCGGTCGCG CGGGAGATTG AGGACGTCTG GTTGTGGCCG
CCGGACCAAC GCACCCCCTA CTCGGGGCAG AACGCGTCGC TGTTCCCGTG GACGCTGGCC
AAGGCATTCC TGTCGTCGCC GGCCGCGCTG GCGGAAACCA TCCGGCAGCG GCTGGCTCGC
CTCGGGCACG ACGCCGCCGC GAGCAGAGAA CACGCGGCGC TGAATCGGCT GGCCGACCTC
AACGCGGCAG CGCTGGCGGA GGGCTCGGCC AAGTACGCCG AGCTGGTCCG CCGCCTCCAG
GAGATCGGCG TCGCGTCAGG CTCGCCGACG CGGACGGTGG TGTTCGCCGA ACGAGTAGCC
ACCCTCACCT GGCTGCACAA GAACCTGCCG GCCGACCTCG GCCTGCCCGC CGGAGCCGTC
GCCATCATGC ACGGTGGCCT GCCCGACGAC GAGCAGCAGC AGATCGTCGA CGCCTTCAAG
CAGGAACACG CCGCGATCCG CGTCCTGGTC ACCGGCGACG TCGCCTCCGA AGGAGTCAAC
CTGCACGCCC AGTGCCATCA CCTGATCCAC TACGACATCC CATGGAGTCT CATCCGGATC
GAGCAGCGCA ACGGCCGCAT CGACCGCTAC CGCCAGCGAC AACCACCGCG GATCATCGCC
CTGCTGCTGG ACCCGGACAG TGGCCGGTTC GCCGGCGACA TCCGCATCCT GGCGAAACTC
GTCGACAAGG AGAACGAGGC GCACCGCGCG CTCGGCGACG TCGCCTCGCT GATGGGCCAG
TACGACGTCA AGGGCGAGGA GGACACCATC CGTGCCGTCC TGGCCGGGCA GCGCGACCTG
GACGACGTGG TGCGTATGCC GGAAGAGGTG CTGGCCGGTG ATGACCTGTC GGCGATGTTC
TTCGGCATCG TCGATGCTGA CGACGACGCG GAAACGGCAC CGCTGCCCGA GACGACCGTG
ACCCCGGGCT CCGGGCTCTA CCCCGACGAG GTCTCCTACC TCGAAGAGGC GTTGCACGCC
GCCTTCGTCG ACCCGACCGG CCCGCCGAGC AACAACGGCG TTGCCTGGCG CCGCGACGCC
GCCTTCGGCA CCGCCGAGCT GACACCGCCA CCGGACCTGG TGCAACGCCT TGAGGTCCTG
CCGCAGAGCT ATCTCGCCGA CCGGCGGGTC ACCGAAAGGC TCGTCCTGGC CACCACCCTC
GCCCGGGGCA GGAACCGTAT GGAGCAGGCG CTGTGCGATG AGACCGGTTC CAGTTGGCCG
GACGCGCACT ATCTGGGTCC ACTGCATCCG GTGCTGGACT GGGCCTCGGA CCGCGCTCTG
GCCGGGCTGG GCCGCAATCA GGTGTTCGCG GTCCGCGGTG GTGTCTACGA CCCGACCGTG
CTGCTGCTGG GCACGCTCAC CAACCGGCGC GGGCAGGTCG TCTCGGCGGC GTGGATGACC
GTCGAGTTCC CGATGCCCGA CACCCTCTCG TTCGCCCCGG TGACCCCGCA CCCGTCGGCC
GCCGCGGCAC TGGCCGCGCT GGGTTTCGAC ACCGCGCGGG CCAACCCCGG TCCGGTGCCG
GATGCCGAGG CCCTGGACCG GTTCATCGCC CCGGCGGTGC GGCACGCCCG TGCGCAGATG
CGCGACCTGT TCGCTGCGGC CGAGAAAGAC GTCGCCCACC GGGTCGAGCA GTGGTCGCAC
CGGTTGGCCC AGTGGCAGGA CGAGGCAGAC GCGCTCATCC AGCGCCGCGA TCTCAAGCAG
CGCCGGGTCA GCATCGCGCA GGAGCAGCGG CTGGTGGCCG ACATGAACCC GGATCGGCAA
CTGGTCCGCC CACTGCTGGT CGTCGTGCCG GAGATCGGAG CCACCGCATG A
 
Protein sequence
MNDVRSESAL DVAPGSVVVI RDEEWLVQTA EETADGLLVH VRGLGELVRD VTASFCASLD 
HIAPLDPAAA RVVADDSPRY RKARLWLEST YRKTAVPLAD RSLTVATQAL ATPLRYQQTA
VRKALDPDNL RPRILLADAV GLGKTLEIGM ILSELVRRGR GERILIVTPR HVLEQTQQEL
WSRFALPFVR LDSAGIQRVR QKLPANRNPF TYFRRAIISI DTLKSERYAA SLKKQRWDAV
VIDESHNLAN AATQNNRLAR TLARNTETMI LASATPHNGR KESFAELVRL LDPSAIPPGG
DLVAAEIERL VIRRHRHSDE VASAVGADWA ERQEPENVLV AASRAENAVA REIEDVWLWP
PDQRTPYSGQ NASLFPWTLA KAFLSSPAAL AETIRQRLAR LGHDAAASRE HAALNRLADL
NAAALAEGSA KYAELVRRLQ EIGVASGSPT RTVVFAERVA TLTWLHKNLP ADLGLPAGAV
AIMHGGLPDD EQQQIVDAFK QEHAAIRVLV TGDVASEGVN LHAQCHHLIH YDIPWSLIRI
EQRNGRIDRY RQRQPPRIIA LLLDPDSGRF AGDIRILAKL VDKENEAHRA LGDVASLMGQ
YDVKGEEDTI RAVLAGQRDL DDVVRMPEEV LAGDDLSAMF FGIVDADDDA ETAPLPETTV
TPGSGLYPDE VSYLEEALHA AFVDPTGPPS NNGVAWRRDA AFGTAELTPP PDLVQRLEVL
PQSYLADRRV TERLVLATTL ARGRNRMEQA LCDETGSSWP DAHYLGPLHP VLDWASDRAL
AGLGRNQVFA VRGGVYDPTV LLLGTLTNRR GQVVSAAWMT VEFPMPDTLS FAPVTPHPSA
AAALAALGFD TARANPGPVP DAEALDRFIA PAVRHARAQM RDLFAAAEKD VAHRVEQWSH
RLAQWQDEAD ALIQRRDLKQ RRVSIAQEQR LVADMNPDRQ LVRPLLVVVP EIGATA
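
The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal translation sketch. It rests on assumptions: the codon-to-amino-acid assignments of translation table 11 are taken to be the standard ones, the first codon is assumed to be a valid initiator (here GTG) and is therefore written as M, and translation stops at the first stop codon. `translate_cds` is a hypothetical helper written for this example, not a tool provided by the database.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS given as text, ignoring the spaces and line breaks of the listing."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break  # stop codon ends translation
        # Assumption: the first codon is an initiator and is read as methionine.
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "GTGAACGATG TCCGTTCTGA GTCGGCACTC"
print(translate_cds(first_blocks))  # MNDVRSESAL
```

The output agrees with the first ten residues of the protein sequence in the record. For real work an established library would be the safer choice; this sketch only shows the mapping.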