Gene Information |
Locus tag | Sare_3088 |
Symbol | |
ID | 5706823 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 3506212 |
End bp | 3509082 |
Gene Length | 2871 bp |
Protein Length | 956 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641272524 |
Product | helicase domain-containing protein |
Protein accession | YP_001537892 |
Protein GI | 159038639 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
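The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and that the gene length includes the stop codon; the record's numbers are consistent with both assumptions, but the record itself does not state them. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a gene span that includes
    the terminal stop codon (an assumption, not stated in the record).
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and does not encode an amino acid.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(3506212, 3509082)
print(gene_len, "bp;", protein_len, "aa")
```

With the record's coordinates this reproduces the listed Gene Length (2871 bp) and Protein Length (956 aa). The strand does not affect the length calculation.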
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.01898 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGAACGATG TCCGTTCTGA GTCGGCACTC GACGTGGCGC CAGGGTCGGT GGTAGTCATC CGCGACGAGG AATGGCTGGT GCAGACCGCG GAAGAAACCG CGGACGGGCT CCTCGTGCAC GTCCGCGGAC TGGGCGAACT CGTTCGTGAC GTGACCGCCA GCTTCTGCGC CTCCCTCGAC CACATCGCAC CGCTCGATCC AGCAGCCGCG CGCGTCGTGG CGGACGACAG CCCGCGCTAC CGCAAGGCCC GCCTCTGGCT GGAATCCACC TACCGAAAGA CCGCCGTTCC GCTCGCCGAT CGCTCGCTCA CCGTTGCCAC ACAGGCCCTC GCGACACCGC TGAGATACCA ACAGACCGCC GTGCGCAAGG CGCTCGACCC GGACAACCTG CGTCCCCGGA TCCTCCTGGC CGACGCGGTC GGGCTCGGCA AGACCCTGGA AATCGGCATG ATCCTGTCGG AGCTGGTACG CCGGGGCCGA GGCGAGCGGA TCCTCATCGT GACCCCGCGC CACGTACTCG AACAGACGCA GCAGGAATTG TGGTCGCGCT TCGCGCTGCC GTTCGTCCGC CTCGACTCGG CCGGCATCCA ACGGGTCCGC CAGAAGCTCC CGGCGAACCG TAACCCGTTC ACGTATTTCC GGCGGGCGAT CATCTCGATC GACACCCTCA AGTCGGAGCG GTACGCGGCC AGCCTGAAGA AGCAGCGTTG GGACGCGGTG GTCATCGACG AGTCGCACAA CCTCGCCAAC GCGGCCACGC AGAACAACCG GCTGGCCCGC ACGCTTGCCC GCAATACCGA AACGATGATC CTCGCGTCGG CGACCCCGCA CAACGGCCGC AAGGAGTCCT TCGCCGAGTT GGTCCGGCTG CTCGACCCGT CCGCGATCCC ACCCGGCGGC GACCTGGTGG CCGCCGAGAT CGAACGGCTG GTGATCCGCC GCCACCGGCA CAGCGACGAG GTGGCCAGCG CCGTCGGCGC CGACTGGGCG GAACGACAGG AGCCGGAGAA CGTGCTCGTC GCCGCGAGCC GCGCGGAGAA CGCGGTCGCG CGGGAGATTG AGGACGTCTG GTTGTGGCCG CCGGACCAAC GCACCCCCTA CTCGGGGCAG AACGCGTCGC TGTTCCCGTG GACGCTGGCC AAGGCATTCC TGTCGTCGCC GGCCGCGCTG GCGGAAACCA TCCGGCAGCG GCTGGCTCGC CTCGGGCACG ACGCCGCCGC GAGCAGAGAA CACGCGGCGC TGAATCGGCT GGCCGACCTC AACGCGGCAG CGCTGGCGGA GGGCTCGGCC AAGTACGCCG AGCTGGTCCG CCGCCTCCAG GAGATCGGCG TCGCGTCAGG CTCGCCGACG CGGACGGTGG TGTTCGCCGA ACGAGTAGCC ACCCTCACCT GGCTGCACAA GAACCTGCCG GCCGACCTCG GCCTGCCCGC CGGAGCCGTC GCCATCATGC ACGGTGGCCT GCCCGACGAC GAGCAGCAGC AGATCGTCGA CGCCTTCAAG CAGGAACACG CCGCGATCCG CGTCCTGGTC ACCGGCGACG TCGCCTCCGA AGGAGTCAAC CTGCACGCCC AGTGCCATCA CCTGATCCAC TACGACATCC CATGGAGTCT CATCCGGATC GAGCAGCGCA ACGGCCGCAT CGACCGCTAC CGCCAGCGAC AACCACCGCG GATCATCGCC CTGCTGCTGG ACCCGGACAG TGGCCGGTTC GCCGGCGACA TCCGCATCCT GGCGAAACTC GTCGACAAGG AGAACGAGGC GCACCGCGCG CTCGGCGACG TCGCCTCGCT GATGGGCCAG 
TACGACGTCA AGGGCGAGGA GGACACCATC CGTGCCGTCC TGGCCGGGCA GCGCGACCTG GACGACGTGG TGCGTATGCC GGAAGAGGTG CTGGCCGGTG ATGACCTGTC GGCGATGTTC TTCGGCATCG TCGATGCTGA CGACGACGCG GAAACGGCAC CGCTGCCCGA GACGACCGTG ACCCCGGGCT CCGGGCTCTA CCCCGACGAG GTCTCCTACC TCGAAGAGGC GTTGCACGCC GCCTTCGTCG ACCCGACCGG CCCGCCGAGC AACAACGGCG TTGCCTGGCG CCGCGACGCC GCCTTCGGCA CCGCCGAGCT GACACCGCCA CCGGACCTGG TGCAACGCCT TGAGGTCCTG CCGCAGAGCT ATCTCGCCGA CCGGCGGGTC ACCGAAAGGC TCGTCCTGGC CACCACCCTC GCCCGGGGCA GGAACCGTAT GGAGCAGGCG CTGTGCGATG AGACCGGTTC CAGTTGGCCG GACGCGCACT ATCTGGGTCC ACTGCATCCG GTGCTGGACT GGGCCTCGGA CCGCGCTCTG GCCGGGCTGG GCCGCAATCA GGTGTTCGCG GTCCGCGGTG GTGTCTACGA CCCGACCGTG CTGCTGCTGG GCACGCTCAC CAACCGGCGC GGGCAGGTCG TCTCGGCGGC GTGGATGACC GTCGAGTTCC CGATGCCCGA CACCCTCTCG TTCGCCCCGG TGACCCCGCA CCCGTCGGCC GCCGCGGCAC TGGCCGCGCT GGGTTTCGAC ACCGCGCGGG CCAACCCCGG TCCGGTGCCG GATGCCGAGG CCCTGGACCG GTTCATCGCC CCGGCGGTGC GGCACGCCCG TGCGCAGATG CGCGACCTGT TCGCTGCGGC CGAGAAAGAC GTCGCCCACC GGGTCGAGCA GTGGTCGCAC CGGTTGGCCC AGTGGCAGGA CGAGGCAGAC GCGCTCATCC AGCGCCGCGA TCTCAAGCAG CGCCGGGTCA GCATCGCGCA GGAGCAGCGG CTGGTGGCCG ACATGAACCC GGATCGGCAA CTGGTCCGCC CACTGCTGGT CGTCGTGCCG GAGATCGGAG CCACCGCATG A
Protein sequence | MNDVRSESAL DVAPGSVVVI RDEEWLVQTA EETADGLLVH VRGLGELVRD VTASFCASLD HIAPLDPAAA RVVADDSPRY RKARLWLEST YRKTAVPLAD RSLTVATQAL ATPLRYQQTA VRKALDPDNL RPRILLADAV GLGKTLEIGM ILSELVRRGR GERILIVTPR HVLEQTQQEL WSRFALPFVR LDSAGIQRVR QKLPANRNPF TYFRRAIISI DTLKSERYAA SLKKQRWDAV VIDESHNLAN AATQNNRLAR TLARNTETMI LASATPHNGR KESFAELVRL LDPSAIPPGG DLVAAEIERL VIRRHRHSDE VASAVGADWA ERQEPENVLV AASRAENAVA REIEDVWLWP PDQRTPYSGQ NASLFPWTLA KAFLSSPAAL AETIRQRLAR LGHDAAASRE HAALNRLADL NAAALAEGSA KYAELVRRLQ EIGVASGSPT RTVVFAERVA TLTWLHKNLP ADLGLPAGAV AIMHGGLPDD EQQQIVDAFK QEHAAIRVLV TGDVASEGVN LHAQCHHLIH YDIPWSLIRI EQRNGRIDRY RQRQPPRIIA LLLDPDSGRF AGDIRILAKL VDKENEAHRA LGDVASLMGQ YDVKGEEDTI RAVLAGQRDL DDVVRMPEEV LAGDDLSAMF FGIVDADDDA ETAPLPETTV TPGSGLYPDE VSYLEEALHA AFVDPTGPPS NNGVAWRRDA AFGTAELTPP PDLVQRLEVL PQSYLADRRV TERLVLATTL ARGRNRMEQA LCDETGSSWP DAHYLGPLHP VLDWASDRAL AGLGRNQVFA VRGGVYDPTV LLLGTLTNRR GQVVSAAWMT VEFPMPDTLS FAPVTPHPSA AAALAALGFD TARANPGPVP DAEALDRFIA PAVRHARAQM RDLFAAAEKD VAHRVEQWSH RLAQWQDEAD ALIQRRDLKQ RRVSIAQEQR LVADMNPDRQ LVRPLLVVVP EIGATA
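The gene sequence begins with GTG while the protein begins with M, which is what one would expect under translation table 11, where GTG can act as an initiator codon and is then read as methionine. The following is a minimal sketch of that translation, not the pipeline that produced this record. The codon table is the standard one, which table 11 shares for amino acid assignments. The start-codon set is taken from the NCBI listing for table 11 as recalled here and should be treated as an assumption. The function name `translate` and its `first_is_start` parameter are hypothetical.

```python
BASES = "TCAG"
# Standard amino acid assignments, ordered by first, second, third base in TCAG order.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiator codons for table 11 (assumed from the NCBI genetic code listing).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str, first_is_start: bool = True) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    If first_is_start is True, a recognised initiator codon at position 0
    is translated as methionine.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and first_is_start and codon in START_CODONS:
            residue = "M"
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First seven codons and last seven codons of the gene sequence in this record.
print(translate("GTGAACGATG TCCGTTCTGA G"))
print(translate("GAGATCGGAG CCACCGCATG A", first_is_start=False))
```

The two fragments correspond to the beginning (MNDVRSE) and end (EIGATA) of the listed protein sequence. The final TGA codon is the stop and contributes no residue.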