Gene Information |
Locus tag | Synpcc7942_1065 |
Symbol | |
ID | 3773997 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus elongatus PCC 7942 |
Kingdom | Bacteria |
Replicon accession | NC_007604 |
Strand | + |
Start bp | 1075365 |
End bp | 1078484 |
Gene Length | 3120 bp |
Protein Length | 1039 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637799489 |
Product | DEAD/DEAH box helicase-like |
Protein accession | YP_400082 |
Protein GI | 81299874 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
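The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1075365
END_BP = 1078484
GENE_LENGTH_BP = 3120
PROTEIN_LENGTH_AA = 1039


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks should agree with the values listed in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the listed fields agree with each other: the interval spans 3120 bp, which corresponds to 1040 codons, or 1039 amino acids once the stop codon is excluded.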
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00124695 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0212089 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACTGTGC ATTTAGAAAG TGCTTTTGAA GCAGAAATTT GCCAAGACCT CGCCCAGCAA GGCTGGATTT ATGAGGAAGG CGCAGCCGCT GACTACGATC GCGCTTTAGC GCTTTATCCA CCCGATCTGT TTGCTTGGCT CCAACAAACC CAACCGGAAG CTTGGGAAAC TCTGCAACAA AAACAAGGCA CTCAAGCCGA AGCGATTCTG TTGCAACGAG TCCGGCAACA ACTCGATCAA GTCGGCACCC TCGATCTGTT GCGCTATGGC CTCGAAGTCC TCGGACTGCC GCGATCGCTG AAGCTGGCGC AATTCAAACC CGCCTTTGCT ACTAATGCCG AAATTCGCGA TCGCTACGAA GCCAATCGGC TGCGCGTCAT TCGGCAAGTG CGCTATTCCC TCCACAATCA AAACTGTATT GATCTCGTTC TGTTCCTGAA TGGCATTCCC GTCGCCACCG TCGAACTCAA AACCGATAAC ACCCAAAACA TCGCCGACGC CGTTTATCAA TACAAACAAG ATCGCAACCC CAAACCCGCC GGACAATCGC CAGAACCGTT GCTGAGTTTC CCGAGTGGCG CGATCGTGCA TTTCGCCGTC AGTAATCGCG AAGTGCAAAT GACCACCAAG CTGGCGGGCT TTGCTACTGT TTTCCTACCC TTTAACCAAG GCAGTGATCC AGGTGCGCCC GATTGTGGCG CGGGGAATCC TGTCAGCTCT GCTAGTGGTC ACCGCACCGC TTATCTCTGG CAAGAGGTGT GGCAACGCGA TAGCTGGCTG GAAATTCTGG GGCGCTATTG CATCACCGAG CGCAACAAGA AACAGCAAAT CACCCGCCTG CTCTTCCCGC GCTATCACCA ACTGATCGTG ACGCGGCGAC TGCAAGAGGC GGTCCTGCAA GAAGGGGCTG GCCACAAGTA TTTAGTGCAG CATTCCGCCG GTTCTGGCAA AACCAATTCG ATCGCTTGGA CAGCGCATTT CTTTAGTGAA CTCCACGACG CCGATAACCG CAAAGTCTTC GATTCCGTGA TCGTTGTTAG CGATCGCAAC GTCATTGATA CGCAACTGCA AGAAGCGATC GAAAGTTTCG AGCGGCACAA AGGCGTGGTT GCTGCCATCA CCCGCGACGA CGGCAGCAAA AGCAGCAAAC TTGCCGAAGC CCTCAAGGGC GATAAGAAAA TTGTTGTCTG CACGATTCAG ACCTTTCCCT TCGCTCTGCA AGCCGTACGC GAACTGGCCG CCACCGAAGG TAAACGCTTT GCTGTCATTG CCGACGAAGC CCACAGTTCT CAAACCGGCG AAGCCGCCAG CAAACTCAAA CAACTGCTTT CTCCTGCCGA ACTCGCCGAT CTCAACGACG GCGGCGAAGT GGATCTCGAA GCCGTCCTTG CGGCCCAAAT GAGCGATCGT GCCCGCGAAA GCGGCATCAC CTACGTTGCT TTTACCGCCA CACCCAAAGC CAAAACGCTG GAATTATTCG GCAGACGTCC CGATCCCAAC CGACCGGCTG GCCCTGATAA TCTGCCCATG CCCTTCCACG TCTATTCAAT GCGGCAAGCG ATCGAGGAAG GCTTCATCCT CGATGTTCTG CAAAACTACA CCAGCTACAA AGTCGCCTTC CGCCTTGCCC AAGCAGGCCA GCAGTTTAGC GATCAAGAAG TGGAGCGCAA CGCTGCCCTC AAAAAGCTGA TGGGCTGGGT CAAGCTGCAT CCCCACAACA TCGCCCAGAA AGTCGCGATC GTGGTTGAGC ATTTTCAGCA ATACGTCGCG CCACTGCTGA ATGGCAAAGC CAAAGCGATG 
GTCGTGGTCG GTTCCCGCAA AGAAGCCGTG CGCTGGAAAC TAGCGATCGA TCGCTACATT GCAGCAAAAC AGTACCGTCT CGGCACCCTT GTTGCCTTTA GTGGCGAAGT CCGCGATCGC GATTCTGGCC CCGACCCGTT CACTGAAACC AGCCCCAACC TCAACCCCAA CTTACAAGGC GACATCCGCG AAACCTTCAA AAGCGATCGC TACCAAATCC TGATCGTCGC CAACAAATTC CAAACCGGCT TCGACCAACC CTTGCTCTGT GGCATGTACA TCGATCGGCG TCTTGCGGGC ATCCAAGCCG TGCAAACTCT ATCGCGCCTC AACCGCTGCC ATCCCGGCAA AGACACCACT TATATTGTCG ATTTCAGCAA CGATCCCGCT GAGATTCTGG CCGCCTTCAA GACCTACTAC ACCACCGCTG AACTCGCCAG CGCCACCGAT CCCAACCTGA TCTTGGATCT GCGGCTCAAG CTCGATGCTC AGAAGCATTA CGACGCATAC GAAATCGATC GCGTTGTCAA AGTTGTCCTC AACCCCAACG CCAAACAAAG TGATCTGCAA AAAGCCCTAG AACCGATCGC CGAACGCTTG CTGTATCAAT ACCGCAACGC TCGCCAAGCC GCCCGCACCG CCGAAGCCCA AAACGACCCT GCCGCCCTCA AAACCGCCCA AGACGAACTC GCCGCCCTCA CCCTGTTCCG CAGCGACCTA GGCACCTACG TCCGTCTTTA CACCTTCCTC TCGCAAATCT TCGACTACGG CAACACCGCT TACGAAAAAC GCGCGATCGT CTTCCGTCGC CTGATCCCGT TGCTGGAATT TGAGCGCGAA GTCGGCAGCA TCGACCTCTC CAAAGTTGTC CTGACTCACT ACCAAGTCCG CCACCAAGGC GATCGCCGCC TCGATCTCCA ACAGGGCGAA GCCATCCCCG TCCCCGGCCT CCAACCCGGC AAAGGTGTTG TCCAAGACCA AGAGAAAGTC TGGCTCTCAG CCCTGATCGT CAAACTGAAC GAACTATTCA CAGGCGATCT CACCGACGCG GACAAGGTCA ACTACGTCAG CGTTGTCCTC CGCAGCAAAC TCCTCGAATC CCCCACCCTG CGTCAGCAAG CGATCGCCAA CAGTAAAGAG CAATTTGCGA GCTCCCCCGA CTTTGCCCAA GAACTGCTCG AAGCGATCAT CAGCGCCCTC GACGCTCACC AATCCATGAG TAGCCAAGCC CTCAACTCTA AGCAGGTACA GGACGGCATC AAAAACATCC TGCTCAACCA AACCGGCCTC TACGAAGAAC TGCGATCGCA AGCCGCCTAG
Protein sequence | MTVHLESAFE AEICQDLAQQ GWIYEEGAAA DYDRALALYP PDLFAWLQQT QPEAWETLQQ KQGTQAEAIL LQRVRQQLDQ VGTLDLLRYG LEVLGLPRSL KLAQFKPAFA TNAEIRDRYE ANRLRVIRQV RYSLHNQNCI DLVLFLNGIP VATVELKTDN TQNIADAVYQ YKQDRNPKPA GQSPEPLLSF PSGAIVHFAV SNREVQMTTK LAGFATVFLP FNQGSDPGAP DCGAGNPVSS ASGHRTAYLW QEVWQRDSWL EILGRYCITE RNKKQQITRL LFPRYHQLIV TRRLQEAVLQ EGAGHKYLVQ HSAGSGKTNS IAWTAHFFSE LHDADNRKVF DSVIVVSDRN VIDTQLQEAI ESFERHKGVV AAITRDDGSK SSKLAEALKG DKKIVVCTIQ TFPFALQAVR ELAATEGKRF AVIADEAHSS QTGEAASKLK QLLSPAELAD LNDGGEVDLE AVLAAQMSDR ARESGITYVA FTATPKAKTL ELFGRRPDPN RPAGPDNLPM PFHVYSMRQA IEEGFILDVL QNYTSYKVAF RLAQAGQQFS DQEVERNAAL KKLMGWVKLH PHNIAQKVAI VVEHFQQYVA PLLNGKAKAM VVVGSRKEAV RWKLAIDRYI AAKQYRLGTL VAFSGEVRDR DSGPDPFTET SPNLNPNLQG DIRETFKSDR YQILIVANKF QTGFDQPLLC GMYIDRRLAG IQAVQTLSRL NRCHPGKDTT YIVDFSNDPA EILAAFKTYY TTAELASATD PNLILDLRLK LDAQKHYDAY EIDRVVKVVL NPNAKQSDLQ KALEPIAERL LYQYRNARQA ARTAEAQNDP AALKTAQDEL AALTLFRSDL GTYVRLYTFL SQIFDYGNTA YEKRAIVFRR LIPLLEFERE VGSIDLSKVV LTHYQVRHQG DRRLDLQQGE AIPVPGLQPG KGVVQDQEKV WLSALIVKLN ELFTGDLTDA DKVNYVSVVL RSKLLESPTL RQQAIANSKE QFASSPDFAQ ELLEAIISAL DAHQSMSSQA LNSKQVQDGI KNILLNQTGL YEELRSQAA