Gene Synpcc7942_1065 details


Gene Information

Locus tag: Synpcc7942_1065
Symbol:
ID: 3773997
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Synechococcus elongatus PCC 7942
Kingdom: Bacteria
Replicon accession: NC_007604
Strand:
Start bp: 1075365
End bp: 1078484
Gene Length: 3120 bp
Protein Length: 1039 aa
Translation table: 11
GC content: 56%
IMG OID: 637799489
Product: DEAD/DEAH box helicase-like
Protein accession: YP_400082
Protein GI: 81299874
COG category: [V] Defense mechanisms
COG ID: [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID:
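
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1075365, 1078484)
print(length)                     # 3120
print(protein_length_aa(length))  # 1039
```

Both results match the Gene Length and Protein Length values listed in the record.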


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00124695
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.0212089
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGTGC ATTTAGAAAG TGCTTTTGAA GCAGAAATTT GCCAAGACCT CGCCCAGCAA 
GGCTGGATTT ATGAGGAAGG CGCAGCCGCT GACTACGATC GCGCTTTAGC GCTTTATCCA
CCCGATCTGT TTGCTTGGCT CCAACAAACC CAACCGGAAG CTTGGGAAAC TCTGCAACAA
AAACAAGGCA CTCAAGCCGA AGCGATTCTG TTGCAACGAG TCCGGCAACA ACTCGATCAA
GTCGGCACCC TCGATCTGTT GCGCTATGGC CTCGAAGTCC TCGGACTGCC GCGATCGCTG
AAGCTGGCGC AATTCAAACC CGCCTTTGCT ACTAATGCCG AAATTCGCGA TCGCTACGAA
GCCAATCGGC TGCGCGTCAT TCGGCAAGTG CGCTATTCCC TCCACAATCA AAACTGTATT
GATCTCGTTC TGTTCCTGAA TGGCATTCCC GTCGCCACCG TCGAACTCAA AACCGATAAC
ACCCAAAACA TCGCCGACGC CGTTTATCAA TACAAACAAG ATCGCAACCC CAAACCCGCC
GGACAATCGC CAGAACCGTT GCTGAGTTTC CCGAGTGGCG CGATCGTGCA TTTCGCCGTC
AGTAATCGCG AAGTGCAAAT GACCACCAAG CTGGCGGGCT TTGCTACTGT TTTCCTACCC
TTTAACCAAG GCAGTGATCC AGGTGCGCCC GATTGTGGCG CGGGGAATCC TGTCAGCTCT
GCTAGTGGTC ACCGCACCGC TTATCTCTGG CAAGAGGTGT GGCAACGCGA TAGCTGGCTG
GAAATTCTGG GGCGCTATTG CATCACCGAG CGCAACAAGA AACAGCAAAT CACCCGCCTG
CTCTTCCCGC GCTATCACCA ACTGATCGTG ACGCGGCGAC TGCAAGAGGC GGTCCTGCAA
GAAGGGGCTG GCCACAAGTA TTTAGTGCAG CATTCCGCCG GTTCTGGCAA AACCAATTCG
ATCGCTTGGA CAGCGCATTT CTTTAGTGAA CTCCACGACG CCGATAACCG CAAAGTCTTC
GATTCCGTGA TCGTTGTTAG CGATCGCAAC GTCATTGATA CGCAACTGCA AGAAGCGATC
GAAAGTTTCG AGCGGCACAA AGGCGTGGTT GCTGCCATCA CCCGCGACGA CGGCAGCAAA
AGCAGCAAAC TTGCCGAAGC CCTCAAGGGC GATAAGAAAA TTGTTGTCTG CACGATTCAG
ACCTTTCCCT TCGCTCTGCA AGCCGTACGC GAACTGGCCG CCACCGAAGG TAAACGCTTT
GCTGTCATTG CCGACGAAGC CCACAGTTCT CAAACCGGCG AAGCCGCCAG CAAACTCAAA
CAACTGCTTT CTCCTGCCGA ACTCGCCGAT CTCAACGACG GCGGCGAAGT GGATCTCGAA
GCCGTCCTTG CGGCCCAAAT GAGCGATCGT GCCCGCGAAA GCGGCATCAC CTACGTTGCT
TTTACCGCCA CACCCAAAGC CAAAACGCTG GAATTATTCG GCAGACGTCC CGATCCCAAC
CGACCGGCTG GCCCTGATAA TCTGCCCATG CCCTTCCACG TCTATTCAAT GCGGCAAGCG
ATCGAGGAAG GCTTCATCCT CGATGTTCTG CAAAACTACA CCAGCTACAA AGTCGCCTTC
CGCCTTGCCC AAGCAGGCCA GCAGTTTAGC GATCAAGAAG TGGAGCGCAA CGCTGCCCTC
AAAAAGCTGA TGGGCTGGGT CAAGCTGCAT CCCCACAACA TCGCCCAGAA AGTCGCGATC
GTGGTTGAGC ATTTTCAGCA ATACGTCGCG CCACTGCTGA ATGGCAAAGC CAAAGCGATG
GTCGTGGTCG GTTCCCGCAA AGAAGCCGTG CGCTGGAAAC TAGCGATCGA TCGCTACATT
GCAGCAAAAC AGTACCGTCT CGGCACCCTT GTTGCCTTTA GTGGCGAAGT CCGCGATCGC
GATTCTGGCC CCGACCCGTT CACTGAAACC AGCCCCAACC TCAACCCCAA CTTACAAGGC
GACATCCGCG AAACCTTCAA AAGCGATCGC TACCAAATCC TGATCGTCGC CAACAAATTC
CAAACCGGCT TCGACCAACC CTTGCTCTGT GGCATGTACA TCGATCGGCG TCTTGCGGGC
ATCCAAGCCG TGCAAACTCT ATCGCGCCTC AACCGCTGCC ATCCCGGCAA AGACACCACT
TATATTGTCG ATTTCAGCAA CGATCCCGCT GAGATTCTGG CCGCCTTCAA GACCTACTAC
ACCACCGCTG AACTCGCCAG CGCCACCGAT CCCAACCTGA TCTTGGATCT GCGGCTCAAG
CTCGATGCTC AGAAGCATTA CGACGCATAC GAAATCGATC GCGTTGTCAA AGTTGTCCTC
AACCCCAACG CCAAACAAAG TGATCTGCAA AAAGCCCTAG AACCGATCGC CGAACGCTTG
CTGTATCAAT ACCGCAACGC TCGCCAAGCC GCCCGCACCG CCGAAGCCCA AAACGACCCT
GCCGCCCTCA AAACCGCCCA AGACGAACTC GCCGCCCTCA CCCTGTTCCG CAGCGACCTA
GGCACCTACG TCCGTCTTTA CACCTTCCTC TCGCAAATCT TCGACTACGG CAACACCGCT
TACGAAAAAC GCGCGATCGT CTTCCGTCGC CTGATCCCGT TGCTGGAATT TGAGCGCGAA
GTCGGCAGCA TCGACCTCTC CAAAGTTGTC CTGACTCACT ACCAAGTCCG CCACCAAGGC
GATCGCCGCC TCGATCTCCA ACAGGGCGAA GCCATCCCCG TCCCCGGCCT CCAACCCGGC
AAAGGTGTTG TCCAAGACCA AGAGAAAGTC TGGCTCTCAG CCCTGATCGT CAAACTGAAC
GAACTATTCA CAGGCGATCT CACCGACGCG GACAAGGTCA ACTACGTCAG CGTTGTCCTC
CGCAGCAAAC TCCTCGAATC CCCCACCCTG CGTCAGCAAG CGATCGCCAA CAGTAAAGAG
CAATTTGCGA GCTCCCCCGA CTTTGCCCAA GAACTGCTCG AAGCGATCAT CAGCGCCCTC
GACGCTCACC AATCCATGAG TAGCCAAGCC CTCAACTCTA AGCAGGTACA GGACGGCATC
AAAAACATCC TGCTCAACCA AACCGGCCTC TACGAAGAAC TGCGATCGCA AGCCGCCTAG
 
Protein sequence
MTVHLESAFE AEICQDLAQQ GWIYEEGAAA DYDRALALYP PDLFAWLQQT QPEAWETLQQ 
KQGTQAEAIL LQRVRQQLDQ VGTLDLLRYG LEVLGLPRSL KLAQFKPAFA TNAEIRDRYE
ANRLRVIRQV RYSLHNQNCI DLVLFLNGIP VATVELKTDN TQNIADAVYQ YKQDRNPKPA
GQSPEPLLSF PSGAIVHFAV SNREVQMTTK LAGFATVFLP FNQGSDPGAP DCGAGNPVSS
ASGHRTAYLW QEVWQRDSWL EILGRYCITE RNKKQQITRL LFPRYHQLIV TRRLQEAVLQ
EGAGHKYLVQ HSAGSGKTNS IAWTAHFFSE LHDADNRKVF DSVIVVSDRN VIDTQLQEAI
ESFERHKGVV AAITRDDGSK SSKLAEALKG DKKIVVCTIQ TFPFALQAVR ELAATEGKRF
AVIADEAHSS QTGEAASKLK QLLSPAELAD LNDGGEVDLE AVLAAQMSDR ARESGITYVA
FTATPKAKTL ELFGRRPDPN RPAGPDNLPM PFHVYSMRQA IEEGFILDVL QNYTSYKVAF
RLAQAGQQFS DQEVERNAAL KKLMGWVKLH PHNIAQKVAI VVEHFQQYVA PLLNGKAKAM
VVVGSRKEAV RWKLAIDRYI AAKQYRLGTL VAFSGEVRDR DSGPDPFTET SPNLNPNLQG
DIRETFKSDR YQILIVANKF QTGFDQPLLC GMYIDRRLAG IQAVQTLSRL NRCHPGKDTT
YIVDFSNDPA EILAAFKTYY TTAELASATD PNLILDLRLK LDAQKHYDAY EIDRVVKVVL
NPNAKQSDLQ KALEPIAERL LYQYRNARQA ARTAEAQNDP AALKTAQDEL AALTLFRSDL
GTYVRLYTFL SQIFDYGNTA YEKRAIVFRR LIPLLEFERE VGSIDLSKVV LTHYQVRHQG
DRRLDLQQGE AIPVPGLQPG KGVVQDQEKV WLSALIVKLN ELFTGDLTDA DKVNYVSVVL
RSKLLESPTL RQQAIANSKE QFASSPDFAQ ELLEAIISAL DAHQSMSSQA LNSKQVQDGI
KNILLNQTGL YEELRSQAA
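
The gene and protein sequences above are printed in blocks of 10 characters, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch of how the listing could be cleaned, its GC content computed, and the coding sequence translated. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons); the helper names are hypothetical, and only the first line of the gene sequence is used as input here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); the amino-acid assignments
# are shared by translation table 11 and the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    A trailing partial codon, if any, is ignored.
    """
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_line = "ATGACTGTGC ATTTAGAAAG TGCTTTTGAA GCAGAAATTT GCCAAGACCT CGCCCAGCAA"
print(translate(first_line))  # MTVHLESAFEAEICQDLAQQ
```

The printed peptide matches the first 20 residues of the protein sequence in the record. Passing the full gene sequence to `gc_percent` and `translate` would allow the reported GC content and the complete protein sequence to be checked in the same way.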