Gene Synpcc7942_1157 details

Gene Information

Locus tag: Synpcc7942_1157
Symbol:
ID: 3775107
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Synechococcus elongatus PCC 7942
Kingdom: Bacteria
Replicon accession: NC_007604
Strand:
Start bp: 1186081
End bp: 1189266
Gene Length: 3186 bp
Protein Length: 1061 aa
Translation table: 11
GC content: 56%
IMG OID: 637799583
Product: hypothetical protein
Protein accession: YP_400174
Protein GI: 81299966
COG category: [L] Replication, recombination and repair
COG ID: [COG1074] ATP-dependent exoDNAse (exonuclease V) beta subunit (contains helicase and exonuclease domains)
TIGRFAM ID:
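
The length fields above follow from the coordinates. The sketch below shows that arithmetic, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1186081, 1189266)   # Start bp, End bp from the record
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

With the values in this record, the sketch gives 3186 bp and 1061 aa, matching the Gene Length and Protein Length fields.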


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCTGA CTGAGGAACA ACGCCAAGCT GTTCAAGCAG CAGGTTCGGT AGCGGTTACG 
GCGGGTGCTG GCACCGGCAA AACAGAATTG CTCAGTCAGC GCTACCTCTG GCATCTACAG
CAAGCGTCCG AGTCGGTTTC GCCGCTGGAA ATTGTCGTGC TGACCTTTAC CGAGAAGGCA
GCCGCTGAAT TGCGATCGCG GATCCGTAAA GCCGTTGGGA ACAAATGGCC CGATCGCTCT
GATTGGTTGG CTGAAGTTGA AGCTGCTCAA ATCAGTACCT TTCATAGCCT CTGTGCCCGG
ATTTGTCGCG AACATCCCGC CGCAGCGGGA CAACCGGCGG ATTTCACGAT CCTCGATGAT
CTTGCAGGCA AGCTCTGGCA GCAGACCGTG CTGACGGCAG CGATGCAGGA ACTGGATCCA
AGCTGCTTCG ATGTCATGGA TTACAGCGAG TGGCGATCGC TGCTCGAAAC GCTGCTGGAT
GATCCGGTTC GCAGTCAAGC GCTACTCGCA GTCGAATCTG AACAATGGCG TGAGGTACTC
GCTCAAGCTC AGCGCGATCG CCTGCAACAA ATCCAGCAGT CCTCTGATTG GCAAGCTGCG
ATCGAGGATC TCAGCGGAGC AAGTTGGCGT GAAGCGGATG CTCTCAGCGA CCAAATTCAG
CAAATTCTCG CTTGGTCAGA TACGCTCTTC GATCCGGCCA ACTCAAATTG GCAAGCAACC
TATGACGAAT TCATTAAGCT GAAACTGACT CAGGCTGGCT CAGCGAAAAA CTGGGGCAGT
GCCGATACGG CAAAGCTGTT GCGCACCCAG TGCAAAGTTC TCCGCGATCG CCTCAGGGAT
GAGGAATCAT TGCTGAGTCT GCGTCCTAAC GAGGCAGATG ATTGGAATCG AGACCAGCGA
TCGCGGATCT ACGAAGCGTT TGAAACCGTT CTTAAGAGCA TTGATGCTCG CAAGCGACAG
GATCGCTGCC TCGACTTCAA TGACCTAGAG CGTGGTGCGG TGCGAGCACT CGAAAGCGAA
GCCGTGCGAT CGCACTATCA ACAACGCTGG CGCTATTGCT TTGTCGATGA ATTCCAAGAC
ACCAACCCCA CGCAAAGCCA GATTCTGCAA GCCCTCTGGG ATCCACAACA CCTGATCCTG
ACTTTGGTTG GTGATGAGAA GCAGTCGATC TATGGCTTCC GTGGCGCGGC CACTCAAGTG
TTTCGCAATT GGCAACAGCA AATCCAGCAG CACCAAGGGC ACATCGTCAC GCTGAGCCAG
AGCTTCCGCA CCCACCAAAC GCTGCTCGAA ACCATCAATC AAGTCTTTGA GCCGGTGCTT
GATCCCTATC AACCTCTCCG CAGCGATCGC CAGCCGCCCC ATCCGCTGCC ACCCATTCAG
CAACTGGTGA TTGAGCCGGA AGAGAAAGAT TCTCTCGAGC AAGCCCGCAT CCAAGAAGCC
ACCGCGATCG CTCAGCAGAT CCAGACTTGG ATCCAGCAAC CACTACTGGT TTGGGACAAG
CCGAGCAACC AACATCGCCC GATCGCCTAC GGCGACATTG CCATTCTCTG CCGGCGGCGC
GCACCCTTGG AAACGGTCTA CGGCGAAGTT CTCAATCAGG CTGGGATTCC CGTGCTGGTC
AATGGGGGCG GTAGCCTCCT CGAAACCCCT GTCGGTTATG ACTTGCAAGC GCTGCTGGAG
TTTTTGGTCT ATCCCAGCAA CGACCTCGCC CTAGCGACCC TGTTGCGCAG TCCGGCCTTT
GGCCTCAGCG ATGCCCAGCT CTATCAGCGA GCCCAGCAAG GCAAAGGCTG GTGGAGTCAT
CGTCAGGAAC GGCCAGATCC AGCGTTTGCG GCCGCCATCA AAATTTTGGA AGGCCTGCTG
CGATCGCGCT TTCTCGAATC ACCTCTGCGG CTGGTGCAGC AGTTCGATCG CGCCACGGGC
TACAGTGCTG TGCTGGCTAG CTTACCCCAA GCTCAACGAC TGCTGGCCGA TTGGCAAGCC
CTGCTGACCT TTCTGCGTGA ACAACCTCAA GCGCATGATC TGGAGTTGTT GCTGCGCTAT
TGGAAGCAAT TACAACAGGC CGAAGTCGTC CTGCCGCGTC CCGTCCTAGA AGCGGGTAAT
GCCGTTACCC TGATGACCTT ACACGGATCA AAGGGTTTGG AATGGCCGGT CGTGATCATT
CCCGATCTCA CCGCTAAACC GAGATCGCAA GCTGAAACAG TTCTGTTTGA TACTGAGCTC
GGAGTCGCTC TGCGGCAACC CTACGTCAAA GAACAGGCCG CTGCCTATCA GTTTTTCAAG
TACCGCAAAC AGCAAGCCGA AGACGCCGAA ACCAGGCGGT TACTCTACGT TGGCTTTACG
CGGGCGCGTG ACTTGCTCCT GCTCACCAGT CCCAAATCTG CAGAGGCGCG ATCGCCTCTG
GATCTACTCG CCCCCGGACT AGAACAAGCT GCGATCCAAC CGCTTGATCC CAACTCAGGA
ACGGAGATTG CGGTTACGGC GACGTTACCC ACTGCGGATG CACCCCTGTT CTGGCAGTCC
ACAGCCGCGA TCGCTCCCCA GTACAGTGAG CTCTCTGTTT CAGCATTTGC GGACTATTTC
CGTTGTCCAG CTCTTTTTCA CTTCCGCTAT ACCCAGGGGC ATCCCGGCGC GATCGATGGG
GAAGGCGCAG GGACGAGCGA TCGGCCGCTA CAACTAGGGC TCTTAGTCCA TCGTGCCCTT
GAACTCGATT TACGTCAGCC AGAAGCACTC AAGCCTTACT GTGCTGCTGA AGATCAAGAC
CTGATTCCGC AGGCACTGCA GTTAGCTAGC TGCTTTGCGA CGGAACCCGT TTACAGCGCT
GTTCGTGAGC AGGCTCAACA GCGTGAAGTG ACTCTGCACT TGGAACTCAG TAGCGGCCTA
GTGCTCAAGG GGCGGGCCGA TCTAGTCGGA GCGGATTGGG TCCTGGATTT CAAAACCGAT
CAGCAACCTC AACCAGAGAA TTATCAGTTG CAGCTTTGGG CTTATGCTGC GGCTCTCAAT
CGCCCACAAG CCGCGATCGC TTGGCTCCGC CACAATCAAC TGGATTGGAT TGAAATTGAA
TTCATTCCTG ATCAAGCTGA ACGAGCTGCC GTCCAGTTAG CCCAAGGTGA CTTTGATCCT
CAGCCAGGAC TCTGCTGTCA ATATTGCAGC TATCGCAGTA TTTGCGAGGC AACTAGTAAA
AACTAA
 
Protein sequence
MKLTEEQRQA VQAAGSVAVT AGAGTGKTEL LSQRYLWHLQ QASESVSPLE IVVLTFTEKA 
AAELRSRIRK AVGNKWPDRS DWLAEVEAAQ ISTFHSLCAR ICREHPAAAG QPADFTILDD
LAGKLWQQTV LTAAMQELDP SCFDVMDYSE WRSLLETLLD DPVRSQALLA VESEQWREVL
AQAQRDRLQQ IQQSSDWQAA IEDLSGASWR EADALSDQIQ QILAWSDTLF DPANSNWQAT
YDEFIKLKLT QAGSAKNWGS ADTAKLLRTQ CKVLRDRLRD EESLLSLRPN EADDWNRDQR
SRIYEAFETV LKSIDARKRQ DRCLDFNDLE RGAVRALESE AVRSHYQQRW RYCFVDEFQD
TNPTQSQILQ ALWDPQHLIL TLVGDEKQSI YGFRGAATQV FRNWQQQIQQ HQGHIVTLSQ
SFRTHQTLLE TINQVFEPVL DPYQPLRSDR QPPHPLPPIQ QLVIEPEEKD SLEQARIQEA
TAIAQQIQTW IQQPLLVWDK PSNQHRPIAY GDIAILCRRR APLETVYGEV LNQAGIPVLV
NGGGSLLETP VGYDLQALLE FLVYPSNDLA LATLLRSPAF GLSDAQLYQR AQQGKGWWSH
RQERPDPAFA AAIKILEGLL RSRFLESPLR LVQQFDRATG YSAVLASLPQ AQRLLADWQA
LLTFLREQPQ AHDLELLLRY WKQLQQAEVV LPRPVLEAGN AVTLMTLHGS KGLEWPVVII
PDLTAKPRSQ AETVLFDTEL GVALRQPYVK EQAAAYQFFK YRKQQAEDAE TRRLLYVGFT
RARDLLLLTS PKSAEARSPL DLLAPGLEQA AIQPLDPNSG TEIAVTATLP TADAPLFWQS
TAAIAPQYSE LSVSAFADYF RCPALFHFRY TQGHPGAIDG EGAGTSDRPL QLGLLVHRAL
ELDLRQPEAL KPYCAAEDQD LIPQALQLAS CFATEPVYSA VREQAQQREV TLHLELSSGL
VLKGRADLVG ADWVLDFKTD QQPQPENYQL QLWAYAAALN RPQAAIAWLR HNQLDWIEIE
FIPDQAERAA VQLAQGDFDP QPGLCCQYCS YRSICEATSK N
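
Both sequences are listed in blocks of ten characters, so the spaces and line breaks need to be removed before any analysis. The following sketch shows one way to do that and to compute a GC percentage and a basic CDS sanity check. The helper names are hypothetical, and the stop-codon set is the standard one for translation table 11 (TAA, TAG, TGA).

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons under translation table 11


def ends_with_stop(dna: str) -> bool:
    """True if the length is a multiple of 3 and the final codon is a stop."""
    return len(dna) % 3 == 0 and dna[-3:] in STOP_CODONS


# First line of the gene sequence listing, copied from the record.
first_line = ("ATGAAGCTGA CTGAGGAACA ACGCCAAGCT "
              "GTTCAAGCAG CAGGTTCGGT AGCGGTTACG")
print(len(clean_sequence(first_line)))  # six blocks of ten bases
```

Passing the full gene listing through `clean_sequence` and then `gc_content` should give a value comparable to the GC content field, and the same cleaning step applied to the protein listing lets its length be compared with the Protein Length field.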