Gene Synpcc7942_1000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_1000 
Symbol 
ID3773927 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp1008139 
End bp1011369 
Gene Length3231 bp 
Protein Length1076 aa 
Translation table11 
GC content57% 
IMG OID637799419 
Producthypothetical protein 
Protein accessionYP_400017 
Protein GI81299809 
COG category[R] General function prediction only 
COG ID[COG2374] Predicted extracellular nuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACAT CCGTTTTTAT TAACGAGTTT CACTACGACA ATGCCGGCAC GGATGCCGGT 
GAATTTATTG AAATCTTGGC GCCTCTGGCC ACTAGCCTGA CTGGCTGGCG CCTAGTGCTC
TACAACGGGG CTAACGGTCT GAGTTACAAC ACTACAGACT TAGGCAGCCT CTCGAGTACT
GACCTTGGGA ATGGCTTCAA AGCAGTCGTG GTCAACTACC CCAGCAATGG GATTCAAAAT
GGCGCACCCG ACGGCTTCGC CCTCGTAGAT GACACAGGCG CCGTGGTGCA GTTTCTCAGC
TATGAGGGCA CCTTCACCGC CACAAACGGT CCTGCATTAG GCTTGACCAG TACTGATATT
GGCATTGCCC AGAGCGGTAC TGAACCCCTG GGATCCTCCC TCCAGTTGAC GGGCAGCGGC
ACCCACTACG AAGACTTTAC CTGGGCTAGC ACGACTACCA ACACCAGCGG TGCCCTTAAT
GCGGGCCAGA TCTTGGGTGG GGTTGTCGAG GCGATCGCTC CTCGCATCAA CGAATTTGTG
TTCGACCACG TTGGCACCGA CACCAACGAA TATGTCGAAA TTTTGGGTGC GGCAAACACG
GACTACTCTG CCTACACCCT GCTGCAAATT GAGGGGGACA GCGGTAGCAC CCTCGGCCGG
ATTACCAGCG CCCAGTCCTT GGGCAGCACT GATGCCAGTG GCTACTGGAC CACGGGCTTT
TTGAATAATG TTTACCAAAA TGGCACGCAG ACTTTGTTGC TGGTGAGGAA CTTTACCGGC
AGTGTCAACC AAGTGCTCGA TAGCAACGGC GACGGGGTGC TGGATATTAC CCCTTGGGAT
AGTGTTGCCG ATGGAATAGC AGTAACTGAT GGTGGCCTGG GCGATCGCAC CTACACTGCC
GCTATCCTCA CCCCCGGCTT TGGCGGCAAT AGCCTCCGCG TCGGTGGCGC TTCCCGAATC
CCCAACGGGA CTGATACCGA TACCGCTGCT GACTGGGTGC GCAATGACTT TGACCTTGCA
GGGCTACCGG GCTTTCCCGG AACACCGGAT CAAGGCGAGG CGTTGAACAC GCCCGGAGCG
GCTAATGCGG TGGTTGATGT GCCACCGCCA CCGCCGGAAC TGACGGCAAT TTATACCATT
CAGGGGGCTG GGCAACGGTC TGCCCTCGTT GGCCAGACTG TCACGACCTC GGGTATTGTC
ACGGCTGTCG ACAGCAACGG CTTCTACCTC CAAGACGCCA CCGGCGATGG CGATATTGCC
ACGTCTGACG CCATCTTTGT CTTCACGGGC AGCACCCCCA CCGTGGCTGT GGGTAACGAA
ATCCGCCTGT CTGGGACCGT GTCGGAGTTT ACTCCCGGTG GCCTTTCTAG CCGCAATCTT
TCTACCACCC AGCTTGGCGG CAGCCTCAAC ATCACAGTTC TGAGCACCGG CAACACCCTG
CCAGCAGCGG TCATTCTGGG CGAAGGCGGT CGTTTACTCC CCTCATCTAA TATCGACGAT
GATGCCTTCG GCAGCTTTGA CCCAATCACC GATGGCATCG ACTTCTTCGA GTCGCTGGAG
GGGATGCGGG TCACAGCTCA AAACCTACTG GTGGTCAATG GTACCAACGG CTTTGGCGAA
ATCTTTGGGG TAGTCGACAA CGGTGCGGGG GCGACGGGTC TGAGCGATCG CGGCACCCTC
AACATTAGCC CTGACGACTT TAACCCCGAG CGGGTGCAGG TGCAGTTCGA CAGCGGTGTT
TTCAACTTCG ACTTCCCCGA TGTCAACGTG GGCGATGGCA TCGGCGCGGT TACCGGCGTG
CTGGGCTACG ACTTTGGGAA TTTCCAGATT GTCGCTACCG AAGACTTCAG CGGCAGCATC
CAACGGGGTG GTTTACAACC GGAGGTCAGC ACACTCACCC GTGGCAATGA CCAGTTGACG
GTTGCCACCT ACAACGTACT CAACCTCGAC CCCAACGATA ACGATGGCGA TGCCGACATT
GCCAATGGCC AGTTTGCAGC GATCGCCCGC CAAATCATCA ATAACCTCAA TTCCCCGGAC
ATCATTGCCC TGCAGGAAGT TCAAGACAAC AGTGGCAGCC TCAATGATGG GGTAACGGCG
GCGAACCTGA CTCTGCAGAC TTTAGTGGAT GCGATCGCGG CAGCGGGTGG CCCCACCTAC
GCCTTCATCG ACAACCCGTT CATCACCAAC AATGCTAGCG GTGGTCAGCC CGGGGCCAAC
ATTCGTACTG CCTACCTCTA CGACCCCAGC CGAGTGTCGT TTGTGGAAGA CTCATTGACG
ACCATTGGTA GCCAACAACC TGGTGAAGCC TTTGCGGGCG CTCGTTTACC GCTAGTCGCT
GACTTTGAGT TCAACGGCGA GACGGTGACT CTGGTCAACA ATCACTTCTC CTCGAAGGGG
GGCAGTGCGC CGATTCTAGG GACTGCTCAA CCCTTTGAAA GCCGCCAAGA AGATTCCACA
GTTAATGGCT CGTTGGACGA TCGCCAGGCT CAGTCTCAGG CTGTGCAGGA CTACGTCAAT
GGACGGCTGG GGAACGACCC CACTGCCAAG GTGATTGTGC TGGGCGACTT CAACGAGTTT
GAGTTTGTCT CGCCGGTGCG TGACCTCGCA GCCAACACAG GCCTGACGAA TTTGACTGAG
CTGCTACCCG AGAATGAACG CTACACGTTT ATCTTCCAGG GCAATTCCCA GTCGATTGAC
CATATTTTGG TCAGCGGCAG TTTGGCTAAT CGTGCCGAGA TCGATATCGT CAACGTGAAT
ACCGAGTTTG CTGAGTCGCT CCAAGCCAGT GATCACGATC CGGTACTCGC TCGGTTCACC
ATTGCAGCCC CCAATGTGAT CCGTGGTACC TCACAACGTG ATGTACTCAC GGGCAGCGAT
CGCAACGATA TCTTCCTCGC CAGCGGCGGC CCTGACCGGA TTACCACCGG CGGTGGCCGC
GATCGCATTG TCTTCACCAA CACTAGTCAA ACTGGTGTCA CCATTACTGA CTTTGAGGTG
GGGGTCGACA AGCTAGTCCT GACTGACCTG TTGGCGAGCG TTGGCTACAC CGGCAATAAT
CCCCTGGCGG ATGGCTTGAT CAAAATCCGC AGCTTGGGTA ACAGCGATCG CACCCAGCTT
TCTCTGGAGT TGGATCGTAT TGGCGGTGGG CGCACACAGT TCACAAACTT CCTCACTTTT
GAAGGAGTGA GTGCGGCGGA ACTGAGCAAC TTGGATAACT TTTTCTTCTA G
 
Protein sequence
MATSVFINEF HYDNAGTDAG EFIEILAPLA TSLTGWRLVL YNGANGLSYN TTDLGSLSST 
DLGNGFKAVV VNYPSNGIQN GAPDGFALVD DTGAVVQFLS YEGTFTATNG PALGLTSTDI
GIAQSGTEPL GSSLQLTGSG THYEDFTWAS TTTNTSGALN AGQILGGVVE AIAPRINEFV
FDHVGTDTNE YVEILGAANT DYSAYTLLQI EGDSGSTLGR ITSAQSLGST DASGYWTTGF
LNNVYQNGTQ TLLLVRNFTG SVNQVLDSNG DGVLDITPWD SVADGIAVTD GGLGDRTYTA
AILTPGFGGN SLRVGGASRI PNGTDTDTAA DWVRNDFDLA GLPGFPGTPD QGEALNTPGA
ANAVVDVPPP PPELTAIYTI QGAGQRSALV GQTVTTSGIV TAVDSNGFYL QDATGDGDIA
TSDAIFVFTG STPTVAVGNE IRLSGTVSEF TPGGLSSRNL STTQLGGSLN ITVLSTGNTL
PAAVILGEGG RLLPSSNIDD DAFGSFDPIT DGIDFFESLE GMRVTAQNLL VVNGTNGFGE
IFGVVDNGAG ATGLSDRGTL NISPDDFNPE RVQVQFDSGV FNFDFPDVNV GDGIGAVTGV
LGYDFGNFQI VATEDFSGSI QRGGLQPEVS TLTRGNDQLT VATYNVLNLD PNDNDGDADI
ANGQFAAIAR QIINNLNSPD IIALQEVQDN SGSLNDGVTA ANLTLQTLVD AIAAAGGPTY
AFIDNPFITN NASGGQPGAN IRTAYLYDPS RVSFVEDSLT TIGSQQPGEA FAGARLPLVA
DFEFNGETVT LVNNHFSSKG GSAPILGTAQ PFESRQEDST VNGSLDDRQA QSQAVQDYVN
GRLGNDPTAK VIVLGDFNEF EFVSPVRDLA ANTGLTNLTE LLPENERYTF IFQGNSQSID
HILVSGSLAN RAEIDIVNVN TEFAESLQAS DHDPVLARFT IAAPNVIRGT SQRDVLTGSD
RNDIFLASGG PDRITTGGGR DRIVFTNTSQ TGVTITDFEV GVDKLVLTDL LASVGYTGNN
PLADGLIKIR SLGNSDRTQL SLELDRIGGG RTQFTNFLTF EGVSAAELSN LDNFFF