Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Synpcc7942_1000 |
Symbol | |
ID | 3773927 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus elongatus PCC 7942 |
Kingdom | Bacteria |
Replicon accession | NC_007604 |
Strand | - |
Start bp | 1008139 |
End bp | 1011369 |
Gene Length | 3231 bp |
Protein Length | 1076 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637799419 |
Product | hypothetical protein |
Protein accession | YP_400017 |
Protein GI | 81299809 |
COG category | [R] General function prediction only |
COG ID | [COG2374] Predicted extracellular nuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGACAT CCGTTTTTAT TAACGAGTTT CACTACGACA ATGCCGGCAC GGATGCCGGT GAATTTATTG AAATCTTGGC GCCTCTGGCC ACTAGCCTGA CTGGCTGGCG CCTAGTGCTC TACAACGGGG CTAACGGTCT GAGTTACAAC ACTACAGACT TAGGCAGCCT CTCGAGTACT GACCTTGGGA ATGGCTTCAA AGCAGTCGTG GTCAACTACC CCAGCAATGG GATTCAAAAT GGCGCACCCG ACGGCTTCGC CCTCGTAGAT GACACAGGCG CCGTGGTGCA GTTTCTCAGC TATGAGGGCA CCTTCACCGC CACAAACGGT CCTGCATTAG GCTTGACCAG TACTGATATT GGCATTGCCC AGAGCGGTAC TGAACCCCTG GGATCCTCCC TCCAGTTGAC GGGCAGCGGC ACCCACTACG AAGACTTTAC CTGGGCTAGC ACGACTACCA ACACCAGCGG TGCCCTTAAT GCGGGCCAGA TCTTGGGTGG GGTTGTCGAG GCGATCGCTC CTCGCATCAA CGAATTTGTG TTCGACCACG TTGGCACCGA CACCAACGAA TATGTCGAAA TTTTGGGTGC GGCAAACACG GACTACTCTG CCTACACCCT GCTGCAAATT GAGGGGGACA GCGGTAGCAC CCTCGGCCGG ATTACCAGCG CCCAGTCCTT GGGCAGCACT GATGCCAGTG GCTACTGGAC CACGGGCTTT TTGAATAATG TTTACCAAAA TGGCACGCAG ACTTTGTTGC TGGTGAGGAA CTTTACCGGC AGTGTCAACC AAGTGCTCGA TAGCAACGGC GACGGGGTGC TGGATATTAC CCCTTGGGAT AGTGTTGCCG ATGGAATAGC AGTAACTGAT GGTGGCCTGG GCGATCGCAC CTACACTGCC GCTATCCTCA CCCCCGGCTT TGGCGGCAAT AGCCTCCGCG TCGGTGGCGC TTCCCGAATC CCCAACGGGA CTGATACCGA TACCGCTGCT GACTGGGTGC GCAATGACTT TGACCTTGCA GGGCTACCGG GCTTTCCCGG AACACCGGAT CAAGGCGAGG CGTTGAACAC GCCCGGAGCG GCTAATGCGG TGGTTGATGT GCCACCGCCA CCGCCGGAAC TGACGGCAAT TTATACCATT CAGGGGGCTG GGCAACGGTC TGCCCTCGTT GGCCAGACTG TCACGACCTC GGGTATTGTC ACGGCTGTCG ACAGCAACGG CTTCTACCTC CAAGACGCCA CCGGCGATGG CGATATTGCC ACGTCTGACG CCATCTTTGT CTTCACGGGC AGCACCCCCA CCGTGGCTGT GGGTAACGAA ATCCGCCTGT CTGGGACCGT GTCGGAGTTT ACTCCCGGTG GCCTTTCTAG CCGCAATCTT TCTACCACCC AGCTTGGCGG CAGCCTCAAC ATCACAGTTC TGAGCACCGG CAACACCCTG CCAGCAGCGG TCATTCTGGG CGAAGGCGGT CGTTTACTCC CCTCATCTAA TATCGACGAT GATGCCTTCG GCAGCTTTGA CCCAATCACC GATGGCATCG ACTTCTTCGA GTCGCTGGAG GGGATGCGGG TCACAGCTCA AAACCTACTG GTGGTCAATG GTACCAACGG CTTTGGCGAA ATCTTTGGGG TAGTCGACAA CGGTGCGGGG GCGACGGGTC TGAGCGATCG CGGCACCCTC AACATTAGCC CTGACGACTT TAACCCCGAG CGGGTGCAGG TGCAGTTCGA CAGCGGTGTT TTCAACTTCG ACTTCCCCGA TGTCAACGTG GGCGATGGCA TCGGCGCGGT TACCGGCGTG CTGGGCTACG ACTTTGGGAA TTTCCAGATT GTCGCTACCG AAGACTTCAG CGGCAGCATC CAACGGGGTG GTTTACAACC GGAGGTCAGC ACACTCACCC GTGGCAATGA CCAGTTGACG GTTGCCACCT ACAACGTACT CAACCTCGAC CCCAACGATA ACGATGGCGA TGCCGACATT GCCAATGGCC AGTTTGCAGC GATCGCCCGC CAAATCATCA ATAACCTCAA TTCCCCGGAC ATCATTGCCC TGCAGGAAGT TCAAGACAAC AGTGGCAGCC TCAATGATGG GGTAACGGCG GCGAACCTGA CTCTGCAGAC TTTAGTGGAT GCGATCGCGG CAGCGGGTGG CCCCACCTAC GCCTTCATCG ACAACCCGTT CATCACCAAC AATGCTAGCG GTGGTCAGCC CGGGGCCAAC ATTCGTACTG CCTACCTCTA CGACCCCAGC CGAGTGTCGT TTGTGGAAGA CTCATTGACG ACCATTGGTA GCCAACAACC TGGTGAAGCC TTTGCGGGCG CTCGTTTACC GCTAGTCGCT GACTTTGAGT TCAACGGCGA GACGGTGACT CTGGTCAACA ATCACTTCTC CTCGAAGGGG GGCAGTGCGC CGATTCTAGG GACTGCTCAA CCCTTTGAAA GCCGCCAAGA AGATTCCACA GTTAATGGCT CGTTGGACGA TCGCCAGGCT CAGTCTCAGG CTGTGCAGGA CTACGTCAAT GGACGGCTGG GGAACGACCC CACTGCCAAG GTGATTGTGC TGGGCGACTT CAACGAGTTT GAGTTTGTCT CGCCGGTGCG TGACCTCGCA GCCAACACAG GCCTGACGAA TTTGACTGAG CTGCTACCCG AGAATGAACG CTACACGTTT ATCTTCCAGG GCAATTCCCA GTCGATTGAC CATATTTTGG TCAGCGGCAG TTTGGCTAAT CGTGCCGAGA TCGATATCGT CAACGTGAAT ACCGAGTTTG CTGAGTCGCT CCAAGCCAGT GATCACGATC CGGTACTCGC TCGGTTCACC ATTGCAGCCC CCAATGTGAT CCGTGGTACC TCACAACGTG ATGTACTCAC GGGCAGCGAT CGCAACGATA TCTTCCTCGC CAGCGGCGGC CCTGACCGGA TTACCACCGG CGGTGGCCGC GATCGCATTG TCTTCACCAA CACTAGTCAA ACTGGTGTCA CCATTACTGA CTTTGAGGTG GGGGTCGACA AGCTAGTCCT GACTGACCTG TTGGCGAGCG TTGGCTACAC CGGCAATAAT CCCCTGGCGG ATGGCTTGAT CAAAATCCGC AGCTTGGGTA ACAGCGATCG CACCCAGCTT TCTCTGGAGT TGGATCGTAT TGGCGGTGGG CGCACACAGT TCACAAACTT CCTCACTTTT GAAGGAGTGA GTGCGGCGGA ACTGAGCAAC TTGGATAACT TTTTCTTCTA G
|
Protein sequence | MATSVFINEF HYDNAGTDAG EFIEILAPLA TSLTGWRLVL YNGANGLSYN TTDLGSLSST DLGNGFKAVV VNYPSNGIQN GAPDGFALVD DTGAVVQFLS YEGTFTATNG PALGLTSTDI GIAQSGTEPL GSSLQLTGSG THYEDFTWAS TTTNTSGALN AGQILGGVVE AIAPRINEFV FDHVGTDTNE YVEILGAANT DYSAYTLLQI EGDSGSTLGR ITSAQSLGST DASGYWTTGF LNNVYQNGTQ TLLLVRNFTG SVNQVLDSNG DGVLDITPWD SVADGIAVTD GGLGDRTYTA AILTPGFGGN SLRVGGASRI PNGTDTDTAA DWVRNDFDLA GLPGFPGTPD QGEALNTPGA ANAVVDVPPP PPELTAIYTI QGAGQRSALV GQTVTTSGIV TAVDSNGFYL QDATGDGDIA TSDAIFVFTG STPTVAVGNE IRLSGTVSEF TPGGLSSRNL STTQLGGSLN ITVLSTGNTL PAAVILGEGG RLLPSSNIDD DAFGSFDPIT DGIDFFESLE GMRVTAQNLL VVNGTNGFGE IFGVVDNGAG ATGLSDRGTL NISPDDFNPE RVQVQFDSGV FNFDFPDVNV GDGIGAVTGV LGYDFGNFQI VATEDFSGSI QRGGLQPEVS TLTRGNDQLT VATYNVLNLD PNDNDGDADI ANGQFAAIAR QIINNLNSPD IIALQEVQDN SGSLNDGVTA ANLTLQTLVD AIAAAGGPTY AFIDNPFITN NASGGQPGAN IRTAYLYDPS RVSFVEDSLT TIGSQQPGEA FAGARLPLVA DFEFNGETVT LVNNHFSSKG GSAPILGTAQ PFESRQEDST VNGSLDDRQA QSQAVQDYVN GRLGNDPTAK VIVLGDFNEF EFVSPVRDLA ANTGLTNLTE LLPENERYTF IFQGNSQSID HILVSGSLAN RAEIDIVNVN TEFAESLQAS DHDPVLARFT IAAPNVIRGT SQRDVLTGSD RNDIFLASGG PDRITTGGGR DRIVFTNTSQ TGVTITDFEV GVDKLVLTDL LASVGYTGNN PLADGLIKIR SLGNSDRTQL SLELDRIGGG RTQFTNFLTF EGVSAAELSN LDNFFF
|
| |