Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Synpcc7942_0750 |
Symbol | |
ID | 3775923 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus elongatus PCC 7942 |
Kingdom | Bacteria |
Replicon accession | NC_007604 |
Strand | + |
Start bp | 743738 |
End bp | 746590 |
Gene Length | 2853 bp |
Protein Length | 950 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637799165 |
Product | Phage tail tape measure protein TP901, core region |
Protein accession | YP_399769 |
Protein GI | 81299561 |
COG category | [S] Function unknown |
COG ID | [COG5283] Phage-related tail protein |
TIGRFAM ID | [TIGR01760] phage tail tape measure protein, TP901 family, core region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.173353 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGGTA AAGCCTTTCA GGTAACGCTT GAAATTGGGG GGCGCGTTGC AGCGTCCCTC GGTTCGTCGA TTGCCCGTGC CCAAGGCCAG CTCAACACAC TGGCCAGGGC CACCAAGACA GGCTTTATGG GCGTCGTGCG CAACGACGCT TTTCAGGCGT TGGCAGCGGC TAGCGCGGCG GTTGGCGCGG GCTTGATCTA CTCGACCAAG CAAGCTGTTG CCTTCGAGTC TCAGCTGGCT GACATCGGCA AGACGGCAGG CTCTAGCGCG GCAGAACTCA AGGCGTTGGG GGCTGACCTA CTGGCGCTCA GCGCCCGCGA TCGCACGAAC CAATCAGCCA GCAATCTCGC CTCTGGCATC CAAGACCTAG TGGCCCAAGG CCTCGAACTA AAGGATGCGA TCGCCAGCAT TGAAACCCTA GGTCGAGTAG CGACGGCGAC CAACTCGAAC CTGACCGACA TCACTAAGAC CGGCTTTCAG CTCCAGAATG CGCTGAAGAT CAAGCCGACT GAACTCAAGG CGACGTTCGA CGCACTGGCC TACGCGGGCA AGCAAGGGGC GTTCGAGCTG AAGGACATGG CGCAGTTCAT GCCGACGATC GCTGCGGCGG CTGGTTCTCT GGGGGTGACA GGCAGAGAAG GCGCGGTCAG TCTGGCCGCC ATGATGCAGA TGGTGCGACG CGATGCGCCG GACGCTGGGC AGGCTGCCAC GCGGTTGACA GACGCAATGC TGAAGATGAC CGCACCGGAT GCGGTGAAGC GCTTCAAGAA GTTTGGCGTC AACATCGAGC AGGTGCTGAA AGACGCCAAG GCCAACGGCA TCAATCCGAT GGAAGCGGCA GTCGAAACCC TGTTCAAAGT GACGGGTGGC GACACGTTTA AGCTCGGACA GATCTTCGGT GACAAGGAAG CAAAGCTGGC TCTAATGAGC CTGATGAAGT ATCGCGCCGA GTACACCAAG CTGCGCGATG ACGCGGGCGG CTCGATTGCG GCAGGCACTG TTGACGCTGA CTATCAGCGA TCGCTGGGTA CCTTCGCGGA ACAGATGAAG GCGCTGCAAA ACACGGGCGA GCGGCTGGCG ATCTCGATCG GCACGGCGCT GCTGCCGTCG CTGAACTCAC TGGCTAACGT GGTCACGCCA GTGATTGAAG GTATGGCCCG CTGGGCTGAA ACCAACCCCG GGCTGATGAA AGGCATCGTG GCGATCGCGG GGCTGACGGT GGGCTTAACT GCTGCCCTGC CCCTCATTGG TGCCGTGGTG GCAGCCATCG GTGTGATTGG TGGCCCGATC ACCTTGGCCG TGCTGGGAAT TGGTGCGGCG ATCGCCCTAG TCATTGCGTA CTGGGACGAC CTCAAGCAGG TTGCTCTCGG ATTCTGGGGT TCAGTGAAGC AAGCGGGCGC GAGTGACCTG TTCCAAGGCA TCCGGCAGGG ACTCACCGGG GTGATGACGC TACTCAGTGA GGCCAAGCGC TTCTGGGTGG CTTTGTTCTC CGGCAACGAG CAGGAAGTCG CGGCCGCAGC CCAGCGGATC GGGCAGACGA TCGGCGCGGT CATCCTGCCA GCCGTGGCGC AAGTCTGGCT GTCGATCGGG CGCATGGCCG CGATGGGTGT CGTCGCGCTC GGTCGGTCGT TCATCACCGG GCTAGCCGGT GCGATGAGGG CCATGCCCGG AATCGTGATG GGCGCCGCTC GCATGGCGGG CACTGTCCTC GTGACGCTCA TGCAAGCAGC GATCGCGACG GTCGGCTCCC TACTGCAACA GCTACCCGGC ATCGCCGGAT CCGCGTTTGC GGGCATCACG AGCCAGTTCA GGGCGGTCTG GGATCAAGCG CTCGGTGTAG TGCGATCGTT CCTGCCCCAG ATGGGCGCGA TCTTGTTCCC ACTGCCCACG TTGGTCATCG GCATCTTCCA GAAGATCGTC CCTGGCATCG CGTCAGTCTT TGCTCAGATG GTCGCCCAGA TTCAGGGCGC GTTCCAGCAG GTCGTCGCGT TCATCCGAAG CGTGCCGTCA ATGCTGGCCG GCGTCGGTGA GGCGATCATC CAAACCATCA TCGACGGGGT GAAGGCAAAG GCTGGCGAGC TGCTGGCGAC CGTGCAGCAG AGCTTCGCCA GGGTGCGCGA GCTGATGCCG TTCAGCGACG CGAAGCGCGG CCCGTTCTCG ACCCTCACCA AATCCGGCAT GGCGATCCCC GGCACGCTCG GGATTGGCGT GCGTCGCGGT GCAGGTTTGC TCCGGCGTCC GCTCGTTGCG GCTGCGACGG CGGCGATGGC CGCGATGGGT GCGGTGCAAG CCCCAGCGAT CGCTATAGCA GCGCCGACCC TGCCCCAACC CTTGCCAGCA CTCACTCAGC CCGCGATGGG TGCGCCGAAA ACTGGCCCGG TTCTGGCCCG GCAGGCCGTC CCGCTGCAGA CCGAGAATGC AGGCTCGCAG CAGCTTGTCG CGTCGATTTC CCCTGCAATA GCGCTGCAGG TTCCCGCGCC CGCGATCGCG ACTCCGGCTC CGATCAGCGT CCCTGCTCCG CAGATTGTTT CGCAGCCGTC GATTGCCCTG CCGGCCCCAA CGATCGTGGC TCAAGCCTCT GTGGCAACGC CCGAGGCCCG CTTTGCCCTG ACTGAGGTCA GGATCCCCGA GGTCAATAGC ACACCCACGA TCGCCGCGCC GATGCCCGCG CCGGTCGTGG TTTCGCCTGC GCCTCGAAGC GATCGGCGCG CCCCGATCAA CATCACTGCA CCGATCACAA TCAACGCTGG TCCAGGGCAG GACGCGCGTA GTATCGCCGC CCAAGTGCGC CAGGTGTTCG ATGACCTGAT GCGCGAGGCC GAACTCAACC AGCGTGCTGC TCTAAACGAC TGA
|
Protein sequence | MAGKAFQVTL EIGGRVAASL GSSIARAQGQ LNTLARATKT GFMGVVRNDA FQALAAASAA VGAGLIYSTK QAVAFESQLA DIGKTAGSSA AELKALGADL LALSARDRTN QSASNLASGI QDLVAQGLEL KDAIASIETL GRVATATNSN LTDITKTGFQ LQNALKIKPT ELKATFDALA YAGKQGAFEL KDMAQFMPTI AAAAGSLGVT GREGAVSLAA MMQMVRRDAP DAGQAATRLT DAMLKMTAPD AVKRFKKFGV NIEQVLKDAK ANGINPMEAA VETLFKVTGG DTFKLGQIFG DKEAKLALMS LMKYRAEYTK LRDDAGGSIA AGTVDADYQR SLGTFAEQMK ALQNTGERLA ISIGTALLPS LNSLANVVTP VIEGMARWAE TNPGLMKGIV AIAGLTVGLT AALPLIGAVV AAIGVIGGPI TLAVLGIGAA IALVIAYWDD LKQVALGFWG SVKQAGASDL FQGIRQGLTG VMTLLSEAKR FWVALFSGNE QEVAAAAQRI GQTIGAVILP AVAQVWLSIG RMAAMGVVAL GRSFITGLAG AMRAMPGIVM GAARMAGTVL VTLMQAAIAT VGSLLQQLPG IAGSAFAGIT SQFRAVWDQA LGVVRSFLPQ MGAILFPLPT LVIGIFQKIV PGIASVFAQM VAQIQGAFQQ VVAFIRSVPS MLAGVGEAII QTIIDGVKAK AGELLATVQQ SFARVRELMP FSDAKRGPFS TLTKSGMAIP GTLGIGVRRG AGLLRRPLVA AATAAMAAMG AVQAPAIAIA APTLPQPLPA LTQPAMGAPK TGPVLARQAV PLQTENAGSQ QLVASISPAI ALQVPAPAIA TPAPISVPAP QIVSQPSIAL PAPTIVAQAS VATPEARFAL TEVRIPEVNS TPTIAAPMPA PVVVSPAPRS DRRAPINITA PITINAGPGQ DARSIAAQVR QVFDDLMREA ELNQRAALND
|
| |