Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Synpcc7942_1783 |
Symbol | |
ID | 3774358 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus elongatus PCC 7942 |
Kingdom | Bacteria |
Replicon accession | NC_007604 |
Strand | + |
Start bp | 1849303 |
End bp | 1852170 |
Gene Length | 2868 bp |
Protein Length | 955 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637800224 |
Product | hypothetical protein |
Protein accession | YP_400800 |
Protein GI | 81300592 |
COG category | [S] Function unknown |
COG ID | [COG1615] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.890136 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.25047 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTCGCC ACTTAAGCCG CTGGGGACTT GCGATCGCGG CGATCGCCCT AGGACTGAGC TTGCTGACAC GCATCCACAT TGAAACCCTG TGGTTTACAG CCCTAGGCAT CCCGACTGTG TTTTTGCGGC GACTAGCGGT ACAAGCGCTG CTGTTCAGCG TTGTAGGCAT TGCGATCACA GGACTGATTG GCGGGAACTT GCGCTGGGCG GCGCGCCATC AAACAGATCA ACCCGATCGC CCGGCACCTC GCCTGCAGTT GGGCGGTCTG CTCACGGTCT TAACGCTACT CTGGATTGCC CTACTAGCCT TGACGACCCA AGCCATCCTT GCCGCTTGGA ACTGTCAGAC TGGCCGCGCG CTGCCATTCC TGCCGCAAAT CCTGACGCTG GACTGGCTGC AATCCTCACT GATCACAGCA GGGTCTTGGC CTCTAGGGAT GGGGTTGCTC CTAGGCGTGG GTAGCTTAGT CCTCTTTCTC TGGCGACCTT GGCCCCTCCT GATCGGCTTA TCAAGCCTGA CCAGTCTGGC GATCGCCCTG TTGACCTCGC GGGAATGGCT GCGCATTTGG CCCGCTTTTG CCGCCGAAAG CGTCAGCGAT CGCGATCCGA TTTTCCAGCA GGATCTTGCC TTTTATCTGT TTCGACTACC AGCCCTAGAG GTGCTGCAGT TTGACCTCTG GATTGGGCTG GCCTTTAGTT TTTGCGCGGT TCTGGCTGTC TATTACCTCG CCAAGCAGAG TGTCAGCAAC GCTGAGTTTC GCGGCTTTGC CCCTAGTCAA CAGCGCCATC TAGTGCGATT GGCGATCGCG ATCGCCCTGT TCCTAGCAGG ACATTGCTGG CTGGCCCAAC GGCAGTTGCT GTTTTCCGAA TTGGGGGCGG TCTATGGCAT CGGCTTCACC GATCGCTGGG TCAAACTACC GCTGCTACAG GTTTGGATGA TTCTGTTTGG CATCGCCGCG ATCGCCCTGT TCTGGCAAAG TCGGCGGGGG CTTCTGCCTC AGCGCTGGAT TCGCAACTTT CAACTAGCAG CGATCGCCAG CGTCCTGATC TGGGTGACCC TACCCGCGAT CGTGCAGCAA CTGGTGGTGC AACCCAACGA AATTGCTCGC GAGCTGCCTT ATCTGAAGCA GGCCATTCTG TTTACCCGCC GAGCTTTTGG CCTCGATCAA ATTGAAACGC GCACGTTTGA CCCCCAGCCC AGCCTCAACC GCGCCGTCCT GGCGGCTAAT CGCGAAACTG TTCAGAATAT TCGCCTTTGG GATACGCGGC CGCTCCTGCA GAGCAATCGG CAGTTGCAGC AAATCCGCCT CTACTACAGC TTCCCCAGCG CCCAAATCGA TCGCTATCGC CTTCAGACCA GCTTTGGGGA TGCTCTGCAG CAAGTGATTA TTGCAGCACG CGAGCTGGAC TACACCGCCA TTCCCGCCGC CGCCAAAACC TGGGTCAATG AACACCTGGT CTACACCCAT GGCTATGGCT TCACCCTTAG CCCAGTCAAC AGCAGCGCTC CCGATGGGTT GCCTCGCTAC TTCGTCAAGG ACATCGGGGC CAACACCCGC ATCATTGGCG ACGCCAGCTT GGGAATCAGC ACCGAAGCTG TCAAAGCTGC GATATCGACT GAAAACCCGC GCATTTACTA CGGCCAGCTC ACCCGCAATT ACGTTTTCAC GCCCAGTCGC ACCCAAGAGC TGGACTATCC CAGCGGCAAC GATAACGTCT ACAACATCTA CGACGGCAAG GGTGGCGTCA CACTGGGCAA CTATGCGCAG CGACTGCTCT TTTCCTTGTA TCTACGGGAC TGGCGCCTTC CCTTTTCCGG AGATTTAACC GCCCAGACTC GCGTCCTGTT CCGCCGCCAA ATTGAAGACC GCGTCCGAGC GATCGCACCA TTTTTGCGCT ACGACGCGGA GCCCTACCTG GTCTCTGTCA ACGCCGACAC AGCTGAGGCC TCGGGCTTGG GGCGCAGTTC CCTATTTTGG ATTTTGGACG CCTACACCGT TAGCGATCGC TATCCCTACG CTGATCCGGG CGAGCAACCC TTCAACTACA TCCGCAACTC AGTCAAAGTG ATCATCGATG CCTACAACGG CAGCGTCCAG TTCTATATCG TTGACCCCAA GGATCCACTG ATTCAAACCT GGTCCAGACT CTTCCCGTCG CTGTTCCAGC CGATTGATGC GATGGCGCCA GTACTGCGAT CGCATTTGCG CTATCCCACA GACCTCTTCA AGGCTCAGTC GTCGCAACTG CTGACCTACC ACGTACTCGA TCCACAGGTT TTCTACAACC GCGATGACCA GTGGGCCTAC CCACGCGAAA TCTACGCTGG CGAGACGGCA ACCGTTCAGC CCTACTACTT AATTACGCGC TTACCGACCG CTGCGAGCGA AGAGTTCCTG ATCCTGACGC CCTTCACACC ATTGGGGCGT AACAATATGA TTGCTTGGCT GGCGGGGCGA TCAGACGGCG AGGAATACGG TCGCCTCTTG CTCTACGAAT TTCCGCGCCA GCGCCTCATT TTTGGTCCCG AGCAAATCAC GGCTCGCATC AACCAAGATC CTCAGATTTC TGAGCAAATT ACGCTCTGGA ACCGCGAAGG CTCCCGCGCT GCCGAAGGCA ACTTGCTGGT TATTCCCATT GACCAAGCGC TGCTCTACGT TGAGCCACTC TACTTAGAAG CCTCGCGCAA TAGCTTGCCA GCGCTCACCC GCGTCATTAC GGCCTACCAA GATCGCATTG TGATGACTCC CAGCCTCCTC GAAAGCCTGC AAAAACTGTT TCCAGACTCA ACACCCGCCC TGACGCCGCT TGAACAACCG GTGTTGACGA CCGAGCAGTC CGCTGTCCTC AACCCTGACC AGCCCTAG
|
Protein sequence | MPRHLSRWGL AIAAIALGLS LLTRIHIETL WFTALGIPTV FLRRLAVQAL LFSVVGIAIT GLIGGNLRWA ARHQTDQPDR PAPRLQLGGL LTVLTLLWIA LLALTTQAIL AAWNCQTGRA LPFLPQILTL DWLQSSLITA GSWPLGMGLL LGVGSLVLFL WRPWPLLIGL SSLTSLAIAL LTSREWLRIW PAFAAESVSD RDPIFQQDLA FYLFRLPALE VLQFDLWIGL AFSFCAVLAV YYLAKQSVSN AEFRGFAPSQ QRHLVRLAIA IALFLAGHCW LAQRQLLFSE LGAVYGIGFT DRWVKLPLLQ VWMILFGIAA IALFWQSRRG LLPQRWIRNF QLAAIASVLI WVTLPAIVQQ LVVQPNEIAR ELPYLKQAIL FTRRAFGLDQ IETRTFDPQP SLNRAVLAAN RETVQNIRLW DTRPLLQSNR QLQQIRLYYS FPSAQIDRYR LQTSFGDALQ QVIIAARELD YTAIPAAAKT WVNEHLVYTH GYGFTLSPVN SSAPDGLPRY FVKDIGANTR IIGDASLGIS TEAVKAAIST ENPRIYYGQL TRNYVFTPSR TQELDYPSGN DNVYNIYDGK GGVTLGNYAQ RLLFSLYLRD WRLPFSGDLT AQTRVLFRRQ IEDRVRAIAP FLRYDAEPYL VSVNADTAEA SGLGRSSLFW ILDAYTVSDR YPYADPGEQP FNYIRNSVKV IIDAYNGSVQ FYIVDPKDPL IQTWSRLFPS LFQPIDAMAP VLRSHLRYPT DLFKAQSSQL LTYHVLDPQV FYNRDDQWAY PREIYAGETA TVQPYYLITR LPTAASEEFL ILTPFTPLGR NNMIAWLAGR SDGEEYGRLL LYEFPRQRLI FGPEQITARI NQDPQISEQI TLWNREGSRA AEGNLLVIPI DQALLYVEPL YLEASRNSLP ALTRVITAYQ DRIVMTPSLL ESLQKLFPDS TPALTPLEQP VLTTEQSAVL NPDQP
|
| |