Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Synpcc7942_2480 |
Symbol | |
ID | 3774500 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus elongatus PCC 7942 |
Kingdom | Bacteria |
Replicon accession | NC_007604 |
Strand | - |
Start bp | 2560946 |
End bp | 2561848 |
Gene Length | 903 bp |
Protein Length | 300 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 637800929 |
Product | prolyl 4-hydroxylase, alpha subunit |
Protein accession | YP_401497 |
Protein GI | 81301289 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG3751] Predicted proline hydroxylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.019488 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0000292929 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCACAAAT TAAGTTCTAT TCTTGTGAAT TTTGACTTGA AGCCTCTAAA TTTGATAAAA ACTATTTATG ATTTCAGTCG ACAAGAGGTT CTTTTAACTG AAACTGAGTT GTTTCAGTTA CTCCAAAGCT TTGAGGACTG TCCACCTCGT GAGAGTCTAC AAAGTTCCTC TTTTCAGTGG AGGAACCTGT TGTTTTCTCA GGTGTCAGCG GGTAATACGA CTTACTGGTA TTCCCAACAA ATACCGATTC AAATCAATTC AAATTTAAGT GGTATTGCCA ACCATCTAGC TGTAGAGTCT TCAACAGGGA AGCCAGATAT TTTATATTGG CGAATCCTTG ATTTTTTAAG CCCTGAAAAG CTACAACAGC TTTGGAATTA TTTACTAACT GCCCGATCGC AATTTAATCC GGCTCACAAC TCCGCAGGTT TGAATAACTA TCGACAATCT CTTTTTACTG CTCCTCCTCC TGAAATTTAT TCCGAGATCA GTGAAAAGAT TTTAGGTGCT TTGATACCAA TTGCCGATGA GCTGCCCAAT TCCTCACAAG AAATTGGCGA GATAGAGATG CAAATAACAG CTCACAACGA TGGTCATTAT TACAAAATTC ATAATGATAA CGGCAGTCCT GATACAGCTA CACGTTTTCT CACTTACGTT TACTACTTCT ATCGACAACC CAAACCATTT ACTGGTGGCG AGCTGCGACT GTATGAACTT GCTATCAAAG ATGGCTTTTA CGTTGCAGGC GATCGCTATC AAGACATTGA ACCGCTACAC AATAGCCTGA TTGTTTTCCC AAGCCACTAC ATGCACGAAG TTCTACCGAT TCGATGTCCT TCTCAGCGAT TTGAGGATAG CCGCTTTACA GTTAATGGTT GGATTCGAGT TGCTATTGAC TGA
|
Protein sequence | MHKLSSILVN FDLKPLNLIK TIYDFSRQEV LLTETELFQL LQSFEDCPPR ESLQSSSFQW RNLLFSQVSA GNTTYWYSQQ IPIQINSNLS GIANHLAVES STGKPDILYW RILDFLSPEK LQQLWNYLLT ARSQFNPAHN SAGLNNYRQS LFTAPPPEIY SEISEKILGA LIPIADELPN SSQEIGEIEM QITAHNDGHY YKIHNDNGSP DTATRFLTYV YYFYRQPKPF TGGELRLYEL AIKDGFYVAG DRYQDIEPLH NSLIVFPSHY MHEVLPIRCP SQRFEDSRFT VNGWIRVAID
|
| |