Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Synpcc7942_0892 |
Symbol | cofG |
ID | 3774070 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus elongatus PCC 7942 |
Kingdom | Bacteria |
Replicon accession | NC_007604 |
Strand | - |
Start bp | 902109 |
End bp | 903119 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637799309 |
Product | FO synthase subunit 1 |
Protein accession | YP_399909 |
Protein GI | 81299701 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR03550] 7,8-didemethyl-8-hydroxy-5-deazariboflavin synthase, CofG subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000000147702 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.000138874 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGAAGCACT TCTCAGATCA AACTTGGCGA TCGCCCCAAC AATCAGCTGT GATCACCTAC AGTCCGGCCT ATACCCTCGT TCCCACCTAC GAGTGCTTCA ATCGCTGCAC CTACTGCAAC TTCCGGCGCG ATCCCGGAAT GGACGAATGG TTGAGCCTAT CGACTGCTCA GCAGCGACTG CTGACAGTGC GCGATCGCGG AGTTTGTGAA GTTTTGATCC TGAGTGGTGA AGTCCATCCT CAGAGCCCGC GGCGGGCTGC TTGGCGACAG CGGCTAATTG AGCTGGCGGA GTTGGCTCTA GACATGGGAT TTTTGCCCCA CACCAATGCT GGTCCCCTGA ACCGAGCAGA GATGATCGCC TTACAAAATG TCAATGTGTC CTTGGGCTTG ATGCTAGAGC AACTGACACC GCAGCTTCAG CGCACGGTGC ATCGCGCGGC ACCCAGTAAA GATCCACAGC TACGGCTGCA ACAGCTTGAA CAGGCCGGAG AGCTAGGTAT TCCCTTCACC ACCGGACTGC TGTTGGGGAT CGGTGAAACA AGCCGCGATC GCCTTGAGAC ACTTGAAGCG ATCGCGGCTT GTCACGATCG CTGGGGACAT ATCCAGGAGG TGATCTTGCA GCCGCATAGT CCGGGCCGTC AGCAAGCGAT TCAACACCCT CCGCTAGCCC CGGATGAGCT GATTGATTGC GTGGCGATCG CTCGGCAAGT TCTACCCACA TCGATTGCGA TTCAAGTGCC ACCGAATCTT TTGACCCAGC CGCAGCAACT AGCGGACTGT TTGGGAGCGG GTGCCCGTGA TTTAGGTGGG ATTGTGCCCT ACGACGAAGT GAATCCTGAT TACCAGCATC ATGATTTGGA TGAGCTTCGA GAAGCACTAG CGCAGCAAGG GTGGCAACTC CAACCGCGTT TGCCGGTCTA TCCTCACCTA GTTGATCGCT TACCACAGCG ACTGCAGACC CATGTGGCGG CGTGGCTGAA TCGGTTCAAC TCGCAGTCCA GGTCAAGCTA G
|
Protein sequence | MKHFSDQTWR SPQQSAVITY SPAYTLVPTY ECFNRCTYCN FRRDPGMDEW LSLSTAQQRL LTVRDRGVCE VLILSGEVHP QSPRRAAWRQ RLIELAELAL DMGFLPHTNA GPLNRAEMIA LQNVNVSLGL MLEQLTPQLQ RTVHRAAPSK DPQLRLQQLE QAGELGIPFT TGLLLGIGET SRDRLETLEA IAACHDRWGH IQEVILQPHS PGRQQAIQHP PLAPDELIDC VAIARQVLPT SIAIQVPPNL LTQPQQLADC LGAGARDLGG IVPYDEVNPD YQHHDLDELR EALAQQGWQL QPRLPVYPHL VDRLPQRLQT HVAAWLNRFN SQSRSS
|
| |