Gene Synpcc7942_0892 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_0892 
SymbolcofG 
ID3774070 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp902109 
End bp903119 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content56% 
IMG OID637799309 
ProductFO synthase subunit 1 
Protein accessionYP_399909 
Protein GI81299701 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR03550] 7,8-didemethyl-8-hydroxy-5-deazariboflavin synthase, CofG subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000000147702 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.000138874 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGAAGCACT TCTCAGATCA AACTTGGCGA TCGCCCCAAC AATCAGCTGT GATCACCTAC 
AGTCCGGCCT ATACCCTCGT TCCCACCTAC GAGTGCTTCA ATCGCTGCAC CTACTGCAAC
TTCCGGCGCG ATCCCGGAAT GGACGAATGG TTGAGCCTAT CGACTGCTCA GCAGCGACTG
CTGACAGTGC GCGATCGCGG AGTTTGTGAA GTTTTGATCC TGAGTGGTGA AGTCCATCCT
CAGAGCCCGC GGCGGGCTGC TTGGCGACAG CGGCTAATTG AGCTGGCGGA GTTGGCTCTA
GACATGGGAT TTTTGCCCCA CACCAATGCT GGTCCCCTGA ACCGAGCAGA GATGATCGCC
TTACAAAATG TCAATGTGTC CTTGGGCTTG ATGCTAGAGC AACTGACACC GCAGCTTCAG
CGCACGGTGC ATCGCGCGGC ACCCAGTAAA GATCCACAGC TACGGCTGCA ACAGCTTGAA
CAGGCCGGAG AGCTAGGTAT TCCCTTCACC ACCGGACTGC TGTTGGGGAT CGGTGAAACA
AGCCGCGATC GCCTTGAGAC ACTTGAAGCG ATCGCGGCTT GTCACGATCG CTGGGGACAT
ATCCAGGAGG TGATCTTGCA GCCGCATAGT CCGGGCCGTC AGCAAGCGAT TCAACACCCT
CCGCTAGCCC CGGATGAGCT GATTGATTGC GTGGCGATCG CTCGGCAAGT TCTACCCACA
TCGATTGCGA TTCAAGTGCC ACCGAATCTT TTGACCCAGC CGCAGCAACT AGCGGACTGT
TTGGGAGCGG GTGCCCGTGA TTTAGGTGGG ATTGTGCCCT ACGACGAAGT GAATCCTGAT
TACCAGCATC ATGATTTGGA TGAGCTTCGA GAAGCACTAG CGCAGCAAGG GTGGCAACTC
CAACCGCGTT TGCCGGTCTA TCCTCACCTA GTTGATCGCT TACCACAGCG ACTGCAGACC
CATGTGGCGG CGTGGCTGAA TCGGTTCAAC TCGCAGTCCA GGTCAAGCTA G
 
Protein sequence
MKHFSDQTWR SPQQSAVITY SPAYTLVPTY ECFNRCTYCN FRRDPGMDEW LSLSTAQQRL 
LTVRDRGVCE VLILSGEVHP QSPRRAAWRQ RLIELAELAL DMGFLPHTNA GPLNRAEMIA
LQNVNVSLGL MLEQLTPQLQ RTVHRAAPSK DPQLRLQQLE QAGELGIPFT TGLLLGIGET
SRDRLETLEA IAACHDRWGH IQEVILQPHS PGRQQAIQHP PLAPDELIDC VAIARQVLPT
SIAIQVPPNL LTQPQQLADC LGAGARDLGG IVPYDEVNPD YQHHDLDELR EALAQQGWQL
QPRLPVYPHL VDRLPQRLQT HVAAWLNRFN SQSRSS