Gene Synpcc7942_0686 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_0686 
Symbol 
ID3775856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp680978 
End bp682138 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content57% 
IMG OID637799098 
ProductFO synthase subunit 2 
Protein accessionYP_399705 
Protein GI81299497 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR00423] radical SAM domain protein, CofH subfamily
[TIGR03551] 7,8-didemethyl-8-hydroxy-5-deazariboflavin synthase, CofH subunit 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATTGATT CGGCGACCGT CGCTGCCATC CTGGCGTCTG TTTTGGATGG CAAGCCTTTA 
GAACCGGAAG CCGCTACCGT TTTACTCAAA GCCCGCGATC GCTCGCTCCG TCAGCAAATT
CAGGCAGCGG CGAACCAACT CCGTTCCCGA CAGGTGGGCG ATCGCGTCAG CTATGTGATC
AATCGCAATC TCAATTTCAC CAATATTTGC GAGCAGCACT GTAACTTCTG CGCCTTTCGT
CGTGATGCGG ATCAAGACGG TGCTTTCTGG CTAGATGCTT CAATTCTGCT TGAAAAAGGA
GCGGCAGCCG TTGCCGCTGG TGCAACGGAA TTTTGTCTGC AGGGTGGCCT GAATCCAGCG
GCAAAACGCA ACGGGCGATC GCTCGACTTT TATGTCGAGT TGACGGCCAG CCTTAAACAA
GCCTTTCCGC AGATTCATCT CCATGCTTTT TCACCGCAAG AAATTCAGTT TATTGCTCGG
GAGGATGGGC TGAGTTTTCG TGAGGTGTTG ATGGCTTTGC GATCGGCGGG GGTGGGCTCT
TTGCCCGGCA CTGCAGCGGA AGTGTTGGAT GACTCGGTGC GACGGATTCT CTGTCCCGAA
AAATTAGATA GTGCAACCTG GAAAACAATC ATCCAGACTG CGCACCAAGT TGGTTTGCCG
ACGACTAGCA CGCTGCTCAG TGGTCATCTC GAAACGCCCA GTCAGCAGGC TCAGCATCTA
GAACAACTGC GACAACTCCA ACAAGCTGCG ATCGCTGGCG AAACCCCAGC CCGGATCACG
GAGTTCATTC TGCTGCCGTT TGTGGGGGAG CTGGCGCCGG CACCGCTGCG CAAGCGGGTC
AAGCGCGATC AGCCTGATTT ATCCGATGCA CTGTTGGTGA TGGCTGTGGC CAGGCTGTAT
CTGGGCGACT GGATTGCCAA TCACCAACCG AGTTGGGTCA AGCTGGGGTT GGCTGGAGCA
ACCCAAGCCC TCGACTGGGG CTGCAATGAC TTGGGCGGCA CGTTGATGGA GGAGCACATT
ACCAGCATGG CAGGGGCCCA GGGCGGAACG GCTCAAACCG TGGAACAGCT AGAGGCGGCG
ATCGCAGCTG CGGGGCGCCA ACCTTACCAG CGCGATACGC TCTACCGGCC CGTGGCGGTG
GAGGCTGTTC ATGCCGGTTA A
 
Protein sequence
MIDSATVAAI LASVLDGKPL EPEAATVLLK ARDRSLRQQI QAAANQLRSR QVGDRVSYVI 
NRNLNFTNIC EQHCNFCAFR RDADQDGAFW LDASILLEKG AAAVAAGATE FCLQGGLNPA
AKRNGRSLDF YVELTASLKQ AFPQIHLHAF SPQEIQFIAR EDGLSFREVL MALRSAGVGS
LPGTAAEVLD DSVRRILCPE KLDSATWKTI IQTAHQVGLP TTSTLLSGHL ETPSQQAQHL
EQLRQLQQAA IAGETPARIT EFILLPFVGE LAPAPLRKRV KRDQPDLSDA LLVMAVARLY
LGDWIANHQP SWVKLGLAGA TQALDWGCND LGGTLMEEHI TSMAGAQGGT AQTVEQLEAA
IAAAGRQPYQ RDTLYRPVAV EAVHAG