Gene Synpcc7942_2026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_2026 
Symbol 
ID3774213 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp2094451 
End bp2095854 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content56% 
IMG OID637800471 
Productglycosyltransferase 
Protein accessionYP_401043 
Protein GI81300835 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2148] Sugar transferases involved in lipopolysaccharide synthesis 
TIGRFAM ID[TIGR03023] Undecaprenyl-phosphate glucose phosphotransferase
[TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.910937 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.952991 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAAGCC AGCGGTCTTG GATTCGTCCC TACGAGCGAC AACTGCCTTA CCTGCAGCTT 
GCGCTCGACT TGCTGCTGAT TGTGCTCAAT GCAGTCATCG CTTTTCGGCT GCGCTTCGAG
TTCAGCTATT TCCAAGACAA CGCTCGCTAC CTGCTGCCAC TGCTGCTAGC ATTGCCCGCT
GCGGCCATCT ATTTCCCGCT GTTTAACCTC TACGATTCCT GGCGCGGGCG ATCGATGATT
CCACTGATGC TGCGGGTCGC CGCCGCTTGG GGTCTAACGA TCGCTACCAC CGTCACGATC
ATTTTTGCCC TGCACTACGG CAGTTCCTTC TCGCGGCTCT GGTTTACGAC CTGGGCGATC
GGCAGTCTCT ACAGCTTTGG ACTGATTCGC CTGCTCTGGG TGGCCGGTTT ACAGATGATG
CGCCACAAGG GCTGGAACGA ACGCCGTCTC CTCATTGTCG GGGCCGGTGA CCTGGGGCAA
GTACTGACCG ATCGCTTGGC GGCCGCTCGT TGGACCGGAC TGCGCGTGGT TGGGTTCCTC
GACGACAACC CGGAGCTAGA GGGACAAAAC TATCGCGGCA TTCCAATCAC GGCTCAGGTT
GACCAGATTG AAACTTGGAT CGATCGTTTT GACGCCCATG AAGTCTGGTT AGCCTTGCCC
TTACGGGCAC AGGATCGCGT CCAAGAAATT TTGCATCTGT TGCGCCACAG CACGGTGACA
ATCCGCCTAA TACCTGATGT CTTTAGCTTC CGGCTGATCA ACCACGGCTT CACAGAAGTT
TTGGGCGTAC CACTCATTGA CCTCAACGCC TCACCGATGA TGGGCTCCAA TCGCTTCATC
AAGGCGATCG AGGACAAGGT CTTAGCAGGC ATGATTCTGC TACTAGCGAG TCCAGTCATG
CTGGCGATCG CGATCGGCAT CAGGCTGACC TCACCGGGCC CAATCTTCTA TCGCCAGGAG
CGAATTGGCT GGAATGGGCA GCCGTTTATG ATGCTCAAGT TCCGGTCGAT GCCGGTGAAT
TCAGAAGCAA AAGGGGTGCA GTGGGGCAAA GCCTACGCCA AGCCCACCAC ACCGCTGGGA
AGTTTCCTAC GGCGCACTAG TTTGGATGAG TTGCCGCAGT TTATTAACGT CTTGCGCGGT
GAGATGTCGA TCGTCGGTCC TCGTCCCGAG CGATCGCTGT TTGTCGAGCA GTTCAAGGAT
GAAATCCCTG ACTACATGAA GAAACACATG GTCAAGGCAG GAATTACTGG CTGGGCACAG
GTGAACGGGC TGCGCGGCGA CACGGATCTA CGCAAGCGGA TTGAGTACGA CCTCTACTAC
ATTGAGAACT GGTCCCTTGC CTTTGACCTG CAGATTATTG GCATGACGCT GCTCAAAGGC
TTCCTCAGTC GCAATGCGTA CTAG
 
Protein sequence
MPSQRSWIRP YERQLPYLQL ALDLLLIVLN AVIAFRLRFE FSYFQDNARY LLPLLLALPA 
AAIYFPLFNL YDSWRGRSMI PLMLRVAAAW GLTIATTVTI IFALHYGSSF SRLWFTTWAI
GSLYSFGLIR LLWVAGLQMM RHKGWNERRL LIVGAGDLGQ VLTDRLAAAR WTGLRVVGFL
DDNPELEGQN YRGIPITAQV DQIETWIDRF DAHEVWLALP LRAQDRVQEI LHLLRHSTVT
IRLIPDVFSF RLINHGFTEV LGVPLIDLNA SPMMGSNRFI KAIEDKVLAG MILLLASPVM
LAIAIGIRLT SPGPIFYRQE RIGWNGQPFM MLKFRSMPVN SEAKGVQWGK AYAKPTTPLG
SFLRRTSLDE LPQFINVLRG EMSIVGPRPE RSLFVEQFKD EIPDYMKKHM VKAGITGWAQ
VNGLRGDTDL RKRIEYDLYY IENWSLAFDL QIIGMTLLKG FLSRNAY