Gene Synpcc7942_2151 details


Gene Information

Locus tag: Synpcc7942_2151
Symbol:
ID: 3773708
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Synechococcus elongatus PCC 7942
Kingdom: Bacteria
Replicon accession: NC_007604
Strand:
Start bp: 2234141
End bp: 2236354
Gene Length: 2214 bp
Protein Length: 737 aa
Translation table: 11
GC content: 44%
IMG OID: 637800596
Product: cellulose synthase (UDP-forming)
Protein accession: YP_401168
Protein GI: 81300960
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
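
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive (the record does not state the convention) and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2234141, 2236354)
print(length)                  # 2214
print(protein_length(length))  # 737
```

Both values agree with the Gene Length and Protein Length fields listed in the record.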


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.242321
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAATTAA AACAACTGCG AAAACTTGCT CCTGAAATCT CTTTTCTTCT AATTGCTGGC 
CTCACGTTAC TTGCAATATT ACTGCTGCAA TTTGATTGGC AAAATGCTTT TGCTAACTTG
ATTGGATTAT GGTACTCAGG TCAAGCTCTG TTCAAGCCGG AGGGATCCAC ACTAGAGAAC
TTACTCTATC CCGCCTTTGT TTGGCTAGCG ATTACTTTTC TACTCAAGGA GCTTTCTCCT
CAACCGTCAT TTTGGCCTCG ACTGATCACG AGCATCGGCA TTGCAGTTCT AGGAATCCGT
TACGAGCTAT GGCGATTCTT CGGAAGCCTC AATCTCGACG ATCCCCTCAA TGGCACTCTA
TCTGTTCTAC TCTTTTTTGC TGAATTGCTG ACAGTTGTTA ATACAATCGC ATTTTTTCTC
CATACAATTT TTAGCTATGA TCGCTCGCCA GAAGCAGACA TCCTCAGCCA AGCAGTGATC
AGCCAAGCCT ATCAACCAAG CGTTGATATT ATTCTGCCCA CCTACAACGA AGGGGTTGAT
ATCCTCAGAA GATCGGTCAT TGCTTGCCAG GCAATGGACT ATTCTAATAA AAAGGTCTTT
CTTCTCGACG ATACTCGTCG TCCAGAAGTT CGAGCGCTTG CAGAAGAATT AGGCTGCGGC
TACTTCGATC GCCCAGACAA CAAACATGCG AAGGCAGGTA ATATCAACCA TGCCTTGCCC
TACTTATCTG GAGAATTATT AGTTGTTTTT GACGCAGATT TTGTCCCTAG TAAAAACTTT
CTAATTCGAA CAATTGGATT TTTCCAAGAT CCAAAAACGG GCCTGCTACA AACGCCTCAG
CATTTTTTTA ATGAAGATCC GATTTCAGTC AACCTTGGAC TTGAAGGAAT TTTGAATAAC
GAACAAAATT TATTTTTTCG GTTTATTCAA CCAAGCCGAG ACTATTTTAA CTCAGTCGTT
TGCTGTGGTA CCTGCTTTGT TATTCGGCGA TCGGCACTCG ATGAGATTGG TGGAATTCCG
ACGGAAAGTA TCACTGAAGA CTACTTTACT TCGATTAAGC TGCAAACTCA TGGCTATCGC
GTCAAGTATC TCAATGAGGC ATTGGCAGCA GGCCTAGCAC CGGAAACTAT TAGCGCCTAT
GTTAATCAGC GACTTCGTTG GGGGCAGGGA ACTCTCCAGT CCCTCTTTAC AGATTCCAAT
TTTCTAACGG TCCCAGGCCT CAACTGGCAG GAGCGCCTTT CCCAGTCACT CAGTATCGTT
TACTGGTTCC TATCGATTCC TAGAGTTGTC TTTCTCGTTG TTCCCTTAGC ATTTCTGATA
TTTGGGTTGG CACCTCTCCG TGCTACTGTC AATGAAATTC TTTATTTTTT CCTTCCATAT
TACATCGCAA ATATCATGGC ATTTTCTTGG CTGACAGAAG GAAGGCGATC CGCATTTTGG
TCTGATGTTT ATGAGACGAT TATTTGTGTG CCAATGGCAA TTACTATTCT CAGGACTTTA
GTTAGCCCTA AGGGAAAACC TTTTAAAGTG ACACCCAAGG GGGATACTAA TTCTGATGGA
ATCACAATTA ACTGGTCGCT CATTACTCCT CTCTTAACAG TAATGATCCT GACTATAATT
GGAATTGTTA TGAGGAGTTC TAGTCTGCAA GATACGAATG TAAATCCCGA TAGCTTAGTA
ATTAATATAG TTTGGGCTGT TTATAACTTA TCACTGCTGC TCATTTGTAT TCTTGTCGCA
ATCGATGTTC CTCAACGCCG TCACACTCGG TTTTCCCATG AAGAAGCTTG TGAATTGCGT
CTGGGAGCAG TTGTCTTTTC AGGGGTTACT GAAGACCTTT CAGAAGAAGG GGGGCGAATC
TTACTGAATC GCTTCGAGGG TTTAGTGGAT ATTGAGGATG AAGCACCTCA AACTACTTTG
GAAGGAATTT TTCGTCTGCG ATCGCCTAGC CCTTTAGCTG ACATGCCAAT TCATGCAGAA
GTGAGATGGC ACGATCGCCG TGCTCTTGCT GCTCTACGTT TTGAGGCTGC TGTAACTGAG
GTAGAGGTAC GCCTTTACTT CTCAAATCTG ACGCTTGCAG AACAGCGCCG CTTAATTCCT
TATCTCTACT GCCAGCCCGG CCAGTGGCAA GACTTACAAG TGCCTGAATT CAAGACAGCT
TGGGCTATGC TTCAATCGGT CTTTCGTCTG CATCCCTTGG CAAATAGCCG CTGA
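
The gene sequence is printed in space-separated blocks of 10 bases, 60 bases per line. As a rough sketch of how such a listing might be flattened into one string and its GC fraction computed (the helper names are hypothetical, and the example uses only the first line of the sequence):

```python
def flatten_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the listing.
first_line = "ATGCAATTAA AACAACTGCG AAAACTTGCT CCTGAAATCT CTTTTCTTCT AATTGCTGGC"
seq = flatten_sequence(first_line)
print(len(seq))               # 60
print(seq.startswith("ATG"))  # True
print(gc_fraction(seq))       # GC fraction of this line only
```

Applying `gc_fraction` to the full flattened sequence would give a value to compare against the GC content field in the record.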
 
Protein sequence
MQLKQLRKLA PEISFLLIAG LTLLAILLLQ FDWQNAFANL IGLWYSGQAL FKPEGSTLEN 
LLYPAFVWLA ITFLLKELSP QPSFWPRLIT SIGIAVLGIR YELWRFFGSL NLDDPLNGTL
SVLLFFAELL TVVNTIAFFL HTIFSYDRSP EADILSQAVI SQAYQPSVDI ILPTYNEGVD
ILRRSVIACQ AMDYSNKKVF LLDDTRRPEV RALAEELGCG YFDRPDNKHA KAGNINHALP
YLSGELLVVF DADFVPSKNF LIRTIGFFQD PKTGLLQTPQ HFFNEDPISV NLGLEGILNN
EQNLFFRFIQ PSRDYFNSVV CCGTCFVIRR SALDEIGGIP TESITEDYFT SIKLQTHGYR
VKYLNEALAA GLAPETISAY VNQRLRWGQG TLQSLFTDSN FLTVPGLNWQ ERLSQSLSIV
YWFLSIPRVV FLVVPLAFLI FGLAPLRATV NEILYFFLPY YIANIMAFSW LTEGRRSAFW
SDVYETIICV PMAITILRTL VSPKGKPFKV TPKGDTNSDG ITINWSLITP LLTVMILTII
GIVMRSSSLQ DTNVNPDSLV INIVWAVYNL SLLLICILVA IDVPQRRHTR FSHEEACELR
LGAVVFSGVT EDLSEEGGRI LLNRFEGLVD IEDEAPQTTL EGIFRLRSPS PLADMPIHAE
VRWHDRRALA ALRFEAAVTE VEVRLYFSNL TLAEQRRLIP YLYCQPGQWQ DLQVPEFKTA
WAMLQSVFRL HPLANSR