Gene Synpcc7942_0466 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_0466 
Symbol 
ID3773412 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp451838 
End bp454087 
Gene Length2250 bp 
Protein Length749 aa 
Translation table11 
GC content59% 
IMG OID637798873 
Productcellulose synthase (UDP-forming) 
Protein accessionYP_399485 
Protein GI81299277 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.500609 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAGGC CAATCAGCAG GCAACAATCC CTGCCCACTC TGGCCGCTCT TAGGCACTGG 
TTTTCCCACA ATCGACAGCT CCCCACCTTC TTCACGCTAG GGGGACTGCT AGTGCTACTC
AGCTTACAGA TGGCTTGGGT GCTGGGCGAA CCGCGCACTA ATCAATTTTT TCAAACAATC
GATCGCTGGC AACAACGGCC GCCGCTCTGG TTGCAAACGC CCATGGAAGG GCTCTGGGCC
TGGAGTTGGC CCTGTCTGCT GCTGGTCTGC CTGTGGGTGA TTACGCGGCT ATGGACCCGA
CCCAGTTTTT GGCCTCGGAT CGTAGTGGTT GGTATTCTGG CACTGCTGAC GGTGCGCTAT
CTCCTCTGGC GATCGCTCTC AACCCTCAAT GTCTCGACCC CGTTGAATGG CGGTTTGAGT
CTGATTCTCT ATGGCATGGA GCTGCTGTTG CTGGTCACGG GGCTGATTCA GTTGTTGCTG
AGTGTGGCAG CCCGCGATCG CCAAGCCCAG GCGGATCAGT ATGCAGTGGC GGTGCAACAG
GGGCAGTACC AGCCCCACGT CGACATTTTG GTGCCGACCT ACAACGAGCC CGTGGGTTTG
CTACGACGGA CGCTCGTCGG TTGCTTGACC TTGGATTACG CCGCCAAAAC GGTGCATGTC
CTCGATGACG GCGATCGCCC AGAAGTAGCG GCTCTAGCGC GGCAGTTGGG CTGTCGCTAT
CAAGCCCGTC GCGATCGCCA GGGGGCTAAG GCCGGGAATC TCAACTACGC CTTGCCCAAT
TGTCGAGGCG AACTGGTGGC GGTCTTTGAT GCCGACTTTA TTCCTCGGCA GTCCTTTCTG
GCGCGGACGG TCGGGTTCTT CCAAGACGGC CGCATCGGAC TGGTGCAAAC GCCCCAAAGC
TTCTACAACC CTGACCCGAT CGCCTACAAC CTCGACTTGG CTGAACAGAT TCCGCCCGAG
GATGAAATCT TTTATCGACA TGTTCAACCG ATGCGAGATG GCGTCGGCAG TGTTGTCTGC
GTGGGCACCT CCTTTGTTGT GCGCCGCCAA GCTTTGGATG CGATCGGTGG TTTTGTGACC
GAGTCGCTGT CCGAAGACTA CTTCACTGGG ATCCGCATTG CAGCAGCAGG CTACCAGCTG
GTTTACCTCA ACGAAAAACT GAGTCAGGGT TTGGCCCCGG AGAGTTTGGC TGCCTATGCC
AAGCAACGCT TGCGTTGGGC GAGGGGGACG CTCCAAGCCT TCTTTATTCA GGCCAACCCT
CTAACGGTTC CAGGGCTCAA TCCGTTGCAG CGCCTCGGCC ACTTGGAGGG GCTGTTGCAC
TGGTTCTCGA GCCTACCGCG CATCCTCTTT CTGGTGATGC CGTTGGGCTA CGGCGTTGGC
ATCATGCCGC TACGGGCCAC TGGGCCCGAG CTGCTCTATT TCTTGCTGCC GATCTACCTG
GGGCACCTGA CGGTCTTCAA CTGGCTGAAT CGGCGATCGC GATCGGCACT ACTCTCCGAA
ATCTACTCCT TAGTGCTGGC TGTGCCACTG GCGATCACCA GTCTGCAAGC GCTGGTGCAG
CCCTTCCGTT GCCAATTCGC CGTCACCCCC AAAGGACTCC GGCAGGATCG CTTCTTCTTC
AACTGGGCGT TAGCTTGGCC GCTGCTGATT CTCTTTGCTG CCACCTGGCT CAGTCTGGCG
CTGAACGTGC GGCAGTTGTG GGTGCTGAGT CAGCAGGCCG ATCAGACCCA AATGCGAGGT
CTGGCTCTTG GGCTGTGGTG GTCCGGCTAC AACTTGGTGT TGCTGGCCGT GGCGTTGCTC
GCGCTCTGGG ATGCCCCTCG GGATGGTCGC GAAGCCATGA TTGCTCGCCC CCTGGCTTTA
GAACTACAAA CTGATTCAGG ACACGTGCTG ACGCTACGGA GTCAAGCTCA AAGTGAAATG
GTCATTCGTC TCGAAGGCAA TTGGGTCCTC CCGGAAGGTT CTCTCACCCT GACGCGCTTG
GGAGATCAAA CGCTCTCGGT TCCAGTTCAG CACTGCGATC GCGATACCCA AGGCACTTGG
TTGTGGTTAG CGCCGCAGCC CATCGATCAA AATCGCCTGA TTAAAACAAT CTACTGCTAC
GGAAGCGAAG ACCATCGTCC AGAAAGTCCG GGCGAACGGC GATCGCTGCT TTTGATCCTG
AAAACCCTAC TGCGACCGCC CATCCTGCGA TCGCGGCAGC GGCCAGCCCA GCTGGGTCGG
TTGCTGGTAG AGGTGCCGCA GCGTTCCTAG
 
Protein sequence
MTRPISRQQS LPTLAALRHW FSHNRQLPTF FTLGGLLVLL SLQMAWVLGE PRTNQFFQTI 
DRWQQRPPLW LQTPMEGLWA WSWPCLLLVC LWVITRLWTR PSFWPRIVVV GILALLTVRY
LLWRSLSTLN VSTPLNGGLS LILYGMELLL LVTGLIQLLL SVAARDRQAQ ADQYAVAVQQ
GQYQPHVDIL VPTYNEPVGL LRRTLVGCLT LDYAAKTVHV LDDGDRPEVA ALARQLGCRY
QARRDRQGAK AGNLNYALPN CRGELVAVFD ADFIPRQSFL ARTVGFFQDG RIGLVQTPQS
FYNPDPIAYN LDLAEQIPPE DEIFYRHVQP MRDGVGSVVC VGTSFVVRRQ ALDAIGGFVT
ESLSEDYFTG IRIAAAGYQL VYLNEKLSQG LAPESLAAYA KQRLRWARGT LQAFFIQANP
LTVPGLNPLQ RLGHLEGLLH WFSSLPRILF LVMPLGYGVG IMPLRATGPE LLYFLLPIYL
GHLTVFNWLN RRSRSALLSE IYSLVLAVPL AITSLQALVQ PFRCQFAVTP KGLRQDRFFF
NWALAWPLLI LFAATWLSLA LNVRQLWVLS QQADQTQMRG LALGLWWSGY NLVLLAVALL
ALWDAPRDGR EAMIARPLAL ELQTDSGHVL TLRSQAQSEM VIRLEGNWVL PEGSLTLTRL
GDQTLSVPVQ HCDRDTQGTW LWLAPQPIDQ NRLIKTIYCY GSEDHRPESP GERRSLLLIL
KTLLRPPILR SRQRPAQLGR LLVEVPQRS