Gene Syncc9605_0794 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9605_0794 
Symbol 
ID3737305 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9605 
KingdomBacteria 
Replicon accessionNC_007516 
Strand
Start bp758785 
End bp760836 
Gene Length2052 bp 
Protein Length683 aa 
Translation table11 
GC content63% 
IMG OID637775390 
Productcellulose synthase (UDP-forming) 
Protein accessionYP_381117 
Protein GI78212338 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.298349 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0356093 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCTCTAC GAAGCGGCAT AGGAACAACG CCTCCTAGCG TTGGCAAGGA ATCAACGCCT 
GGCTGTGGAG TGAGCCTGCT GCTGCTGCTC TGGCCGTTGG CGCTGATCCA GCGGCCGGAA
GCGGAGTCTT CGCTGTGGGC CCGACGCAGC CTGATCCTGT TGATCAGCGC CCTCACCCTT
CGTTATCTGC ACTGGCGCTG CACCGCCAGC CTCAATCTCA ACACCGCTGT TTCCACCAGC
CTCAGCGGCC TGTTACTGCT GGCGGAAGGC TGGCTACTGA TCACCGGTCT GCTGCCGCTG
TGGCTGGCCT GGCACCGCTT CCCGGATCGA CGTCGGGACG TCCAAAAACG CCATCGGGAC
TGGCAGGCCT CGGCGTGGAG GCCCCATGTG GACATCCTCG TACCCACCTA CGGCGAGCCG
CTGGCGGTGC TGCAACGCTC GCTTCTGGGC TGCACGCAGC AGAGCTATCC CCACACCAGT
GTTTTGGTGC TGGATGACTC TGGCCGTGAG GAGGTGAAGC GCCTAGCCAA ACAACTGGGC
TGTCGCTACT TGCATCGTCC CGAACGCCTG CACGCCAAGG CGGGCAACCT CAACGCCGGG
CTGAGCCAAT GTCATGGCGA GCTGGTGGCG GTGTTCGATG CGGACTTCAT CCCCCAGCAG
CGGTTCCTGG AACACAGCAT CGGCTTTCTG CTGGACCCCG ATGTAGCCAT GCTGCAGACG
CCGCAGTCGT TCATCAATGC CGATCCGGTG ATGCGCAACC TGCGGATGGA GTCGTGGCTG
CTGCCGGATG AGGAGAGCTT CTATCGCTGG ATCGAACCGG TACGGGACGG CTGGGGCGCC
GTGGTCTGTG CCGGCACAGC CTTCGTGGTG CGGCGGCGCG CCCTGGATGG GATCGGTGGC
TTCGTCGAGG GTGCCTTGTC GGAGGACTAC GTCACCGGCA TTGCCCTGCG TGCGCAGGGA
TGGAAGCTGC TGTACCTCCA GCAGAAGCTC AGTGCTGGGC TGGCCGCCGA GTCCATGGAG
GACTTCGTGC AACAGCGGCA ACGCTGGGCC AACGGCACCC TCCAAAGTTT GAGACTGCCC
CATGGCCCTC TACGGGCGTA TGGCCTCAGC CCCGGCCAGA GGCTGGCCTA CATGGAGGGC
GTCATCCATT GGCTCAACAA CCTGCCGAGG TTGGTGCTGA TGCTGATGCC GCTGAGCTAC
GGGCTGCTTG GTGTCGCACC AATCCTTCTG GATCAGCGAG CCATCATTGA ATTGATGCTG
CCCCTCTGGG GAACGGTGCT GCTCAGCATC GGCTGGCTGA ATCGCAACAG CCGCTCTGCC
CTGCTGACCG AACTAACCAG CTGGGTGCTC ACCGTTCCGC TGGTGGTGAG CCTGATCTGG
AATGTCCTCG GTTCCTCCGT TGGATTCCGG GTCACCCCCA AGCATCGCCA GCGATCCCGG
GGCGGCTGGT CATGGTTTCT AGCGCTTCCC CTGATCGTGC TTAGCCTGTT CAACCTGGCG
AATCTGCTGG GGCTCGTGCA GCAGCTGATG CTCGCAGGCT GGGACAGAGT CGGGCCGCTT
CAGCTGGGGC TGGTCTGGGC TGGGTTAAAT CTGCTGGGAA CCCTCATCGC TCTTCGCGCC
TGCTGGAATC CTCCGCAGAG CGATCCCTCC CCCTGGCTCA GCCTTGATCA CGCCGCCGAG
CTGATTGATT CCGGGGGCCA TTGCCATCCC TGTCGGATCA CCGCCATCAG TGAAAGCGGT
GTGGAGCTGG CCTTCTCCAC AAAGGTTCTG CCCCTGATGC ACAGCAGCCA GCTGCAGTGG
ACGGCCGCCA TCCCGCCTCT ACCGGTTGTG ATGCTGCAGA TCCAAGGGCG GCAGGCAGCC
TTGAGCTGGG GCAACCTCAG CCAACAACAA CAACACAGCC TGATCCGCTG GTTGTTCTGC
AGTGATGGGA TATGGCCAGA CCGGCGCCCC AGACGGGAAG TGTTGGGACT GCTGATGCTG
CTGAAACGGT TGCTCTTCGG CGGCCCAACT CCCCAAGTTT TCCACCGTTC TCTTGTGCCA
CGTCTTCCCT GA
 
Protein sequence
MALRSGIGTT PPSVGKESTP GCGVSLLLLL WPLALIQRPE AESSLWARRS LILLISALTL 
RYLHWRCTAS LNLNTAVSTS LSGLLLLAEG WLLITGLLPL WLAWHRFPDR RRDVQKRHRD
WQASAWRPHV DILVPTYGEP LAVLQRSLLG CTQQSYPHTS VLVLDDSGRE EVKRLAKQLG
CRYLHRPERL HAKAGNLNAG LSQCHGELVA VFDADFIPQQ RFLEHSIGFL LDPDVAMLQT
PQSFINADPV MRNLRMESWL LPDEESFYRW IEPVRDGWGA VVCAGTAFVV RRRALDGIGG
FVEGALSEDY VTGIALRAQG WKLLYLQQKL SAGLAAESME DFVQQRQRWA NGTLQSLRLP
HGPLRAYGLS PGQRLAYMEG VIHWLNNLPR LVLMLMPLSY GLLGVAPILL DQRAIIELML
PLWGTVLLSI GWLNRNSRSA LLTELTSWVL TVPLVVSLIW NVLGSSVGFR VTPKHRQRSR
GGWSWFLALP LIVLSLFNLA NLLGLVQQLM LAGWDRVGPL QLGLVWAGLN LLGTLIALRA
CWNPPQSDPS PWLSLDHAAE LIDSGGHCHP CRITAISESG VELAFSTKVL PLMHSSQLQW
TAAIPPLPVV MLQIQGRQAA LSWGNLSQQQ QHSLIRWLFC SDGIWPDRRP RREVLGLLML
LKRLLFGGPT PQVFHRSLVP RLP