Gene Syncc9605_2221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9605_2221 
Symbol 
ID3735413 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9605 
KingdomBacteria 
Replicon accessionNC_007516 
Strand
Start bp2034274 
End bp2036559 
Gene Length2286 bp 
Protein Length761 aa 
Translation table11 
GC content61% 
IMG OID637776809 
ProductN-acetylneuraminate synthase 
Protein accessionYP_382515 
Protein GI78213736 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2089] Sialic acid synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGAGGCA TCCTGATTGA GCGAAATTTC ACCCAGTTCG TTGTCTTCGC GGAAGACAGC 
ATCCTCTCGG CACTCAGCAA GATCACGGCG AACCAGTCCC GCCTGATTTT TGTTGTCTCG
GAAAGTGGCA TTCTCCAGGG CGTTCTCACC GATGGTGACT TCCGACGCTG GATTGCCGGA
TGCGGCGAGA TTGATCTGAA CCGGCCGGTC ACTGCGGCCA TGAACGCCAA CTGCCGGTCG
GCCGTCGAAG GCACCAGCGC CAGTGATCTG AGTGCCCTGC TCAATTCCCG CATCATTGCG
CTGCCGCTGC TGGACAGCCA TGGCCGCATC GTCGCCGTGG CCCGACGCGC CACCGACGGG
CTCCAGATCG GTTCCCACCG CATCGGCGAT GACGCCCCCT GCTTTCTGAT CGCCGAGATC
GGCAACAACC ACAACGGCGA TCTCGACACC GCTCTTCAAC TGATCGATGC CGCCCACGCC
GCAGGGGCGG ACTGCGCCAA GTTCCAGATG CGGGACATGA GCCGGCTGTA TCGCAATTCA
GGCGATAGCA ACGACATGGC CTCCGATCTG GGCACCCAGT ACACCCTGGA TCTGCTCGAG
CGCTTCCAAC TCAGTGACGA TGAGCTGTTC CGCTGCTTTG ACCATGCCGC CAGCAAAGGC
TTGGTACCTC TCTGCACCCC TTGGGATGAA ACGAGCCTCG AGAAGCTCAA CCGCTGGGGG
ATGGAAGGCT TCAAGGTGGC GTCGGCGGAT TTCACCAACC ACGCTTTTCT CTCCAGCCTG
GCGGAGACCG GCAAACCCCT GATCTGCTCA ACCGGCATGG CTTCGGAAGT GGAGATCCGC
TCCGGCATCC GCCACCTCCA AAACGAGGGA GCCGGTTACG TGCTGCTGCA CTGCAATTCC
ACGTATCCCA CCCCCTTCAA GGACGTCAAT CTGCGCTACC TCGAGCGGCT GCGCGACCTG
GCCGAGGCTC CCGTGGGCTA TTCCGGCCAT GAACGGGGCA TTGAAGTGCC GATTGCCGCC
GCCGCCCTCG GCGCCGTGGT GATCGAGAAA CACATCACCC TCGATCGCTC TATGGAAGGC
AACGACCACA AGGTGAGCCT CCTGCCCAAT GAATTCGCCC AGATGATCCA CGGCATTCGC
CGGGTGGAAG AGTCGATGGG CAGCAGCGGC GAGCGCAGCA TCAGCCAGGG CGAAATGATG
AACCGCGAAG TGCTGGCCAA GAGTCTCGTC GCCAGATGCG ACGTGCCTGC CGGCACGGAG
ATCACCGAGG CGATGGTGGG CATCCAGAGC CCGGGGCAAG GCCTGCAGCC CAACCGTCTT
GCCGACCTCA TTGGCAAGAC GCTGCCGGTG AGCAAAGCCG CCGGCGATTT TTTCTTTCCC
TCAGACCTTG AAACCCCGGC CGCTACGCCG CGCGCCTACC GATTCCAACA CCGCTTTGGC
CTACCGGTTC GTTATCACGA CATCGAGAGC TTTGCTGCCA GCAGCAATCT CGACCTTGTT
GAAATCCATC TCAGCTACAA GGATCTCGAG ATCAACCTCG ATGAAGTGCT GCCGACAAAG
CAGCCGATAG GCCTGGTGGT GCATGCCCCC GAATTGTTTG CTGGGGATCA CACCCTCGAC
CTCTGCAGCG CTGATGGCGA CTACCGCCGC CATTCGATTG CAGAACTCCA GCGCGTGGTC
GACATCTCTC GTGACCTGCG AAATCGTTTC GACTGCCCCG ATCCCGTTCT ACTGGTGACC
AACGTCGGGG GATTCTCAGA GCACCACCAT CTCGAGCGCG CGGACTTGCA ATCGCTACGG
CAACGCCTGA TCGAGAGCCT TCAGCAGATC AACACCTCTG ACGAGGTGGA AATCATTCCC
CAAACCATGC CCCCCTTCCC CTGGCACTTC GGGGGGCAGA GATATCACAA CCTCTTCGTC
GACACCGACT TCATCGAGGA ATTCTGCAAG GGAACCGGCA TGCGCGTCTG CCTGGATGCC
TCCCACTCCA AGCTCGCCTG CACCCACCTC AATGCCTCCT TCAGTGGCTT CCTCAGGGCG
ATCCTTCCGT TCACAGCCCA CCTGCATCTG GCTGACGCCA AGGACGTGGA TGGTGAAGGG
CTGCAGATCC ACGACGGTGA GATCGACTGG GTGCAGCTGT TTGCCCTGAT GGGTCAACTG
GCCCCCGAAG CAAGCTTCAT CCCCGAAATC TGGCAAGGCC ATAAGAACAA CGGGGAAGGC
GCCTGGCTCG CCCTTGAGCG CCTTGAAGGC TGTGTTGATT TGAGCCAGCA GCGGCATGTC
GCGTGA
 
Protein sequence
MRGILIERNF TQFVVFAEDS ILSALSKITA NQSRLIFVVS ESGILQGVLT DGDFRRWIAG 
CGEIDLNRPV TAAMNANCRS AVEGTSASDL SALLNSRIIA LPLLDSHGRI VAVARRATDG
LQIGSHRIGD DAPCFLIAEI GNNHNGDLDT ALQLIDAAHA AGADCAKFQM RDMSRLYRNS
GDSNDMASDL GTQYTLDLLE RFQLSDDELF RCFDHAASKG LVPLCTPWDE TSLEKLNRWG
MEGFKVASAD FTNHAFLSSL AETGKPLICS TGMASEVEIR SGIRHLQNEG AGYVLLHCNS
TYPTPFKDVN LRYLERLRDL AEAPVGYSGH ERGIEVPIAA AALGAVVIEK HITLDRSMEG
NDHKVSLLPN EFAQMIHGIR RVEESMGSSG ERSISQGEMM NREVLAKSLV ARCDVPAGTE
ITEAMVGIQS PGQGLQPNRL ADLIGKTLPV SKAAGDFFFP SDLETPAATP RAYRFQHRFG
LPVRYHDIES FAASSNLDLV EIHLSYKDLE INLDEVLPTK QPIGLVVHAP ELFAGDHTLD
LCSADGDYRR HSIAELQRVV DISRDLRNRF DCPDPVLLVT NVGGFSEHHH LERADLQSLR
QRLIESLQQI NTSDEVEIIP QTMPPFPWHF GGQRYHNLFV DTDFIEEFCK GTGMRVCLDA
SHSKLACTHL NASFSGFLRA ILPFTAHLHL ADAKDVDGEG LQIHDGEIDW VQLFALMGQL
APEASFIPEI WQGHKNNGEG AWLALERLEG CVDLSQQRHV A