Gene Synpcc7942_0889 details

Gene Information

Locus tag: Synpcc7942_0889
Symbol:
ID: 3774066
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Synechococcus elongatus PCC 7942
Kingdom: Bacteria
Replicon accession: NC_007604
Strand:
Start bp: 894636
End bp: 896453
Gene Length: 1818 bp
Protein Length: 605 aa
Translation table: 11
GC content: 60%
IMG OID: 637799305
Product: hypothetical protein
Protein accession: YP_399906
Protein GI: 81299698
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4632] Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase
TIGRFAM ID:
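
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa of the encoded protein, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length = gene_length(894636, 896453)
print(length)                  # 1818
print(protein_length(length))  # 605
```

Both results agree with the Gene Length and Protein Length fields listed in the record.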


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.914068
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.000140666
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage

Sequence

Gene sequence
ATGAAACCTG CGCTGAGACG TGGATTGGTC TGGATTGGTT GCACCGTCAG TAGTTGGGGT 
GGGCTTGTTC TTCCGTCAGC TTGGAGCCCT GCAGCGATCG CCCAGACACG AGCTGCCACC
AATCAAACCG GGATGCAACT CCAGATCAAT GGTCGGGCGA TCGCCGGCCA GTGGGTCGTA
GAACCAGCTT CCCAGGGCAG CTTTATCTGG ATTGAAGAAG TTGCGCTGCG CAACGGCCTT
GGCTTGGAAC TATTACCGGG CTTGAGTACC ACTCGCCAAG CGATCCGCTG GTTCTCGGAT
GGACGTGTGG CTGAAGGGGT GGAGCAAGTG CCCGCCCGTC GTCAAAATGG TCGGCGTTTC
TTAGAAGTGT CATCGCTGGC GCGTCGCAGT GGTTGGTCCC TGGCAGTACT GGCAGACCAA
CTGGTGATTC AACTGCCGGC GGCGACGCTC CGGAATCTCA GGGATGGCCA ACAGGGAGCT
AACCGCCGCC TCGTCTTGGA TTTCGATCGC CCAACCCCAT GGGAGTGGGA TGCAATCGGC
CAACAGCTCC GCTTTGATGG TTCAGTTCCT GCTCAGTTGG TGCCCAGTTT GCAGCGCTAC
GGCATTCAGG CAGAGCAACA GGGCGATCGC ACGAGCCTAC GATTACCGGC TGGTCTGGGG
GTCCGCGTCT CGAGCTTGGG ATCGCCTGAT CGCATCTTTA TCGACCTGCC AACTACCACC
GGCCTCCTCA GCCCGCCCGC TGGTCTGGGG AGTAATCCCG CCCCCTTGAC TCCCGCCCCA
CCTGACCTAG CGGGTACGCA ACTGCAGCAG CGTCAAGTCA CTGTTGACGG CGCGACCTTC
CCCGTCTTTG TGATTCAGTT GGACCTGCGC CAACCTAATG TTCGTCTCGC ACCGATTTGG
GCAGGCAATG GCTCACTGGA GGGCACACAG GTTTTGCAAG CTGTGGCTCG CGATCGCGGG
GCTGCGATCG CCATCAATGC GGGCTTTTTC AACCGCAATA ATCGTCTGCC ACTCGGGGCG
ATTCGCCGTG ACAACATCTG GTATTCCGGT CCGATCCTCA ACCGTGGGGC CATGGCTTGG
AATGATCAGG GCGAAGTGCT GATTGACCGG CTCGGTCTGC AAGAGACCCT GCAACTCAGC
TCTGGCACTC GCATTCCCCT CGTGGCGCTC AATAGTGGCT ACGTTCGCGC TGGAGCCGCT
CGCTACACCG AGGCTTGGGG CAACAGCTAC CAGACAATCC TCGACAACGA AGTAGTGGTG
ACAGTCCAGG GCGATCGCGT GGTCTCCCAG TCCCAAGCGG ACAAGGCTGG CAGCAATCGC
TTTACGATTC CCCGCAATGG CTACCTGATC GTCTTGCGAT CGGCCAATAG CCTGCGCACT
TCTCTGGTGA ACGGCACGAC AATTCAAGTC CTACAGCAAG CGCAACCCAG CCAGTTCGAT
CGCTTCCCCC ATGCTCTAGG AGGAGGGCCA CTCCTCGTCA AATCAGGGCG AGTGGTCGTC
AATCCCCAAG CCGAAGGATT TAGCCGAGCC TTTGAGATCG AAGCAGCTCC GCGCAGTGCG
ATCGGTCTGA TGCCGGATGG CCGCTTGGTT CTAGTGGCTG CCCACGAGCA AAACCAAGGC
CAAGGCCCCA CCCTGCCTCA AATGGCTGCG ATTATGCAGC AGCTCGGCGT CGTTGATGCC
CTCAACTTTG ATGGCGGCAG CTCCACTTCT CTAATCGTCA ATGGTCAGCT CGTCAATCGG
GCTCGAGGCA GTGCTGCCCG GGTTCACAAC GGGCTCGGGG TCTTCCTGGG GCCGACTACG
CCGGCCAGTC TGCGATGA
 
Protein sequence
MKPALRRGLV WIGCTVSSWG GLVLPSAWSP AAIAQTRAAT NQTGMQLQIN GRAIAGQWVV 
EPASQGSFIW IEEVALRNGL GLELLPGLST TRQAIRWFSD GRVAEGVEQV PARRQNGRRF
LEVSSLARRS GWSLAVLADQ LVIQLPAATL RNLRDGQQGA NRRLVLDFDR PTPWEWDAIG
QQLRFDGSVP AQLVPSLQRY GIQAEQQGDR TSLRLPAGLG VRVSSLGSPD RIFIDLPTTT
GLLSPPAGLG SNPAPLTPAP PDLAGTQLQQ RQVTVDGATF PVFVIQLDLR QPNVRLAPIW
AGNGSLEGTQ VLQAVARDRG AAIAINAGFF NRNNRLPLGA IRRDNIWYSG PILNRGAMAW
NDQGEVLIDR LGLQETLQLS SGTRIPLVAL NSGYVRAGAA RYTEAWGNSY QTILDNEVVV
TVQGDRVVSQ SQADKAGSNR FTIPRNGYLI VLRSANSLRT SLVNGTTIQV LQQAQPSQFD
RFPHALGGGP LLVKSGRVVV NPQAEGFSRA FEIEAAPRSA IGLMPDGRLV LVAAHEQNQG
QGPTLPQMAA IMQQLGVVDA LNFDGGSSTS LIVNGQLVNR ARGSAARVHN GLGVFLGPTT
PASLR
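
The relationship between the gene sequence and the protein sequence can be checked with a small translation sketch. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as starts. The sketch does not handle alternative start codons (this gene starts with ATG, so that does not matter here), and the names `CODON_TABLE`, `clean`, and `translate` are illustrative, not part of any database tool.

```python
# Codon table in the conventional TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(dna), 3):
        amino_acid = CODON_TABLE.get(dna[pos:pos + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAAACCTG CGCTGAGACG TGGATTGGTC"))  # MKPALRRGLV
# Last 18 bases of the gene sequence above, ending in the TGA stop codon.
print(translate("CCGGCCAGTC TGCGATGA"))  # PASLR
```

The two fragments reproduce the first ten and last five residues of the protein sequence listed above. Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence would extend the check to the whole record.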