Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Synpcc7942_0889 |
Symbol | |
ID | 3774066 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus elongatus PCC 7942 |
Kingdom | Bacteria |
Replicon accession | NC_007604 |
Strand | - |
Start bp | 894636 |
End bp | 896453 |
Gene Length | 1818 bp |
Protein Length | 605 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637799305 |
Product | hypothetical protein |
Protein accession | YP_399906 |
Protein GI | 81299698 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4632] Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.914068 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.000140666 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAAACCTG CGCTGAGACG TGGATTGGTC TGGATTGGTT GCACCGTCAG TAGTTGGGGT GGGCTTGTTC TTCCGTCAGC TTGGAGCCCT GCAGCGATCG CCCAGACACG AGCTGCCACC AATCAAACCG GGATGCAACT CCAGATCAAT GGTCGGGCGA TCGCCGGCCA GTGGGTCGTA GAACCAGCTT CCCAGGGCAG CTTTATCTGG ATTGAAGAAG TTGCGCTGCG CAACGGCCTT GGCTTGGAAC TATTACCGGG CTTGAGTACC ACTCGCCAAG CGATCCGCTG GTTCTCGGAT GGACGTGTGG CTGAAGGGGT GGAGCAAGTG CCCGCCCGTC GTCAAAATGG TCGGCGTTTC TTAGAAGTGT CATCGCTGGC GCGTCGCAGT GGTTGGTCCC TGGCAGTACT GGCAGACCAA CTGGTGATTC AACTGCCGGC GGCGACGCTC CGGAATCTCA GGGATGGCCA ACAGGGAGCT AACCGCCGCC TCGTCTTGGA TTTCGATCGC CCAACCCCAT GGGAGTGGGA TGCAATCGGC CAACAGCTCC GCTTTGATGG TTCAGTTCCT GCTCAGTTGG TGCCCAGTTT GCAGCGCTAC GGCATTCAGG CAGAGCAACA GGGCGATCGC ACGAGCCTAC GATTACCGGC TGGTCTGGGG GTCCGCGTCT CGAGCTTGGG ATCGCCTGAT CGCATCTTTA TCGACCTGCC AACTACCACC GGCCTCCTCA GCCCGCCCGC TGGTCTGGGG AGTAATCCCG CCCCCTTGAC TCCCGCCCCA CCTGACCTAG CGGGTACGCA ACTGCAGCAG CGTCAAGTCA CTGTTGACGG CGCGACCTTC CCCGTCTTTG TGATTCAGTT GGACCTGCGC CAACCTAATG TTCGTCTCGC ACCGATTTGG GCAGGCAATG GCTCACTGGA GGGCACACAG GTTTTGCAAG CTGTGGCTCG CGATCGCGGG GCTGCGATCG CCATCAATGC GGGCTTTTTC AACCGCAATA ATCGTCTGCC ACTCGGGGCG ATTCGCCGTG ACAACATCTG GTATTCCGGT CCGATCCTCA ACCGTGGGGC CATGGCTTGG AATGATCAGG GCGAAGTGCT GATTGACCGG CTCGGTCTGC AAGAGACCCT GCAACTCAGC TCTGGCACTC GCATTCCCCT CGTGGCGCTC AATAGTGGCT ACGTTCGCGC TGGAGCCGCT CGCTACACCG AGGCTTGGGG CAACAGCTAC CAGACAATCC TCGACAACGA AGTAGTGGTG ACAGTCCAGG GCGATCGCGT GGTCTCCCAG TCCCAAGCGG ACAAGGCTGG CAGCAATCGC TTTACGATTC CCCGCAATGG CTACCTGATC GTCTTGCGAT CGGCCAATAG CCTGCGCACT TCTCTGGTGA ACGGCACGAC AATTCAAGTC CTACAGCAAG CGCAACCCAG CCAGTTCGAT CGCTTCCCCC ATGCTCTAGG AGGAGGGCCA CTCCTCGTCA AATCAGGGCG AGTGGTCGTC AATCCCCAAG CCGAAGGATT TAGCCGAGCC TTTGAGATCG AAGCAGCTCC GCGCAGTGCG ATCGGTCTGA TGCCGGATGG CCGCTTGGTT CTAGTGGCTG CCCACGAGCA AAACCAAGGC CAAGGCCCCA CCCTGCCTCA AATGGCTGCG ATTATGCAGC AGCTCGGCGT CGTTGATGCC CTCAACTTTG ATGGCGGCAG CTCCACTTCT CTAATCGTCA ATGGTCAGCT CGTCAATCGG GCTCGAGGCA GTGCTGCCCG GGTTCACAAC GGGCTCGGGG TCTTCCTGGG GCCGACTACG CCGGCCAGTC TGCGATGA
|
Protein sequence | MKPALRRGLV WIGCTVSSWG GLVLPSAWSP AAIAQTRAAT NQTGMQLQIN GRAIAGQWVV EPASQGSFIW IEEVALRNGL GLELLPGLST TRQAIRWFSD GRVAEGVEQV PARRQNGRRF LEVSSLARRS GWSLAVLADQ LVIQLPAATL RNLRDGQQGA NRRLVLDFDR PTPWEWDAIG QQLRFDGSVP AQLVPSLQRY GIQAEQQGDR TSLRLPAGLG VRVSSLGSPD RIFIDLPTTT GLLSPPAGLG SNPAPLTPAP PDLAGTQLQQ RQVTVDGATF PVFVIQLDLR QPNVRLAPIW AGNGSLEGTQ VLQAVARDRG AAIAINAGFF NRNNRLPLGA IRRDNIWYSG PILNRGAMAW NDQGEVLIDR LGLQETLQLS SGTRIPLVAL NSGYVRAGAA RYTEAWGNSY QTILDNEVVV TVQGDRVVSQ SQADKAGSNR FTIPRNGYLI VLRSANSLRT SLVNGTTIQV LQQAQPSQFD RFPHALGGGP LLVKSGRVVV NPQAEGFSRA FEIEAAPRSA IGLMPDGRLV LVAAHEQNQG QGPTLPQMAA IMQQLGVVDA LNFDGGSSTS LIVNGQLVNR ARGSAARVHN GLGVFLGPTT PASLR
|
| |