Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Synpcc7942_0839 |
Symbol | |
ID | 3774016 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus elongatus PCC 7942 |
Kingdom | Bacteria |
Replicon accession | NC_007604 |
Strand | - |
Start bp | 833950 |
End bp | 834954 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637799255 |
Product | Nitrilase |
Protein accession | YP_399857 |
Protein GI | 81299649 |
COG category | [R] General function prediction only |
COG ID | [COG0388] Predicted amidohydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.29683 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.0736263 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGATA AAATCATTGT GGCAGCTGCA CAAATCCGAC CTGTTCTATT CAGTTTGGAA GGATCTGTTG CTCGGGTTCT AGCGGCCATG GCAGAAGCCG CAGCAGCGGG CGTTCAGCTG ATTGTTTTCC CTGAAACCTT TCTGCCCTAC TATCCTTATT TCTCCTTCGT CGAACCGCCG GTTCTGATGG GGCGATCGCA CCTCAAGCTC TACGAACAAG CCTTCACAAT GACGGGGCCG GAACTCCAGC AAATTGCCAG GGCTGCTCGA CAGCATCGTC TCTTTGTTTT GCTCGGCGTC AATGAGCGAG ATGGCGGTAG TCTTTACAAC ACTCAGCTAT TGATTAGCGA TCAGGGTGAC TTACTTCTGA AGCGCCGCAA AATCACCCCG ACCTATCACG AACGGATGGT CTGGGGACAA GGCGGTGGCG CGGGCCTAAC CGTTGTCGAA ACGGTGCTGG GTAAGGTTGG GGCCTTGGCT TGCTGGGAGC ACTACAACCC CTTAGCCCGC TTCAGTCTGA TGACTCAGGG GGAGGAAATT CACTGCGCCC AATTCCCAGG ATCGCTGGTG GGGCCAATTT TTAGCGAGCA AACCGCGGTC ACACTGCGTC ACCATGCCCT CGAAGCCGGT TGCTTTGTGC TCAGCTCGAC GGCTTGGTTG GATCCGGCCG ACTACGACAC AATCACTCCC GATCGTAGTT TGCACAAAGC CTTTCAAGGG GGCTGTCACA CCGCGATCAT CAGTCCCGAA GGGCGCTATC TCGCAGGGCC ACTGCCCGAG GGCGAAGGAC TCGCGATCGC TGAACTCGAC AAAAGTTTGA TCACCAAACG TAAGCGAATG ATGGACAGCG TCGGTCACTA TTCGCGCCCC GATTTACTCA GCCTTCGGAT CAATCGCAGC CCTGCCACAC AAGTGCAAGC GATCGGTTCA GCTGCTGCCT TGCCAGAACT ACCGAACCTA GAAGCCGCAC CGGCTGAGAC TGCGGAGGAC TATCTCCATG CCTAA
|
Protein sequence | MADKIIVAAA QIRPVLFSLE GSVARVLAAM AEAAAAGVQL IVFPETFLPY YPYFSFVEPP VLMGRSHLKL YEQAFTMTGP ELQQIARAAR QHRLFVLLGV NERDGGSLYN TQLLISDQGD LLLKRRKITP TYHERMVWGQ GGGAGLTVVE TVLGKVGALA CWEHYNPLAR FSLMTQGEEI HCAQFPGSLV GPIFSEQTAV TLRHHALEAG CFVLSSTAWL DPADYDTITP DRSLHKAFQG GCHTAIISPE GRYLAGPLPE GEGLAIAELD KSLITKRKRM MDSVGHYSRP DLLSLRINRS PATQVQAIGS AAALPELPNL EAAPAETAED YLHA
|
| |