Gene Information |
Locus tag | Syncc9902_2118 |
Symbol | |
ID | 3742767 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus sp. CC9902 |
Kingdom | Bacteria |
Replicon accession | NC_007513 |
Strand | - |
Start bp | 2019288 |
End bp | 2020259 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637772316 |
Product | putative O-succinylbenzoate synthase |
Protein accession | YP_378119 |
Protein GI | 78185685 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | [TIGR01927] o-succinylbenzoic acid (OSB) synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.946546 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCTTC AGCTTCTGGT GCGTTCCTTC AGTTTTTCGC TCCTGCACCC GTTGCGGACC GCCACGGGTG TGGTGGCCAA GCGGCGCGGT TGGCTACTGC GGTTGTGCGA TGGCGATACC GGAGCCGTGG GTTGGGGCGA GGTGGCGCCG TTGCGAACGG AGCAGTGGTT GCACTGCAAA ATATTGATGG GAGCGCTGCC TGAAGAGGTG AGTCGTCATC AGCTTGAGGT GTTGATCCAT CAGGGGCCAG GGGCGTTTGG CTTTGGCTTG GGTTCAGCGT TAGCAGAGCT GGATGGACTC GTTGGGGATT TGTCCTCCCA GCCTTGGCAA GAAGCGCCTT CCCCGGCTCA CCTCCTGCCA GCGGGCGATC AGATGCTCAT GGCGTTGGAT CAAGTTCTCA TGGATTGCCC GTCATCCCAC TCGCTCACCT TCAAGTGGAA GGTGGCTACA GAAGCATCGG AACGTGAACA ACGTTGGTTG GAGCAGCTTC TCGTGCGTCT TCCCCGTAAC GCCAGGCTGC GGTTGGATGC CAATGGCGGG TGGGACCGCT CTACCGCAGG AGTTTGGCTG GAGCGGTTGC GGCAGGAGTC GCGGTTTGAT TGGCTGGAGC AGCCATTGGC GGTTGATGAC CACGAAGGGC TCGAGCAACT GGCCAAACGC GGGTCGGTGG CCCTCGATGA ATCACTTGAG CGTCGACCCG AGCTCCGTGA CAGCTGGATG GGGTGGCAGG TGCGGCGCCC TGCGGTAGAT GGCGATCCAC GGCCGTTGTT GCGGCAAATC CAAGCTGGAG TTCCCTATCG AATGGTGAGT ACGGCGTTCG AATCCGGGAT CGCTCGCCGT TGGGTGCACC ATTTGGCGGC GTTGCAGTGG GTTGGCCCAA CGCCTGCTGC CCCTGGCCTG GCGCCGGGAT GGTGCCCTGA AGGTGCCTTG TTCTCTCACA ATCCAGAGCT GGTCTGGGCA GCTGCAGGGT GA
|
Protein sequence | MTLQLLVRSF SFSLLHPLRT ATGVVAKRRG WLLRLCDGDT GAVGWGEVAP LRTEQWLHCK ILMGALPEEV SRHQLEVLIH QGPGAFGFGL GSALAELDGL VGDLSSQPWQ EAPSPAHLLP AGDQMLMALD QVLMDCPSSH SLTFKWKVAT EASEREQRWL EQLLVRLPRN ARLRLDANGG WDRSTAGVWL ERLRQESRFD WLEQPLAVDD HEGLEQLAKR GSVALDESLE RRPELRDSWM GWQVRRPAVD GDPRPLLRQI QAGVPYRMVS TAFESGIARR WVHHLAALQW VGPTPAAPGL APGWCPEGAL FSHNPELVWA AAG
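The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive (which matches the reported 972 bp) and that the CDS ends in a single stop codon that is not counted in the protein length. The `gc_percent` helper is shown on the first 20 bases of the gene sequence only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues in an unspliced CDS, assuming one trailing stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a sequence, ignoring spaces and other non-ACGT characters."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


# Values taken from the record above.
length = gene_length(2019288, 2020259)
residues = protein_length(length)
print(length, residues)  # 972 323

# First 20 bases of the gene sequence (the space is skipped).
print(gc_percent("ATGACGCTTC AGCTTCTGGT"))  # 50.0
```

Passing the full gene sequence to `gc_percent` would give a value to compare against the reported GC content field.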