Gene Information |
Locus tag | Synpcc7942_0212 |
Symbol | |
ID | 3775820 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus elongatus PCC 7942 |
Kingdom | Bacteria |
Replicon accession | NC_007604 |
Strand | + |
Start bp | 214471 |
End bp | 215559 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637798618 |
Product | chorismate synthase |
Protein accession | YP_399231 |
Protein GI | 81299023 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.468689 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCAGCA GCTTCGGCCA TCTTTTTCGC ATCAGCACCT TCGGTGAATC CCACGGGGGA GGCGTCGGTG TAGTTATCGA TGGCTGTCCG CCTCGGCTGG AAATTTCCGA AGCCGAAATT CAATTTGAGC TCGATCGCCG CCGTCCGGGT CAAAGCAAAA TTACGACGCC GCGCAAAGAA GCGGATCAGT GCGAAATTCT CTCGGGAGTC GTCGATGGCA AAACCCTCGG TACGCCGATC GCGATCGTGG TACGCAATAA AGACCAGCGA TCGCAGGACT ATAGCGAAAT GCAGGTTGCT TATCGGCCTT CCCATGCGGA CGCCACCTAC GACGCTAAGT ACGGTATTCG GGCGGTTGCA GGCGGGGGGC GCTCCTCAGC GCGGGAAACG ATCGGTCGCG TAGCAGCTGG CGCGATCGCC AAGAAACTGC TGCGGGAAAT TGCCGGTGTT GAGATCGTCG GCTACGTTAA ACGGATCAAG GATCTGGAGG GGCAGATTGA TCCCGAAACC GTGACGCTGG AGCAAGTCGA AAGCACCATC GTCCGCTGCC CCGATGAGGC GATCGCACCG CAGATGATTG ACCTGATTGA AGCGATCGGG CGGGAAGGGG ATTCTCTCGG TGGTGTGGTC GAATGCGTGG CCCGTCGCGT TCCTCGCGGT TTAGGCGAAC CCGTCTTCGA CAAGCTGGAA GCGGATTTGG CCAAAGCTTG TATGTCCTTG CCCGCCACTA AAGGCTTTGA GATCGGCTCG GGCTTTGCTG GAACGGAAAT GACTGGCAGC GAACATAATG ACGCCTTTTA CACCGATGAG CAGGGTCAAA TTCGCACTCG CACCAACCGT AGCGGCGGCA CCCAAGGCGG CATCAGCAAC GGCGAAAACA TCGTGATTCG CGTGGCTTTC AAACCGACTG CGACGATTCG CAAAGAGCAA GAAACCGTCA CCAACAGCGG CGAAGCCACC ACTCTGGCTG CGCGGGGCCG CCACGATCCC TGTGTCTTAC CGCGGGCAGT GCCGATGGTG GAAGCGATGG TTGCCCTTGT CCTTTGCGAT CACCTGCTGC GCCAACAAGC CCAATGCAGC TGGTGGTAA
|
Protein sequence | MGSSFGHLFR ISTFGESHGG GVGVVIDGCP PRLEISEAEI QFELDRRRPG QSKITTPRKE ADQCEILSGV VDGKTLGTPI AIVVRNKDQR SQDYSEMQVA YRPSHADATY DAKYGIRAVA GGGRSSARET IGRVAAGAIA KKLLREIAGV EIVGYVKRIK DLEGQIDPET VTLEQVESTI VRCPDEAIAP QMIDLIEAIG REGDSLGGVV ECVARRVPRG LGEPVFDKLE ADLAKACMSL PATKGFEIGS GFAGTEMTGS EHNDAFYTDE QGQIRTRTNR SGGTQGGISN GENIVIRVAF KPTATIRKEQ ETVTNSGEAT TLAARGRHDP CVLPRAVPMV EAMVALVLCD HLLRQQAQCS WW
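The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon (the gene sequence shown ends in TAA). The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record for Synpcc7942_0212.
length = gene_length_bp(214471, 215559)
print(length)                     # 1089
print(protein_length_aa(length))  # 362
```

Both results agree with the Gene Length (1089 bp) and Protein Length (362 aa) reported in the record.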