Gene Synpcc7942_0212 details


Gene Information

Locus tag: Synpcc7942_0212
Symbol:
ID: 3775820
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Synechococcus elongatus PCC 7942
Kingdom: Bacteria
Replicon accession: NC_007604
Strand:
Start bp: 214471
End bp: 215559
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 58%
IMG OID: 637798618
Product: chorismate synthase
Protein accession: YP_399231
Protein GI: 81299023
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0082] Chorismate synthase
TIGRFAM ID: [TIGR00033] chorismate synthase
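
The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The following minimal sketch checks that relationship. It assumes the coordinates are 1-based and inclusive, and that the CDS ends with one stop codon that is not counted in the protein length. Both assumptions are consistent with the listed values but are not stated by the record. The function names are hypothetical.

```python
def gene_length_bp(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return abs(end_bp - start_bp) + 1


def protein_length_aa(cds_length_bp):
    """Protein length, assuming the CDS ends with one uncounted stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(214471, 215559)
print(length)                     # 1089
print(protein_length_aa(length))  # 362
```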


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.468689
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCAGCA GCTTCGGCCA TCTTTTTCGC ATCAGCACCT TCGGTGAATC CCACGGGGGA 
GGCGTCGGTG TAGTTATCGA TGGCTGTCCG CCTCGGCTGG AAATTTCCGA AGCCGAAATT
CAATTTGAGC TCGATCGCCG CCGTCCGGGT CAAAGCAAAA TTACGACGCC GCGCAAAGAA
GCGGATCAGT GCGAAATTCT CTCGGGAGTC GTCGATGGCA AAACCCTCGG TACGCCGATC
GCGATCGTGG TACGCAATAA AGACCAGCGA TCGCAGGACT ATAGCGAAAT GCAGGTTGCT
TATCGGCCTT CCCATGCGGA CGCCACCTAC GACGCTAAGT ACGGTATTCG GGCGGTTGCA
GGCGGGGGGC GCTCCTCAGC GCGGGAAACG ATCGGTCGCG TAGCAGCTGG CGCGATCGCC
AAGAAACTGC TGCGGGAAAT TGCCGGTGTT GAGATCGTCG GCTACGTTAA ACGGATCAAG
GATCTGGAGG GGCAGATTGA TCCCGAAACC GTGACGCTGG AGCAAGTCGA AAGCACCATC
GTCCGCTGCC CCGATGAGGC GATCGCACCG CAGATGATTG ACCTGATTGA AGCGATCGGG
CGGGAAGGGG ATTCTCTCGG TGGTGTGGTC GAATGCGTGG CCCGTCGCGT TCCTCGCGGT
TTAGGCGAAC CCGTCTTCGA CAAGCTGGAA GCGGATTTGG CCAAAGCTTG TATGTCCTTG
CCCGCCACTA AAGGCTTTGA GATCGGCTCG GGCTTTGCTG GAACGGAAAT GACTGGCAGC
GAACATAATG ACGCCTTTTA CACCGATGAG CAGGGTCAAA TTCGCACTCG CACCAACCGT
AGCGGCGGCA CCCAAGGCGG CATCAGCAAC GGCGAAAACA TCGTGATTCG CGTGGCTTTC
AAACCGACTG CGACGATTCG CAAAGAGCAA GAAACCGTCA CCAACAGCGG CGAAGCCACC
ACTCTGGCTG CGCGGGGCCG CCACGATCCC TGTGTCTTAC CGCGGGCAGT GCCGATGGTG
GAAGCGATGG TTGCCCTTGT CCTTTGCGAT CACCTGCTGC GCCAACAAGC CCAATGCAGC
TGGTGGTAA
 
Protein sequence
MGSSFGHLFR ISTFGESHGG GVGVVIDGCP PRLEISEAEI QFELDRRRPG QSKITTPRKE 
ADQCEILSGV VDGKTLGTPI AIVVRNKDQR SQDYSEMQVA YRPSHADATY DAKYGIRAVA
GGGRSSARET IGRVAAGAIA KKLLREIAGV EIVGYVKRIK DLEGQIDPET VTLEQVESTI
VRCPDEAIAP QMIDLIEAIG REGDSLGGVV ECVARRVPRG LGEPVFDKLE ADLAKACMSL
PATKGFEIGS GFAGTEMTGS EHNDAFYTDE QGQIRTRTNR SGGTQGGISN GENIVIRVAF
KPTATIRKEQ ETVTNSGEAT TLAARGRHDP CVLPRAVPMV EAMVALVLCD HLLRQQAQCS
WW
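
The protein sequence above is the translation of the gene sequence under translation table 11. As a rough illustration, the sketch below translates the first and last blocks of the gene sequence and computes a GC percentage. It is plain Python, not the tool that produced this record. It relies on the assumption that table 11 assigns codons to amino acids the same way the standard code does and differs only in its permitted start codons, which does not matter here because the gene starts with ATG. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses these same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks and the final line of the gene sequence.
print(translate("ATGGGCAGCA GCTTCGGCCA TCTTTTTCGC"))  # MGSSFGHLFR
print(translate("TGGTGGTAA"))                          # WW
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way allows a comparison with the listed protein sequence and the GC content field.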