Gene Syncc9605_0304 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9605_0304 
Symbol 
ID3737538 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9605 
KingdomBacteria 
Replicon accessionNC_007516 
Strand
Start bp307347 
End bp308447 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content61% 
IMG OID637774888 
Productchorismate synthase 
Protein accessionYP_380635 
Protein GI78211856 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.21806 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCATGG GCAGCAGCTT CGGCGACCTC TTCCGGATCA GCACCTTCGG TGAATCCCAC 
GGGGGAGGGG TGGGTGTGAT TGTTGAGGGC TGTCCACCAC GGCTCAACCT CAGCGTCGAA
TCGATTCAGG CCGAACTGGA TCGACGCAAG CCAGGCCAGA GTCACATCAC CACACCGCGC
AAGGAAGCGG ACCAGGTGCA AGTTCTCAGT GGCCTGCTGG ATGGCGAGAC CACGCTTGGC
ACCCCAATCG CCATGGTCGT GCGGAACAAG GACCAACGGC CGGGGGATTA CAAGGACATG
GCGGTCGCCT TCCGCCCATC CCATGCCGAT GCCACATACC AGGCGAAATA TGGAATCCAG
GCCCGCAGCG GTGGGGGCCG TGCATCGGCG CGGGAAACCA TCGGCCGTGT CGCTGCAGGT
GCAATCGCCA AGCAGCTGCT GAAACAAGCG GCAGGAACTG AAATCCTGGC CTGGGTGAAG
CGGATCCACA ACATCGAAGC CTCCGGCATC GACCCGCAAC GGGTTCAGCT CAGTGATGTA
GAAGCCAACA TCGTGCGATG TCCCGAATCG GCAGTAGCCG AGCGGATGGT TGAGCGCATC
GAAGCCATCG GCCGCGAAGG TGATTCCTGC GGCGGGGTGA TCGAATGCGT GGTGCGCCAT
CCCGCCGTTG GTTTAGGCAT GCCGGTGTTC GACAAACTCG AAGCCGACCT CGCCAAAGCT
GTGATGTCGT TACCGGCCAC CAAGGGTTTT GAAATTGGAT CCGGTTTCGA TGGAACGCTG
TTGAAAGGCA GCGAGCACAA CGATGCTTTT CTGCCGAGCG ACGACGGTCG GCTGAAGACC
GCCACCAACA ACTCCGGCGG CATCCAGGGG GGCATCAGCA ATGGTGAGCC GATTGTGATC
CGGGTAGCCT TCAAGCCAAC GGCCACGATC CGCAAAGAGC AGCAGACCAT CGATTCCGAT
GGCAAGGCCA CCACACTCGC AGGGAAAGGA CGGCATGACC CCTGCGTTCT GCCACGGGCT
GTACCGATGG TGGAGGCGAT GGTGGCACTC GTTCTGGCTG ATCACCTGCT GAGGCAACAG
GGGCAATGCA GCCTTTGGTG A
 
Protein sequence
MAMGSSFGDL FRISTFGESH GGGVGVIVEG CPPRLNLSVE SIQAELDRRK PGQSHITTPR 
KEADQVQVLS GLLDGETTLG TPIAMVVRNK DQRPGDYKDM AVAFRPSHAD ATYQAKYGIQ
ARSGGGRASA RETIGRVAAG AIAKQLLKQA AGTEILAWVK RIHNIEASGI DPQRVQLSDV
EANIVRCPES AVAERMVERI EAIGREGDSC GGVIECVVRH PAVGLGMPVF DKLEADLAKA
VMSLPATKGF EIGSGFDGTL LKGSEHNDAF LPSDDGRLKT ATNNSGGIQG GISNGEPIVI
RVAFKPTATI RKEQQTIDSD GKATTLAGKG RHDPCVLPRA VPMVEAMVAL VLADHLLRQQ
GQCSLW