Gene Syncc9902_2040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9902_2040 
Symbol 
ID3743000 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9902 
KingdomBacteria 
Replicon accessionNC_007513 
Strand
Start bp1948992 
End bp1950086 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content57% 
IMG OID637772237 
Productchorismate synthase 
Protein accessionYP_378041 
Protein GI78185607 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCAGCA GCTTCGGCGA TCTCTTCCGC ATCAGCACCT TCGGTGAATC CCACGGCGGA 
GGCGTTGGTG TGATTGTGGA GGGCTGCCCG CCAAGGCTGG AACTGGATTT AGACGAAATC
CAAGCAGAAC TCAACCGTCG CAAACCAGGA CAAAGTCACA TCACCACACC CCGAAAAGAA
GCCGATCAGG TGGAGATTCT GAGTGGCCTG CTGGACGGAA AAACCACCCT GGGCACCCCC
ATTGCCATGT TGGTGCGTAA TAAAGATCAG CGACCCGGGG ATTACTCGGA CATGGCCGTG
GCCTTTCGAC CCTCCCATGC AGATGCCACC TATCAATCCA AATACGGCAT CCAAGCCCGT
AGCGGTGGCG GACGAGCATC AGCCCGAGAA ACCATTGGCA GAGTGGCGGC TGGCGCCATT
GCCAAACAAC TTTTGCGTAA AGCAGCTGGA ACTGAAATCC TGGCGTGGGT GAAGCAGATT
CACACAATCG AAGCCCATGG CATCGACCCA TCCACGGTTT CCATGAATGA CATTGAAGCC
AACATTGTGC GCTGTCCAGA AGCCTCCGTG GCCAACCAGA TGATCGAGCG CATTGAGGCG
ATTGGCCGAG AAGGCGATTC CTGCGGTGGA GTGATCGAGT GCGTTGTCAG GCAGCCTGCC
GTGGGACTAG GGATGCCGGT CTTCGACAAA TTGGAAGCCG ATCTCGCCAA GGCGGTGATG
TCGCTACCAG CCACGAAGGG ATTTGAGATC GGCTCAGGGT TTAGTGGAAC CCTCTTAAAA
GGCAGCGAAC ACAATGACGC CTTCATCCCA GGAGACGATG GCCGCCTCCA TACCGCCACG
AACAACTCCG GGGGCATCCA AGGCGGGATC AGCAACGGAG AACCGATCGT GATCAGAGTG
GGATTCAAAC CAACGGCCAC CATTCGCAAA GAACAGCAGA CCATCGACTC TGATGGCAAT
GCGACAACCC TGGCCGCAAA AGGGCGTCAC GACCCCTGCG TACTGCCTCG GGCCGTACCC
ATGGTGGAAG CGATGGTGGC CCTGACGCTG GCAGATCATC TGCTCAGACA ACAGGGCCAA
TGCAGCCTGT GGTGA
 
Protein sequence
MGSSFGDLFR ISTFGESHGG GVGVIVEGCP PRLELDLDEI QAELNRRKPG QSHITTPRKE 
ADQVEILSGL LDGKTTLGTP IAMLVRNKDQ RPGDYSDMAV AFRPSHADAT YQSKYGIQAR
SGGGRASARE TIGRVAAGAI AKQLLRKAAG TEILAWVKQI HTIEAHGIDP STVSMNDIEA
NIVRCPEASV ANQMIERIEA IGREGDSCGG VIECVVRQPA VGLGMPVFDK LEADLAKAVM
SLPATKGFEI GSGFSGTLLK GSEHNDAFIP GDDGRLHTAT NNSGGIQGGI SNGEPIVIRV
GFKPTATIRK EQQTIDSDGN ATTLAAKGRH DPCVLPRAVP MVEAMVALTL ADHLLRQQGQ
CSLW