Gene Synpcc7942_0525 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_0525 
SymbolaroB 
ID3774763 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp509226 
End bp510332 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content57% 
IMG OID637798933 
Product3-dehydroquinate synthase 
Protein accessionYP_399544 
Protein GI81299336 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGTTC AAATCCCTGT TGCTCTGCCG CAAAACGCCT ACGAGATTGC GATCGCGAAT 
GGGGGATTGG CCGCAGCAGG GACTTGGTTG CAGCAAGCTG ATCTGAAGGC GGGCACGAAA
CTTCTGATTG TGACCAACCC GGCGATCGGG CGGCGTTACG GCGATCGCCT CGTGGCAGCA
CTGCAAGAAG CAGGTTTCAT CGTCGACTGC CTGACCCTAC CGGCTGGCGA ACGCTACAAA
ACGCCAGCAA CAGTTCAACG CATCTATGAC AAAGCCCTAG AACTGCGGCT GGAGCGCCGT
TCGGCCTTGG TCGCCTTGGG TGGCGGCGTG ATTGGTGACA TGACAGGTTT TGCGGCGGCA
ACCTGGTTGC GCGGCATTAG CTTTGTGCAG ATCCCGACCT CCCTGCTGGC AATGGTTGAT
GCTTCGATTG GGGGCAAAAC CGGCGTCAAT CATCCCCGTG GCAAAAACCT GATCGGGGCG
TTTCATCAAC CCAAGCTGGT GCTAATTGAT CCAGAAACGC TACAAACCCT GCCCGTACGG
GAGTTCCGTG CCGGCATGGC TGAGGTAATT AAGTATGGCG TGATTTGGGA TCGGGATTTG
TTTGAGCGGT TGGAAGCAAG CCCCTTTCTC GATCGCCCGC GATCGCTACC GGCCAATCTC
CTAACGCTGA TCTTAGAGCG CTCCTGTCGC GCCAAAGCAG AGGTGGTTGC CAAGGATGAA
AAAGAATCGG GCTTGCGGGC CATCCTCAAC TACGGCCATA CGATTGGCCA CGCCGTCGAA
AGTCTGACAG GCTATCGCAT CGTTAACCAT GGCGAAGCTG TGGCGATCGG GATGGTTGCG
GCGGGACGGC TGGCTGTGGC GCTAGGACTC TGGAATCAGG ATGAATGTGA TCGCCAAGAA
GCCGTGATTG CTAAAGCGGG CTTACCAACA CGCCTACCAG AAGGGATTGA TCAAGCTGCA
ATCGTCGAGG CTCTACAACT CGACAAAAAA GTGCAGGCAG GCAAGGTACG GTTTATTCTG
CCAACGACGC TCGGCCACGT CACGATTACC GATCAGGTAC CGAGCCAAAC CCTGCAAGAG
GTGCTGCAGG CGATCGCCAA CCCCTAA
 
Protein sequence
MSVQIPVALP QNAYEIAIAN GGLAAAGTWL QQADLKAGTK LLIVTNPAIG RRYGDRLVAA 
LQEAGFIVDC LTLPAGERYK TPATVQRIYD KALELRLERR SALVALGGGV IGDMTGFAAA
TWLRGISFVQ IPTSLLAMVD ASIGGKTGVN HPRGKNLIGA FHQPKLVLID PETLQTLPVR
EFRAGMAEVI KYGVIWDRDL FERLEASPFL DRPRSLPANL LTLILERSCR AKAEVVAKDE
KESGLRAILN YGHTIGHAVE SLTGYRIVNH GEAVAIGMVA AGRLAVALGL WNQDECDRQE
AVIAKAGLPT RLPEGIDQAA IVEALQLDKK VQAGKVRFIL PTTLGHVTIT DQVPSQTLQE
VLQAIANP