Gene Synpcc7942_1003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_1003 
Symbol 
ID3773931 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp1014434 
End bp1015987 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content58% 
IMG OID637799423 
Productanthranilate synthase, component I 
Protein accessionYP_400020 
Protein GI81299812 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.242829 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTTACC CGGATTTTGC AACGTTTGCC AGCTTGGCTG AGCAAGGCAA CTTCATTCCG 
GTCTATCAAG AGTGGGTCGC GGATTTGGAT ACGCCGGTTT CGGCTTGGTA CCGCATTTGC
CGCGATCGCC CCTACAGCTT TTTGCTGGAA TCAGTCGAAG GGGGTGAGCA CCTGGGTCGC
TATAGCTTTT TGGGTTGCGA TCCACTCTGG GTGCTAGAAG CGAGAGGCGA TCGCACAACT
CGTCGGTTTC GGGATGGCTC GGAAGAAGTC TTTAGCGGCG ATCCCTTTGC AGCCCTCAAG
CAATGTTTAG CGCCCTATCA GCCGGTGCAT TTGCCGCAAC TGCCCTCGGG CGTCGGCGGA
CTGTTTGGCT TCTGGGGCTA TGAGCTGATG CGCTGGATTG AGCCACGCGT CCCGGTTCAC
AGCGGCGGCG AAAACGATCT GCCCGACGGC TGCTGGATGC AGGTCGATAG CCTGATGATT
TTCGATCAAG TCAAGCGTCG GCTCTACGCG ATCGCTTATG CGGATTTGCA AGCCGAACCA
GATCTGCATC GCGCTTACGC GCTGGCCTGC AATCGTGTGC AGGAACTGGT CAATCGCTTC
CAAGGATCAC TGAGCGCCAG CGATCGCCAA CTGCCGTGGC TGCCGCCTCA ATCGGCACCG
TCTCGTCCCG TCGATTACCA AAGCAACACC ACCCAGGAGC AATTCTGCGC CAACGTACTG
ACAGCACAGG ACTATATTCG CGCCGGCGAC ATCTTCCAAG TCGTGCTGTC GCAACGGCTA
ACGACGCATT ACAGCGGCGA TCCCTTTGAT CTCTATCGAT CGCTGCGGCT GATCAATCCT
TCCCCCTACA TGGCCTTTTT CCGCTTCGGC GACTGGCAGT TGATCGGCTC CAGTCCAGAA
GTGATGGTCA AGGCTGAACA GGATCCACAC CAAAGCGATC GCCAAGTGGC CACTGTCCGC
CCGATCGCGG GGACTCGTCC TCGGGGGCGC ACTGCACCAG AAGATGCCGC CCTTGCAACG
GATCTGCTGG CTGATCCCAA GGAAGTGGCC GAGCACGTCA TGCTGGTCGA TCTCGGTCGC
AATGACCTAG GCCGTGTCTG CGAGAAAGGC AGCGTCCGCG TCGATGAATT GATGGTGATT
GAGCGCTACT CCCACGTCAT GCATATTGTC AGCAACGTGG TGGGCCTACT CGATCGCGAT
CGCGACGCTT GGGATTTGCT GCGGGCAACT TTCCCAGCAG GGACGGTCAG CGGTGCGCCC
AAGATTCGCG CTATGGAAAT CATCCATGAA TTGGAAGGCT GTCGGCGCGG ACCCTACTCC
GGCGCCTATG GCTACTACGA TTTTGAGGGT CAGTTGAATA CGGCAATCAC GATCCGCACG
ATGATCGTTC AGGCAGAAGG GAGTGGCCAT CGTGTCAGCG TGCAAGCAGG GGCTGGTGTC
GTCGCTGATT CTGTGCCAAT CAAGGAGTAC GAAGAAACCT TGAACAAGGC GCGGGGTTTA
CTGGAAGCGA TCCGTTGTCT ACAACCGCCT CAAGTGCCAG TTGCAGCGGG ATAA
 
Protein sequence
MIYPDFATFA SLAEQGNFIP VYQEWVADLD TPVSAWYRIC RDRPYSFLLE SVEGGEHLGR 
YSFLGCDPLW VLEARGDRTT RRFRDGSEEV FSGDPFAALK QCLAPYQPVH LPQLPSGVGG
LFGFWGYELM RWIEPRVPVH SGGENDLPDG CWMQVDSLMI FDQVKRRLYA IAYADLQAEP
DLHRAYALAC NRVQELVNRF QGSLSASDRQ LPWLPPQSAP SRPVDYQSNT TQEQFCANVL
TAQDYIRAGD IFQVVLSQRL TTHYSGDPFD LYRSLRLINP SPYMAFFRFG DWQLIGSSPE
VMVKAEQDPH QSDRQVATVR PIAGTRPRGR TAPEDAALAT DLLADPKEVA EHVMLVDLGR
NDLGRVCEKG SVRVDELMVI ERYSHVMHIV SNVVGLLDRD RDAWDLLRAT FPAGTVSGAP
KIRAMEIIHE LEGCRRGPYS GAYGYYDFEG QLNTAITIRT MIVQAEGSGH RVSVQAGAGV
VADSVPIKEY EETLNKARGL LEAIRCLQPP QVPVAAG