Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Synpcc7942_1003 |
Symbol | |
ID | 3773931 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus elongatus PCC 7942 |
Kingdom | Bacteria |
Replicon accession | NC_007604 |
Strand | + |
Start bp | 1014434 |
End bp | 1015987 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637799423 |
Product | anthranilate synthase, component I |
Protein accession | YP_400020 |
Protein GI | 81299812 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.242829 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTTACC CGGATTTTGC AACGTTTGCC AGCTTGGCTG AGCAAGGCAA CTTCATTCCG GTCTATCAAG AGTGGGTCGC GGATTTGGAT ACGCCGGTTT CGGCTTGGTA CCGCATTTGC CGCGATCGCC CCTACAGCTT TTTGCTGGAA TCAGTCGAAG GGGGTGAGCA CCTGGGTCGC TATAGCTTTT TGGGTTGCGA TCCACTCTGG GTGCTAGAAG CGAGAGGCGA TCGCACAACT CGTCGGTTTC GGGATGGCTC GGAAGAAGTC TTTAGCGGCG ATCCCTTTGC AGCCCTCAAG CAATGTTTAG CGCCCTATCA GCCGGTGCAT TTGCCGCAAC TGCCCTCGGG CGTCGGCGGA CTGTTTGGCT TCTGGGGCTA TGAGCTGATG CGCTGGATTG AGCCACGCGT CCCGGTTCAC AGCGGCGGCG AAAACGATCT GCCCGACGGC TGCTGGATGC AGGTCGATAG CCTGATGATT TTCGATCAAG TCAAGCGTCG GCTCTACGCG ATCGCTTATG CGGATTTGCA AGCCGAACCA GATCTGCATC GCGCTTACGC GCTGGCCTGC AATCGTGTGC AGGAACTGGT CAATCGCTTC CAAGGATCAC TGAGCGCCAG CGATCGCCAA CTGCCGTGGC TGCCGCCTCA ATCGGCACCG TCTCGTCCCG TCGATTACCA AAGCAACACC ACCCAGGAGC AATTCTGCGC CAACGTACTG ACAGCACAGG ACTATATTCG CGCCGGCGAC ATCTTCCAAG TCGTGCTGTC GCAACGGCTA ACGACGCATT ACAGCGGCGA TCCCTTTGAT CTCTATCGAT CGCTGCGGCT GATCAATCCT TCCCCCTACA TGGCCTTTTT CCGCTTCGGC GACTGGCAGT TGATCGGCTC CAGTCCAGAA GTGATGGTCA AGGCTGAACA GGATCCACAC CAAAGCGATC GCCAAGTGGC CACTGTCCGC CCGATCGCGG GGACTCGTCC TCGGGGGCGC ACTGCACCAG AAGATGCCGC CCTTGCAACG GATCTGCTGG CTGATCCCAA GGAAGTGGCC GAGCACGTCA TGCTGGTCGA TCTCGGTCGC AATGACCTAG GCCGTGTCTG CGAGAAAGGC AGCGTCCGCG TCGATGAATT GATGGTGATT GAGCGCTACT CCCACGTCAT GCATATTGTC AGCAACGTGG TGGGCCTACT CGATCGCGAT CGCGACGCTT GGGATTTGCT GCGGGCAACT TTCCCAGCAG GGACGGTCAG CGGTGCGCCC AAGATTCGCG CTATGGAAAT CATCCATGAA TTGGAAGGCT GTCGGCGCGG ACCCTACTCC GGCGCCTATG GCTACTACGA TTTTGAGGGT CAGTTGAATA CGGCAATCAC GATCCGCACG ATGATCGTTC AGGCAGAAGG GAGTGGCCAT CGTGTCAGCG TGCAAGCAGG GGCTGGTGTC GTCGCTGATT CTGTGCCAAT CAAGGAGTAC GAAGAAACCT TGAACAAGGC GCGGGGTTTA CTGGAAGCGA TCCGTTGTCT ACAACCGCCT CAAGTGCCAG TTGCAGCGGG ATAA
|
Protein sequence | MIYPDFATFA SLAEQGNFIP VYQEWVADLD TPVSAWYRIC RDRPYSFLLE SVEGGEHLGR YSFLGCDPLW VLEARGDRTT RRFRDGSEEV FSGDPFAALK QCLAPYQPVH LPQLPSGVGG LFGFWGYELM RWIEPRVPVH SGGENDLPDG CWMQVDSLMI FDQVKRRLYA IAYADLQAEP DLHRAYALAC NRVQELVNRF QGSLSASDRQ LPWLPPQSAP SRPVDYQSNT TQEQFCANVL TAQDYIRAGD IFQVVLSQRL TTHYSGDPFD LYRSLRLINP SPYMAFFRFG DWQLIGSSPE VMVKAEQDPH QSDRQVATVR PIAGTRPRGR TAPEDAALAT DLLADPKEVA EHVMLVDLGR NDLGRVCEKG SVRVDELMVI ERYSHVMHIV SNVVGLLDRD RDAWDLLRAT FPAGTVSGAP KIRAMEIIHE LEGCRRGPYS GAYGYYDFEG QLNTAITIRT MIVQAEGSGH RVSVQAGAGV VADSVPIKEY EETLNKARGL LEAIRCLQPP QVPVAAG
|
| |