Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Syncc9605_2603 |
Symbol | |
ID | 3735313 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus sp. CC9605 |
Kingdom | Bacteria |
Replicon accession | NC_007516 |
Strand | + |
Start bp | 2417349 |
End bp | 2418647 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637777189 |
Product | anthranilate synthase |
Protein accession | YP_382885 |
Protein GI | 78214106 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0445726 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0808647 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCGCC CACACCGGCT TGAGCTGCCC TGGCAGGAGC CCCAATCCCT CGCGCACCAA CTGGCCCATG CCTACGGCGA GGAAGGGATG GTCTGGCTGG ATGGCGACGG CAGCAGCCTG GGCCGACGGG CCACCCTGGC AGTGGCACCC CAGGAGATCA TCTGCTGCCG CGGTCTCCCC GACGAGCCAG GAGCCAGCAA TCCCTTCGAG GCACTGCGGG GACTGGCCCC AGGGCATTGG TGCGGCTGGC TGAGCTATGA AGCCGCCGCC TGGGTGGAAC CGGGAAACCC CTGGGCCAGC GACGGCATGG CCACGCTGTG GATCGCCCGC CACGATCCGG TGCTTCGCTT TGATCTGCAA AAGCGCAGGC TGTGGATCGA AGCCAGCAGC ACGGCTGCTC TGGACCGCCT CACCCAACAG CTGGCCTCCG TCACTGAGCA GCCCAAAGGC AAGCCCCCAT CCATCCCCCT GACGGCCTGG CATCACCACA CCTCAGCAGA TCACTACGCC GCAGGTGTGC AGCGCATCCG TGATCTGATC GCGGCAGGCG ATCTCTTCCA AGCCAATCTC ACGGCTTGTT GCAGCACAGC TTGGCCCCAG GGAGGCAATG CCCTCGAGCT GTTTGTCACC CTTAGGGAAG CCTGCCCTGC TCCCTTTGCA GGGCTGATCA TCAGCGACCA AAACGAGGCG TTGTTGTCAT CGTCCCCGGA GCGGTTTCTG CAGGTGAGTG CCGAGGGAGC TGTACAAACC CGGCCGATCA AAGGCACCAG GCCTCGCCAT GGCGACCCCG AACAGGATGC GAATCTCGCC ACGGAACTCG TGTGCAGCGA TAAGGACCGG GCCGAGAACG TGATGATCGT CGACCTGCTG CGGAATGACC TCGGTCGTGC CTGCCAGCCG GGTTCGATCC AGGTTCCCCA ACTGGTGGGG CTCGAAAGTT ACGCCTCCGT GCATCACCTC ACCTCGGTCG TGGAGGGACA GCTGCAGGCC GGGTTGAGCT GGGTCGATCT CCTGGAAGCC AGTTGGCCTG GGGGGTCGAT CAGCGGGGCG CCGAAACTGC GGGCCTGCCA ACGTTTGCAT GAGCTCGAGC CCACCAGCCG AGGGCCTTAC TGCGGATCAC TGCTGCGGAT CGACTGGGAC GGCAGCTTCG ACAGCAACAT CTTGATCCGA TCTTTACTGC GCCAAGGCGA CACCCTGCGG GCCCATGCCG GCTGCGGAAT TGTCGCCGAC TCGGATCCCC TTGGCGAAGC AGAGGAGTTG ATGTGGAAAC TGCAGCCATT GCTGGAGGCG CTGGCATGA
|
Protein sequence | MIRPHRLELP WQEPQSLAHQ LAHAYGEEGM VWLDGDGSSL GRRATLAVAP QEIICCRGLP DEPGASNPFE ALRGLAPGHW CGWLSYEAAA WVEPGNPWAS DGMATLWIAR HDPVLRFDLQ KRRLWIEASS TAALDRLTQQ LASVTEQPKG KPPSIPLTAW HHHTSADHYA AGVQRIRDLI AAGDLFQANL TACCSTAWPQ GGNALELFVT LREACPAPFA GLIISDQNEA LLSSSPERFL QVSAEGAVQT RPIKGTRPRH GDPEQDANLA TELVCSDKDR AENVMIVDLL RNDLGRACQP GSIQVPQLVG LESYASVHHL TSVVEGQLQA GLSWVDLLEA SWPGGSISGA PKLRACQRLH ELEPTSRGPY CGSLLRIDWD GSFDSNILIR SLLRQGDTLR AHAGCGIVAD SDPLGEAEEL MWKLQPLLEA LA
|
| |