Gene Information |
Locus tag | Cyan8802_3230 |
Symbol | |
ID | 8392566 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | + |
Start bp | 3304499 |
End bp | 3305908 |
Gene Length | 1410 bp |
Protein Length | 469 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 644981177 |
Product | anthranilate synthase component I-like protein |
Protein accession | YP_003138903 |
Protein GI | 257061015 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR01824] aminodeoxychorismate synthase, component I, clade 2 |
|
|
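The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names `gene_length` and `protein_length` are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3304499, 3305908)
print(length, protein_length(length))  # prints "1410 469"
```

Both values match the Gene Length (1410 bp) and Protein Length (469 aa) fields of this record.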
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.359242 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGATA TCTTAACAGG TTGGTATTGG CGATCGCGTC CTTTGAACCA TCAAACAGGT TCACAAATAT TTGAGCGTTT ATTTAACGAT AATCAAACCA TTGCAACCCT CTTAGAAAGT CCCTTTCCTA CCCTCACTGA CTATCCCTAT CTTTCTCGCT ATTCTATTTG TGCAGGGAAA CCTCGTATTA TTAACAAAAA ACCTCAAATA TGGACTCCTA AACTGGGAGA AGTTTTCCCC TTTTTACATA GTTTATTACA AAGGAATTTT TGTCTTTCTG AAAGCCCTAA TCATCTCCCT TTTATTGGAG GTTGGTTAGG ATGGTTAGGG TATGATTCAG CTTGGGAAAT TGAACAATTA CCCCATAAAA ATCGAGATAT TTTACCCTTT CCTGTTGCCT ATTGGTATGA ACCCGAATCC TTTGCCATTC TCGATCATGT TGAACAAACT CTATGGTTAG CCAGTACCAC TCTTGATCAA TTAGATCAAT TAGAACAAAA ATTAGATCAA GATCCTTCTT TAGTCTCTGA CATTTTTACC CCGCCTTCTT CTATTTCTTT TTATACCACT CAACAAGAAT ACGAAAATAC TGTCCGTCAG GCTAAAAAAT ATATTGAAGC AGGAGATATT TTTCAAGCCA ATCTTTCCTT AAGATTTCAT ACAACTACTG CTGCTAATAG TTGGACTATT TATCGAAATT TACAAAAAAT TAATCCTTCT CCTTTTGCGA GTTATTGGCG AACACCTTGG GGAGATGTTA TTAGTTGTTC TCCTGAAAGA TTAATTCAAT TACAAGGAAA TCAAGCCCAA ACTAGGCCAA TAGCAGGAAC ACGACCCCGT GGTAAAACAC CCGAACTCGA ACAACACTTA TTAGCCGAAT TAACCCGTGA TATTAAAGAA CAAGCCGAAC ATATTATGTT AGTTGATTTA GAACGAAACG ATTTAGGACG AGTGTGTCAG TGGGGATCGG TTTATGTGGA TGAATTATTA ACCATAGAAC GCTATAGTCA TGTGATTCAT TTAGTTAGTA ATGTTAGGGG AACTTTAGCC CGCGATCGCA ATGTCATTGA TCTAATTAAA GCCCTTTTTC CAGGGGGAAC CATCACCGGA TGTCCTAAAG TCCGTTGTCT AGAAATTATT GAAGAATTAG AACCTTTGCG TCGCAATCTT TTTTATGGTT CCTGTGGCTA TTTAGATCAA CGGGGAAATC TGGATTTAAA CATACTCATT CGGACACTTT TATCAACGTC TTTATCGAAT GGGTTAAAGG GTATTTGGGG ACAAGTAGGT GCGGGAATTG TCGCCGATAG TGACCCCGAA AAAGAATGGT ATGAGTCCCT ACAAAAAGCT CAAGCCCAGT TAGCGGCTTT GAATGAAGTC AGAAGCCAGA AGTCAGAAGT CAGAAGTTAA
|
Protein sequence | MTDILTGWYW RSRPLNHQTG SQIFERLFND NQTIATLLES PFPTLTDYPY LSRYSICAGK PRIINKKPQI WTPKLGEVFP FLHSLLQRNF CLSESPNHLP FIGGWLGWLG YDSAWEIEQL PHKNRDILPF PVAYWYEPES FAILDHVEQT LWLASTTLDQ LDQLEQKLDQ DPSLVSDIFT PPSSISFYTT QQEYENTVRQ AKKYIEAGDI FQANLSLRFH TTTAANSWTI YRNLQKINPS PFASYWRTPW GDVISCSPER LIQLQGNQAQ TRPIAGTRPR GKTPELEQHL LAELTRDIKE QAEHIMLVDL ERNDLGRVCQ WGSVYVDELL TIERYSHVIH LVSNVRGTLA RDRNVIDLIK ALFPGGTITG CPKVRCLEII EELEPLRRNL FYGSCGYLDQ RGNLDLNILI RTLLSTSLSN GLKGIWGQVG AGIVADSDPE KEWYESLQKA QAQLAALNEV RSQKSEVRS
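The relationship between the two sequences above can be illustrated with a minimal sketch that strips the 10-base block spacing, computes GC content, and translates the coding strand. It assumes the standard codon assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons); the helper names `clean`, `gc_content`, and `translate` are hypothetical, and only the first 30 bases of the gene are used as input.

```python
from itertools import product

# Standard codon assignments in NCBI "TCAG" order (assumed to apply to
# internal codons under translation table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


prefix = "ATGACTGATA TCTTAACAGG TTGGTATTGG"  # first 30 bases of the gene sequence
print(translate(prefix))  # prints "MTDILTGWYW"
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_content` would give a value to compare against the GC content field.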