Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_2866 |
Symbol | |
ID | 7104387 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | - |
Start bp | 2956140 |
End bp | 2957549 |
Gene Length | 1410 bp |
Protein Length | 469 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 643475902 |
Product | anthranilate synthase component I-like protein |
Protein accession | YP_002373021 |
Protein GI | 218247650 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR01824] aminodeoxychorismate synthase, component I, clade 2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGATA TCTTAACAGG TTGGTATTGG CGATCACGTC CTTTGAACCA TCAAACAGGT TCACAAATAT TTGAGCGTTT ATTTAACGAT AATCAAACCA TTGCAACCCT CTTAGAAAGT CCCTTTCCTA CCCCCACTGA CTACCCTTAT CTTTCTCGCT ATTCTATTTG TGCAGGCCAA CCTCGTATTA TTAACAAAAA ACCTCAAGTA TGGACTCCTA AGCTCGGAGA AGTTTTCCCC TTTTTAAAGA GTTTATTAGA AAGAAATTTT TGTCTTTCTG AAAGCCCTAA TCATCTCCCT TTTATTGGAG GTTGGTTAGG ATGGCTAGGG TATGATTTGG CTTGGGAAAT TGAACAATTA CCCCATACAA ATCGAGATAT TTTACCCTTT CCTGTTGCCT ATTGGTATGA ACCCGAATCC TTTGCTATTC TCGATCATGT TGAACAAAGG CTATGGTTAG CCAGTACCAC CCTTGATCAA CTCGATGAAT TAGAACAAAA ATTAGAGCAA GATCTGCCTT TAATTCCTGA CCTTTTTACC CCTGCTTCTG GGCTTTCTTT CTATACGACT CAACAAGAAT ACGAAAATGC TGTCCGTCAA GCCAAAAAAT ATATCGAAGC AGGAGATATT TTTCAAGCCA ATCTTTCCTT GAGATTTCAT TCAACTACCG TTGCTGATAG TTGGACTATC TATCGAAATT TACAAAGAAT TAATCCTTCT CCTTTTGCGA GCTATTGGCG AACACCTTGG GGAGATGTTA TTAGTTGTTC TCCTGAAAGA TTAATTCAAT TACAAGGAAA TCAAGCCCAA ACTAGGCCAA TAGCAGGAAC ACGACCCCGT GGTAAAACAC CCGAACTCGA ACAACACTTA TTAGCCGAAT TAACCCGTGA TATTAAAGAA CAAGCCGAAC ATATTATGTT AGTTGATTTA GAACGAAACG ATTTAGGACG AGTGTGTCAG TGGGGATCGG TTTATGTGGA TGAATTATTA ACCATAGAAC GCTATAGTCA TGTGATTCAT TTAGTTAGTA ATGTTAGGGG AACTTTAGCC CGCGATCGCA ATGTCATTGA TCTAATTAAA GCCCTTTTTC CAGGGGGAAC CATCACCGGA TGTCCTAAAG TCCGTTGTCT AGAAATTATT GAAGAATTAG AACCTTTGCG TCGCAATCTT TTTTATGGTT CCTGTGGCTA TTTAGATCAA CGGGGAAATC TGGATTTAAA CATACTCATT CGGACACTTT TATCAACGTC TTTATCGAAT GGGTTAAAGG GCATTTGGGG ACAAGTAGGT GCGGGAATTG TCGCCGATAG TGACCCCGAA AAAGAATGGT ATGAGTCCCT ACAAAAAGCT CAAGCCCAGT TAGCGGCTTT GAATCAAGTC AGAAGTCAGA AGTCAGAAGT CAGAAGTTAA
|
Protein sequence | MTDILTGWYW RSRPLNHQTG SQIFERLFND NQTIATLLES PFPTPTDYPY LSRYSICAGQ PRIINKKPQV WTPKLGEVFP FLKSLLERNF CLSESPNHLP FIGGWLGWLG YDLAWEIEQL PHTNRDILPF PVAYWYEPES FAILDHVEQR LWLASTTLDQ LDELEQKLEQ DLPLIPDLFT PASGLSFYTT QQEYENAVRQ AKKYIEAGDI FQANLSLRFH STTVADSWTI YRNLQRINPS PFASYWRTPW GDVISCSPER LIQLQGNQAQ TRPIAGTRPR GKTPELEQHL LAELTRDIKE QAEHIMLVDL ERNDLGRVCQ WGSVYVDELL TIERYSHVIH LVSNVRGTLA RDRNVIDLIK ALFPGGTITG CPKVRCLEII EELEPLRRNL FYGSCGYLDQ RGNLDLNILI RTLLSTSLSN GLKGIWGQVG AGIVADSDPE KEWYESLQKA QAQLAALNQV RSQKSEVRS
|
| |