Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_4199 |
Symbol | |
ID | 7104594 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | - |
Start bp | 4403241 |
End bp | 4404551 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 643477185 |
Product | 3-phytase |
Protein accession | YP_002374284 |
Protein GI | 218248913 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG4247] 3-phytase (myo-inositol-hexaphosphate 3-phosphohydrolase) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAAA TTATTAATAG TCTTGCGGTA GCTTGTTTCT CCTTCGCATT CAGTCATCAA GCAGCGATCG CAGTACAATT AGCGACACCA ACAGCAGAAA CCCCCCCTGT GATCGATGAA ATTGTTGATC CCCCTGGTGA CGCTGATGAT CCCGCGATCT GGCTTCATCC GAACGATCCA TCTCAAAGCT TAGTCTTGGG AACCTTGAAG AATGCGGGGT TAGGGGTCTA TGATTTGGGC GGAAATCTCT TACAATTAAT TCAACCCAAT AGTATCCGCT ACAATAACGT TGATCTCCTA TATGGCTTTT CTTTAGGGGG TAATTCGGTC GATTTAGCGA TCGCTTCAGA CCGTCAGAAT GATATCTTAG CGATCTTTAA AATTGACCCG ATGACTCGTT TATTAGAAAG CATCGTTTCC AACAACATCG GAACAATTTT TACCCCCGTC GGACAAGTTT CTAACGGGAC AACCACTGCT TACGGACTAG CAACCTATAC CGACTTATCG ACTGGGAAAA ATTATGTTTT TGTTTCTCAG CGAGAGACAG GGAACGTCGC TCAATTAGAG CTCTTTGATG ATGGTACGGG AAAAGTTAAT TACACTCAAG TGCGATCGCT CACCTTACCG ATTCCTCCAG GTGGCGTATT GGAAGATGCT CAAGTTGAAG GGATGGTCGC AGATCGGGAA TTGGGGTATG TTTATGTTGG GCAAGAAAAT CGGGGTATTT GGAAATTTTC AGCCTCTCCT AATGGCAGCA CCCTTGGTCA ACTCATTGAT GCTGTTAAAC CTGAAGGAAC GCATTTAGAA GCGGATGTAG AAGGATTAAC CATTTACTAT AGTGATAACG GAACAGGTTA TTTACTAGCA TCAAGTCAAG GAGACAATAC CTTTGCTATT TATGATCGCT TAGGCAATAA CAATTATTTA GGTAGCTTTT CTATTGTAGC ATCAGGGGGC ATTGATTCGG TTGAGGAATC GGATGGTGCT GATGTTATCA ATGTTCCTCT GGGATCTCAA TTTCCTTTTG GATTATTTGT GACACAAGAT GGTTCCAATG ACCCGCCAGA ACTGTTTTTT GATCCAGATG ATCAAGAATT TGTTAATGTT AGTTCTAACT TCAAATTTGT CCCTTGGGAA ACTATTGCTA ATGCTTTTGC ACCTAATCCC TTGCTGATTA ATACTAGCAG CTTTGATCCT CGAAATCCTT CCTCTATTAC CGTTCCAGAA CCGACCCTTT CTATTTGGGG ATTATTCGTT ATGTTAGGAG TTGGTTATCT GAAACGGGGA AAAAATCATC CTTTTCGTTA A
|
Protein sequence | MNKIINSLAV ACFSFAFSHQ AAIAVQLATP TAETPPVIDE IVDPPGDADD PAIWLHPNDP SQSLVLGTLK NAGLGVYDLG GNLLQLIQPN SIRYNNVDLL YGFSLGGNSV DLAIASDRQN DILAIFKIDP MTRLLESIVS NNIGTIFTPV GQVSNGTTTA YGLATYTDLS TGKNYVFVSQ RETGNVAQLE LFDDGTGKVN YTQVRSLTLP IPPGGVLEDA QVEGMVADRE LGYVYVGQEN RGIWKFSASP NGSTLGQLID AVKPEGTHLE ADVEGLTIYY SDNGTGYLLA SSQGDNTFAI YDRLGNNNYL GSFSIVASGG IDSVEESDGA DVINVPLGSQ FPFGLFVTQD GSNDPPELFF DPDDQEFVNV SSNFKFVPWE TIANAFAPNP LLINTSSFDP RNPSSITVPE PTLSIWGLFV MLGVGYLKRG KNHPFR
|
| |