Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_1952 |
Symbol | |
ID | 7102897 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | + |
Start bp | 2029530 |
End bp | 2030846 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 643475014 |
Product | cytosine deaminase-like protein |
Protein accession | YP_002372146 |
Protein GI | 218246775 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTGAGA TTTTAGCAAG CGATCGCTAT TGGCTAAAGA ATGCCCATAT TCCCTTATCT TTACTAGAAA ATACGCAATT AACCTCCGAT ACCAAGGAAG GGTTATGTCG CGTTGATCTA GAAATTTTTC GCGGTAAAAT TAATACTATC GTTCCCGCTA ATTCTGTGGC TTCAGAGACT CCTGAAATAG AGTTAAAAGG AAGAATTATT TTACCGGGGT TTATTGATCT GCATACCCAT TTAGATAAGG GTCATATTTG GGAGCGATCG CCTAATTTAG AAGGAACGTT TGATCAAGCT ATCATCAGCA TTACAAAAGA TGCTCAATTA TATTGGAATT TAGAGGATAT TTATCGTCGT TTTGAATTTG CTCTTAAGTG TAGTTATGCT CATGGAACAA CGGTTTTGAG AACCCATCTT GATACTTATA AAAACCAATC AGATATCAGT TTTGAAGTAT TTAAAACCTT ACAATCTCAG TGGAAAGATA AACTGATTCT TCAAGCAGTT TCTTTAGCTA CTTTGGATTA TTATTTAACA CCTCAAGGAC TAACTTTAGC GGATAAAGTT GCTGAAATTG GAGGTATTTT AGGAGGAGTT GCTTATACTA ATCCTGAGTT AGATATTCAA CTTAAACAAG TTTTTAAATT AGCTCAAGAA CGGAATTTAG ATTTAGATTT TCATGTCGAT GAAAATGGTG ATCCTAACTC AATTTGCTTA CAAAAAGTTG CTGAAACGGC TATTAAATCT CAATTTGCTA ATCAAATTAT TTGTGGTCAC TGTTGTAGTT TAGCAGTACA AACCCCAGAA ATAGCCAATA AAACGATTAA TTTAGTCAAA GAAGCGGGTA TTGCTATTGT TAGTTTACCG ATGTGTAATC TGTATCTTCA AGATCGTACT CCTAACCAGA CTCCCTATTG GAGAGGTATC ACGAAAGTTC ATGAGTTAAA AAATGCAGGA GTTCCGGTTA CTTTTGCCAG TGATAATTGT CGAGATCCTT TTTATGGATT TGGGGATCAT GATATGCTAG AAGTGTTTAA AGAAGCCGTC AAAATTGGTC ATTTAGATAC GGGTTATGAT GATTGGTGTA ATAGTGTTAC TAAAACCCCT GCTGATTTAA TGGGACTGTC CCAATTCGGA AGAATCAAAG TCGGTTTAAA CGCTGATTTA ATTATTTTTA AAGCACGTTA TTTTAGTGAA TTATTCTCTC GTTCTCAACG CGATCGCATT GTTTTACGAA AGGGTAAACC TATTGATACG ACTTTACCTG ATTATTCAGA ATTGGATGAT TTAGTTTTAA AAGGAATTGT ATCATAA
|
Protein sequence | MFEILASDRY WLKNAHIPLS LLENTQLTSD TKEGLCRVDL EIFRGKINTI VPANSVASET PEIELKGRII LPGFIDLHTH LDKGHIWERS PNLEGTFDQA IISITKDAQL YWNLEDIYRR FEFALKCSYA HGTTVLRTHL DTYKNQSDIS FEVFKTLQSQ WKDKLILQAV SLATLDYYLT PQGLTLADKV AEIGGILGGV AYTNPELDIQ LKQVFKLAQE RNLDLDFHVD ENGDPNSICL QKVAETAIKS QFANQIICGH CCSLAVQTPE IANKTINLVK EAGIAIVSLP MCNLYLQDRT PNQTPYWRGI TKVHELKNAG VPVTFASDNC RDPFYGFGDH DMLEVFKEAV KIGHLDTGYD DWCNSVTKTP ADLMGLSQFG RIKVGLNADL IIFKARYFSE LFSRSQRDRI VLRKGKPIDT TLPDYSELDD LVLKGIVS
|
| |