Gene PCC8801_1952 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_1952 
Symbol 
ID7102897 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp2029530 
End bp2030846 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content35% 
IMG OID643475014 
Productcytosine deaminase-like protein 
Protein accessionYP_002372146 
Protein GI218246775 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTGAGA TTTTAGCAAG CGATCGCTAT TGGCTAAAGA ATGCCCATAT TCCCTTATCT 
TTACTAGAAA ATACGCAATT AACCTCCGAT ACCAAGGAAG GGTTATGTCG CGTTGATCTA
GAAATTTTTC GCGGTAAAAT TAATACTATC GTTCCCGCTA ATTCTGTGGC TTCAGAGACT
CCTGAAATAG AGTTAAAAGG AAGAATTATT TTACCGGGGT TTATTGATCT GCATACCCAT
TTAGATAAGG GTCATATTTG GGAGCGATCG CCTAATTTAG AAGGAACGTT TGATCAAGCT
ATCATCAGCA TTACAAAAGA TGCTCAATTA TATTGGAATT TAGAGGATAT TTATCGTCGT
TTTGAATTTG CTCTTAAGTG TAGTTATGCT CATGGAACAA CGGTTTTGAG AACCCATCTT
GATACTTATA AAAACCAATC AGATATCAGT TTTGAAGTAT TTAAAACCTT ACAATCTCAG
TGGAAAGATA AACTGATTCT TCAAGCAGTT TCTTTAGCTA CTTTGGATTA TTATTTAACA
CCTCAAGGAC TAACTTTAGC GGATAAAGTT GCTGAAATTG GAGGTATTTT AGGAGGAGTT
GCTTATACTA ATCCTGAGTT AGATATTCAA CTTAAACAAG TTTTTAAATT AGCTCAAGAA
CGGAATTTAG ATTTAGATTT TCATGTCGAT GAAAATGGTG ATCCTAACTC AATTTGCTTA
CAAAAAGTTG CTGAAACGGC TATTAAATCT CAATTTGCTA ATCAAATTAT TTGTGGTCAC
TGTTGTAGTT TAGCAGTACA AACCCCAGAA ATAGCCAATA AAACGATTAA TTTAGTCAAA
GAAGCGGGTA TTGCTATTGT TAGTTTACCG ATGTGTAATC TGTATCTTCA AGATCGTACT
CCTAACCAGA CTCCCTATTG GAGAGGTATC ACGAAAGTTC ATGAGTTAAA AAATGCAGGA
GTTCCGGTTA CTTTTGCCAG TGATAATTGT CGAGATCCTT TTTATGGATT TGGGGATCAT
GATATGCTAG AAGTGTTTAA AGAAGCCGTC AAAATTGGTC ATTTAGATAC GGGTTATGAT
GATTGGTGTA ATAGTGTTAC TAAAACCCCT GCTGATTTAA TGGGACTGTC CCAATTCGGA
AGAATCAAAG TCGGTTTAAA CGCTGATTTA ATTATTTTTA AAGCACGTTA TTTTAGTGAA
TTATTCTCTC GTTCTCAACG CGATCGCATT GTTTTACGAA AGGGTAAACC TATTGATACG
ACTTTACCTG ATTATTCAGA ATTGGATGAT TTAGTTTTAA AAGGAATTGT ATCATAA
 
Protein sequence
MFEILASDRY WLKNAHIPLS LLENTQLTSD TKEGLCRVDL EIFRGKINTI VPANSVASET 
PEIELKGRII LPGFIDLHTH LDKGHIWERS PNLEGTFDQA IISITKDAQL YWNLEDIYRR
FEFALKCSYA HGTTVLRTHL DTYKNQSDIS FEVFKTLQSQ WKDKLILQAV SLATLDYYLT
PQGLTLADKV AEIGGILGGV AYTNPELDIQ LKQVFKLAQE RNLDLDFHVD ENGDPNSICL
QKVAETAIKS QFANQIICGH CCSLAVQTPE IANKTINLVK EAGIAIVSLP MCNLYLQDRT
PNQTPYWRGI TKVHELKNAG VPVTFASDNC RDPFYGFGDH DMLEVFKEAV KIGHLDTGYD
DWCNSVTKTP ADLMGLSQFG RIKVGLNADL IIFKARYFSE LFSRSQRDRI VLRKGKPIDT
TLPDYSELDD LVLKGIVS