Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_13521 |
Symbol | codA |
ID | 5731758 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 1221159 |
End bp | 1222454 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641285725 |
Product | putative cytosine deaminase |
Protein accession | YP_001551237 |
Protein GI | 159903893 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTGGCA AAAGAACAGA AGATTTAGAT ATTTCTTCTC TTGAGGCTTT TGTTCCTAGA GGCTTAATTA GTCAAAATTT AGCTTCAAAA GTTACTCTGG ATGGGCTTTC TCCCATTCGT ATTACTTGGC ATCAGGACCG TATTACTAAG CTGGAAGCAA TTGATATACC TAATAGATCA TCATTAAAAC TTCTTCTTCC TAGATTCGTT GAACCTCATG CTCATATTGA TAAAGCATTT ACTTGGCAAA ACTTCCCAAA TTTAGAGGGG AGCTACGAGG GTGCTTTAAA AGCAAACCTT CAAGAGCATC TAACCCGCTC AGTTATGCAA GTTCGACTTA GTGCGGAGAG ATGTTTACAG CTTGCTTTGA AAAATGGGCT GAGGGCAATT AGAAGTCATG TTGATAGTTT TGGGCCTTTT GCAGATCAAA CATGGGAGGT TCTAGTTGAT CTTCAAAATG AATGGAAGCA ATTAATTCAA TTGCAATTTG TAGCAATGGT CCCTTTGGAT TATTGGAGCA CTAAAAATGG TAAAACTTTT GCTAGAAAGG TCGCATGTCA TCAAGGACTC TTAGGAGGTG TGCTTACCCC ACCTTTTTGT AAAAGGGAGA CTAGAGATCA CTTATATAAA TTATTCAACC TGGCTAATGA TTTTGGTTGT GGAGTTGATT TACATATAGA TGAATCACAT ATTTCTCCAG GTGCTGGATT GAGGCAATTG ATAAACGTTC TTGATCAGAT TGACTTAAAA GTTCCTTTAA CTTGTAGTCA TTTAAGTAGC ATGAGCCTTC TAGGGAAACG ATCGTTGGTT GAATGTGCAG ATCGACTGGC CCACCATGAA GTAAATGTGG TGGCAATGCC TTTAACTAAT TCTTGGTTGC TTGGGGACAA AGAACTAAGC CCACCCTTAG AAAGGCCATT GGCTCCTATT GGGCATCTTC AAAATGCAGG AGTAAATGTT GCTATAGGAG GAGATAATGT TCAAGACCCT TGGTTCCCTG GCGGAAACTT TGATCCTCTT TCTTTGATGG CTTTCTCATT GCCTATAGCG CAGTTGGCAC CATGGAATAG ATTAGGACTT TCCCCTTTCA CAACTTCAGC TTCCAGAGTA ATGGGAATGG ATTGGGATGG AACTTTTTCA TTAGGTAGCC CTGCAGACTT TATTTACTTA GAGGCAAAAA ATTGGTCAGA GGTGCTTGCC TCTCCACCTA AGCGAAATGT AATAATTAAA GGTAAACATC TTCATGAGAA GAGTTTTGAT TTGAACCAAT CTACTTTTAA AAATTATTTA AAATGA
|
Protein sequence | MIGKRTEDLD ISSLEAFVPR GLISQNLASK VTLDGLSPIR ITWHQDRITK LEAIDIPNRS SLKLLLPRFV EPHAHIDKAF TWQNFPNLEG SYEGALKANL QEHLTRSVMQ VRLSAERCLQ LALKNGLRAI RSHVDSFGPF ADQTWEVLVD LQNEWKQLIQ LQFVAMVPLD YWSTKNGKTF ARKVACHQGL LGGVLTPPFC KRETRDHLYK LFNLANDFGC GVDLHIDESH ISPGAGLRQL INVLDQIDLK VPLTCSHLSS MSLLGKRSLV ECADRLAHHE VNVVAMPLTN SWLLGDKELS PPLERPLAPI GHLQNAGVNV AIGGDNVQDP WFPGGNFDPL SLMAFSLPIA QLAPWNRLGL SPFTTSASRV MGMDWDGTFS LGSPADFIYL EAKNWSEVLA SPPKRNVIIK GKHLHEKSFD LNQSTFKNYL K
|
| |