Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_17241 |
Symbol | codA |
ID | 4779432 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 1409970 |
End bp | 1411283 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640085009 |
Product | putative cytosine deaminase |
Protein accession | YP_001015544 |
Protein GI | 124026429 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.918771 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACTTTAG AAAAAGAAGA ATTCAATTTA TCTACTAAAG CATCAGGTCG TATTGATGTG TTAGTACCGA GATGCTTGAT TGGAGAGGGT GCAAATATTC TGGGAGTAAC AGTTGACTTT GAAGGGCTAT GTTCGCTTCA AGTTGAGTGG AGGCATGGAA AAATCTGCTC AATTAAAGGT TTAAAAGATG CTTCAAAAGT TCCTAATGAA ATCCTTCTAC CTAGATTTTC TGAACCTCAT GCTCATTTAG ATAAAGCATT TTCATGGTCT CGAGCTCCTA ATTATAAAGG GAGTTATCAA GAAGCTTTAG TAGCCAATTT AAATGACTAT AAAAGTAGGT CTCAAGGCCA ATTGCTTTTT AGTGTTGAAA AATCTCTGAA CCTAGCCCTT GTTAATGGTA TCCGTGCAAT TAGATCTCAT ATAGATAGCT TTGGAGAAAA TGTAATGAGA GATTGGGACC TGTTAGATGA TATTAGAAAA AAATGGCGAG ACAAAATTTT CTTACAATTT GTGGCTTTAG TCCCATTAGA ATTTTGGCAA ACGTATGAAG GTGAGCTTTT AGCGCAAAGA GTTGCTTTGA ATGGAGATCT CCTAGGAGGA GTCATAGCTC CTCCTTTTAA TAAAAAGAAG ACAATTCAGT CTTTATTACA CTTAGTTCAA CTTGCAAATA GACTTAATTG TGATATTGAT CTTCATATTG ATGAGTCTCA GTCTTGTCCT GCTGCAGGGG TGAAATTACT TCTTGAAGTA TTAGGCCGTA TTAAAAATGA GATATCAATA ACATGTAGTC ATTTGAGCAG TATGGCTTTA CTAAGAGAAA AATCGATTTC AAATTTGGCA AAGGAAATAG CTGAAAAGAA ATTAAATGTT GTTGCTTTAC CACTCACGAA TTCTTGGCTG CTCGGTAGAA ATGAACGATC TACTTCAATT AAAAGACCTC TAGCTCCAAT ATTTCAACTT CAAAAGGCTG GGGTTGTTGT ATCTGTAGGA GGAGACAATG TAAATGATGC ATGGTTTCCA TTCACTAATT TTGATCCAAT AAATTTAATG GCTTTTTCAA TGCCAATTGC TCATTTAACT CCTTGGGAGA GATTGGGCCT TTCTCCATTT ACTTCATCCG CAGCAAGTAT TCTTAATCTT CAATGGGATG GCGTTTTACA AAAAGGAAGT CCTGCCGATT TTGTTTTGTT AGATTCAAAT AGTTGGGTAA AAGCTTTGTC TGAAAGACCT AAAAGAAGAG TAGTAGTTAA TGGCGAATTT TTAAATGAAT TGCCTAAAAA CAAAAAATCA ACATTCAACA ATTCTCACTC ATGA
|
Protein sequence | MTLEKEEFNL STKASGRIDV LVPRCLIGEG ANILGVTVDF EGLCSLQVEW RHGKICSIKG LKDASKVPNE ILLPRFSEPH AHLDKAFSWS RAPNYKGSYQ EALVANLNDY KSRSQGQLLF SVEKSLNLAL VNGIRAIRSH IDSFGENVMR DWDLLDDIRK KWRDKIFLQF VALVPLEFWQ TYEGELLAQR VALNGDLLGG VIAPPFNKKK TIQSLLHLVQ LANRLNCDID LHIDESQSCP AAGVKLLLEV LGRIKNEISI TCSHLSSMAL LREKSISNLA KEIAEKKLNV VALPLTNSWL LGRNERSTSI KRPLAPIFQL QKAGVVVSVG GDNVNDAWFP FTNFDPINLM AFSMPIAHLT PWERLGLSPF TSSAASILNL QWDGVLQKGS PADFVLLDSN SWVKALSERP KRRVVVNGEF LNELPKNKKS TFNNSHS
|
| |