Gene Information |
Locus tag | A9601_15031 |
Symbol | codA |
ID | 4718225 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 1286448 |
End bp | 1287683 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 640079225 |
Product | putative cytosine deaminase |
Protein accession | YP_001009893 |
Protein GI | 123969035 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
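The coordinate and length fields above are mutually consistent, which a short check can confirm. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the variable and function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1286448
end_bp = 1287683


def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


gene_length = feature_length(start_bp, end_bp)

# A CDS is read in triplets; the final codon is assumed to be a stop
# codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1236 bp
print(protein_length, "aa")  # 411 aa
```

Both results agree with the Gene Length and Protein Length fields.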
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGTAATT CCGGCACAGC TGAGGTTCTT ATTCCCAGAA GCCTTTGTCT AATAGAAGAT ATAGATAACC TCATTATCGA TGTAGAGGAT TTATGTTCAG TTTCCATTAG TTGGGAGGAT GGATTAGTTT CTGAGTTAAA GCCTTTAAAA AATAAAATTA CAAAACCAAA AAATATTTTA TTCCCAAGAT TTGTTGAAAC GCATTCGCAT TTTGATAAAT CTTTTACATG GGCAGACTTT CCTAATCTGG AATCAAACTA CGGAGGAGCG TTATCAGTAA ATCTTGAAGA ACATAAAACT AGAACTACAG ATAAGGTTCT TGAGAGAGTT GATAAATCAT TAAAACTTGC CATACAAAAT GGATACCGAG CAATTAGAAG TCATATCGAT ACATACAAAG GTCAATCTAT TGACATTTGG ATTGAACTTT TTAAATTACA AAAAATATTT TCATCTGAGT TGACACTACA ATACGTTGCT CTAGCTCCAG TGGAATTCTG GGATACAACT GATGGAGAAG ATTTGGCAAA AATATTTTCC TCTAATGGAG GTATTTTAGG AGGTGTTATT GTACCCCCTT TCAATAAAAA AAATACAAGC AAATTTCTAG CAAAGATGCT TCTTCTTGCT AGAAAATATA AATTAGAAAT TGATTTGCAT ATAGATGAAT CAATTATTGA ACCTGGAGCG GGAATAAAAG TTTTATTAGA AACAATAGAA AATTTAAAAA TTAATAGTAT TCCAATCACT TGTAGTCATT TGAGTAGTCT TATTTCTCTA AGTAATAGTG AGATTTTAAA TTTAGGAGAA AAAATGGCTG AGAAAAATAT TAAGGTTGTT GCTTTACCCC TAACAAATTT TTGGCTGCTC AATCGAAGTA ATAAAACTAC TTCATTTAAA AGACCAGTTG CGCCAATAAA GCAATTACAA AAATCACATG TGGATGTATC TCTTGGTAGT GATAATGTTC AAGACCCTTG GTACCCATTT GGTAATTTTG ACCCTTTTTA TATGCTGTCT TGCTCGATGC CTATGCTTCA ACTAAATCCC TGGGAGCGAA TGACTCTATC TTCTATTTTT TTAGCTCCAA GCAGATTATT AAATTTAAAA TGGGATGGTT TAATTAAAAA AGGTTGTCCT GCTGATTTTG TGATTTTAGA TGCACAAAGG TGGGCAGATG TTTTTTCGAG CAATTTAAAG AGAAAAGTAT TTATAAACGG CGATTTACAT TCCTAA
|
Protein sequence | MSNSGTAEVL IPRSLCLIED IDNLIIDVED LCSVSISWED GLVSELKPLK NKITKPKNIL FPRFVETHSH FDKSFTWADF PNLESNYGGA LSVNLEEHKT RTTDKVLERV DKSLKLAIQN GYRAIRSHID TYKGQSIDIW IELFKLQKIF SSELTLQYVA LAPVEFWDTT DGEDLAKIFS SNGGILGGVI VPPFNKKNTS KFLAKMLLLA RKYKLEIDLH IDESIIEPGA GIKVLLETIE NLKINSIPIT CSHLSSLISL SNSEILNLGE KMAEKNIKVV ALPLTNFWLL NRSNKTTSFK RPVAPIKQLQ KSHVDVSLGS DNVQDPWYPF GNFDPFYMLS CSMPMLQLNP WERMTLSSIF LAPSRLLNLK WDGLIKKGCP ADFVILDAQR WADVFSSNLK RKVFINGDLH S
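The protein sequence follows from the gene sequence under translation table 11, which shares its codon-to-amino-acid assignments with the standard code but accepts alternative start codons such as TTG. The sketch below illustrates this on the first 30 bases of the gene sequence. It assumes the input begins at the initiation codon and is in frame; `translate_cds` and the table names are hypothetical helpers, not part of the record or of any library.

```python
from itertools import product

# Standard codon assignments, with codons enumerated in T, C, A, G order.
# Translation table 11 uses the same assignments; it differs in which
# codons may act as initiators.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate an in-frame CDS that starts at its initiation codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    The first codon is always read as methionine, because an alternative
    start codon like TTG still initiates with Met. Translation stops at
    the first stop codon.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGAGTAATT CCGGCACAGC TGAGGTTCTT"))  # MSNSGTAEVL
```

The output matches the first ten residues of the protein sequence, with the leading TTG read as M rather than L.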