Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9301_14901 |
Symbol | codA |
ID | 4912716 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | - |
Start bp | 1259549 |
End bp | 1260784 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 640161083 |
Product | putative cytosine deaminase |
Protein accession | YP_001091714 |
Protein GI | 126696828 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.352726 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGTAATT CCGGCACAGC TGAGGTTCTT ATTCCCAGAA GTCTTTTTGT AATAGAAGAT ATAGATAACC TCATTATCGA TGTAGAGGAT TTATGTTCAG TTTCAATTAG TTGGGAGGAT GGATTTGTTT CTGAGTTAAA GCCTTTAAAA AATAAAATTA CAAAACCAAA AAATATTTTA TTCCCAAGAT TTGTTGAACC GCATTCGCAT TTTGATAAAT CATTTACATG GGCAGCCTTT CCTAATCTGG AATCAAACTA TGGAGGAGCA TTATCAGCAA ATCTTCAAGA ACATAAAACT AGAAGTACAG ATAAGGTTCT TGAAAGAGTT GATAAATCAT TAAAACTTGC CATACAAAAT GGATACCGAG CAATAAGAAG TCATATTGAT ACATACAAAG GTCAATCTAT TGATATTTGG AGTGAACTTT TTAAATTACA AAAAAAATTT TCATCTGAGT TGACTTTACA ATACGTTGCT TTAGCTCCAT TGGAATTTTG GAATACAACT GATGGAGAAG ATTTGGCAAA AATGTTTTCT TCTAATGGAG GCATTTTAGG GGGTGTTGTT GTACCCCCTT TCAATAAAAA AGACACAAGC AAATTTCTAG CAAAGATGCT TCTTCTTGCG AGTAAAAATA AATTAGAAAT TGATTTGCAT ATAGATGAAT CAATTATTGA ACCTGGAGCT GGAATAAAAG TTTTATTAGA AACAATCGAA AATTTAAATA TTAATAGTAT TCCGATCACT TGTAGCCATT TGAGTAGTCT TATTTCTCTA ACTAATAGAG AGATTTTAAA GTTAGGGGAG AAAATGGCTG AGAGAAATAT TAAAGTTATT GCCTTACCAC TAACAAATTT TTGGCTGCTC AATCGAAGTA ATAAAACTAC TTCATTAAAA AGACCAGTTG CGCCAATAAA GCAATTACAA AATTCACATG TTGATGTATC TCTTGGTAGT GATAATGTGC AAGACCCTTG GTACCCATTT GGTGATTTTG ACCCTTTTTA TACGTTGTCT TGCTCGATGC CTATGCTTCA ACTAAATCCC TGGGAGAGAA TGACTCTATC TTCTATTTTT TTAGCTCCAA GCAGATTATT AAATTTAAAA TGGGATGGTT TAATTAAAAA AGGTTGTCCT GCTGATTTTG TGATTTTAGA TGCACAAAGA TGGGCAGATG TTTTTTCGAG AAATTTAAAG AGAAAAGTAT TTATAAACGG CGATTTATAT TGTTAA
|
Protein sequence | MSNSGTAEVL IPRSLFVIED IDNLIIDVED LCSVSISWED GFVSELKPLK NKITKPKNIL FPRFVEPHSH FDKSFTWAAF PNLESNYGGA LSANLQEHKT RSTDKVLERV DKSLKLAIQN GYRAIRSHID TYKGQSIDIW SELFKLQKKF SSELTLQYVA LAPLEFWNTT DGEDLAKMFS SNGGILGGVV VPPFNKKDTS KFLAKMLLLA SKNKLEIDLH IDESIIEPGA GIKVLLETIE NLNINSIPIT CSHLSSLISL TNREILKLGE KMAERNIKVI ALPLTNFWLL NRSNKTTSLK RPVAPIKQLQ NSHVDVSLGS DNVQDPWYPF GDFDPFYTLS CSMPMLQLNP WERMTLSSIF LAPSRLLNLK WDGLIKKGCP ADFVILDAQR WADVFSRNLK RKVFINGDLY C
|
| |