Gene P9211_13521 details

Gene Information

Locus tag: P9211_13521
Symbol: codA
ID: 5731758
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 1221159
End bp: 1222454
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 39%
IMG OID: 641285725
Product: putative cytosine deaminase
Protein accession: YP_001551237
Protein GI: 159903893
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
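
The start, end, gene length and protein length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(1221159, 1222454)
print(length)                     # 1296
print(protein_length_aa(length))  # 431
```

Both values match the Gene Length and Protein Length fields listed above.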


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTGGCA AAAGAACAGA AGATTTAGAT ATTTCTTCTC TTGAGGCTTT TGTTCCTAGA 
GGCTTAATTA GTCAAAATTT AGCTTCAAAA GTTACTCTGG ATGGGCTTTC TCCCATTCGT
ATTACTTGGC ATCAGGACCG TATTACTAAG CTGGAAGCAA TTGATATACC TAATAGATCA
TCATTAAAAC TTCTTCTTCC TAGATTCGTT GAACCTCATG CTCATATTGA TAAAGCATTT
ACTTGGCAAA ACTTCCCAAA TTTAGAGGGG AGCTACGAGG GTGCTTTAAA AGCAAACCTT
CAAGAGCATC TAACCCGCTC AGTTATGCAA GTTCGACTTA GTGCGGAGAG ATGTTTACAG
CTTGCTTTGA AAAATGGGCT GAGGGCAATT AGAAGTCATG TTGATAGTTT TGGGCCTTTT
GCAGATCAAA CATGGGAGGT TCTAGTTGAT CTTCAAAATG AATGGAAGCA ATTAATTCAA
TTGCAATTTG TAGCAATGGT CCCTTTGGAT TATTGGAGCA CTAAAAATGG TAAAACTTTT
GCTAGAAAGG TCGCATGTCA TCAAGGACTC TTAGGAGGTG TGCTTACCCC ACCTTTTTGT
AAAAGGGAGA CTAGAGATCA CTTATATAAA TTATTCAACC TGGCTAATGA TTTTGGTTGT
GGAGTTGATT TACATATAGA TGAATCACAT ATTTCTCCAG GTGCTGGATT GAGGCAATTG
ATAAACGTTC TTGATCAGAT TGACTTAAAA GTTCCTTTAA CTTGTAGTCA TTTAAGTAGC
ATGAGCCTTC TAGGGAAACG ATCGTTGGTT GAATGTGCAG ATCGACTGGC CCACCATGAA
GTAAATGTGG TGGCAATGCC TTTAACTAAT TCTTGGTTGC TTGGGGACAA AGAACTAAGC
CCACCCTTAG AAAGGCCATT GGCTCCTATT GGGCATCTTC AAAATGCAGG AGTAAATGTT
GCTATAGGAG GAGATAATGT TCAAGACCCT TGGTTCCCTG GCGGAAACTT TGATCCTCTT
TCTTTGATGG CTTTCTCATT GCCTATAGCG CAGTTGGCAC CATGGAATAG ATTAGGACTT
TCCCCTTTCA CAACTTCAGC TTCCAGAGTA ATGGGAATGG ATTGGGATGG AACTTTTTCA
TTAGGTAGCC CTGCAGACTT TATTTACTTA GAGGCAAAAA ATTGGTCAGA GGTGCTTGCC
TCTCCACCTA AGCGAAATGT AATAATTAAA GGTAAACATC TTCATGAGAA GAGTTTTGAT
TTGAACCAAT CTACTTTTAA AAATTATTTA AAATGA
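
The gene sequence is printed in blocks of ten bases, so the whitespace has to be removed before computing anything from it. The following is a minimal sketch of that clean-up and of a GC percentage calculation. The helper names are hypothetical, and only the first line of the sequence is used as sample input, so the printed value is not the GC content of the full gene.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used only as sample input.
first_line = "ATGATTGGCA AAAGAACAGA AGATTTAGAT ATTTCTTCTC TTGAGGCTTT TGTTCCTAGA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the whole sequence block to `clean_sequence` and then to `gc_percent` would give a figure to compare with the GC content field in the Gene Information section.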
 
Protein sequence
MIGKRTEDLD ISSLEAFVPR GLISQNLASK VTLDGLSPIR ITWHQDRITK LEAIDIPNRS 
SLKLLLPRFV EPHAHIDKAF TWQNFPNLEG SYEGALKANL QEHLTRSVMQ VRLSAERCLQ
LALKNGLRAI RSHVDSFGPF ADQTWEVLVD LQNEWKQLIQ LQFVAMVPLD YWSTKNGKTF
ARKVACHQGL LGGVLTPPFC KRETRDHLYK LFNLANDFGC GVDLHIDESH ISPGAGLRQL
INVLDQIDLK VPLTCSHLSS MSLLGKRSLV ECADRLAHHE VNVVAMPLTN SWLLGDKELS
PPLERPLAPI GHLQNAGVNV AIGGDNVQDP WFPGGNFDPL SLMAFSLPIA QLAPWNRLGL
SPFTTSASRV MGMDWDGTFS LGSPADFIYL EAKNWSEVLA SPPKRNVIIK GKHLHEKSFD
LNQSTFKNYL K