Gene PMN2A_0870 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPMN2A_0870 
Symbol 
ID3606252 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL2A 
KingdomBacteria 
Replicon accessionNC_007335 
Strand
Start bp1375461 
End bp1376774 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content35% 
IMG OID637687737 
Productputative cytosine deaminase 
Protein accessionYP_292064 
Protein GI72382709 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACTTTAG AAAAAGAAGA ATTCAATTTA TCTACTAAAG CATCAGGTCG TATTGATGTG 
TTGGTACCGA GATGCTTGAT TGGAGAGGGT GCAAATATTC TGGGACTAAC AGTTGACTTT
GAAGGGTTAT GTTCGCTTCA AGTTGAGTGG AGGCATGGGA AAATCTGCTC AATTAAAGGT
TTAAAAGATG CTTCAAAAGT TCCTAATGAA ATCCTTCTAC CTAGATTTTC CGAACCTCAT
GCTCATTTAG ATAAAGCATT TTCATGGTCT CGAGCTCCTA ATTATAAAGG GAGTTATCAA
GAAGCTTTAG TAGCTAATTT AAATGACTAT AAAAGTAGGT CTCAAGGCCA ATTGCTTTTT
AGTGTTGAAA AATCTTTGAA CCTAGCCCTT GTTAATGGTA TCCGTGCAAT TAGATCCCAT
ATAGATAGCT TTGGAGAAAA TGTAATGAGA GATTGGGACC TATTAGAAGA TATTAGAAAA
AAATGGCGAG ACAAAATTTT CTTACAATTT GTGGCTTTAG TCCCATTAGA ATTTTGGCAA
ACGTATGAAG GTGAGCTTTT AGCGCAAAGA GTTGCTTTGA ATGGAGATCT CCTAGGAGGA
GTCATAGCTC CTCCTTTTAA TAAAAAGAAG ACAATTAAGT CTTTATTACA CTTAGTTCAA
CTTGCAAATA GACTTAATTG TGATATTGAT CTTCATATTG ATGAGTCTCA GTCTTGTCCT
GCTGCAGGGG TGAAATTACT TCTTGAAGTA TTAGGCCATA TTAAAAATGA GATATCAATA
ACTTGTAGTC ATTTGAGCAG TATGGCTTTA CTAAGAGAAA AATCGATTTC AAATTTGGCA
AAGGCAATGG CTGAAAAGAA ATTAAATGTT GTTGCTTTAC CACTCACGAA TTCTTGGCTG
CTCGGTAGAA ATGAACGATC TACTTCAATT AAAAGACCTC TAGCTCCAAT ATTTCAACTT
CAAAAGGCTG GGGTTGTTGT ATCTGTAGGA GGAGACAATG TAAATGATGC ATGGTTTCCA
TTCACTAATT TTGATCCAAT AAATTTAATG GCTTTTTCAA TGCCAATTGC TCATTTAACT
CCTTGGGAGA GATTGGGCCT TTCTCCATTT ACTTCTTCCG CAGCAAGTAT TCTTAATCTT
CAATGGGATG GCGTTTTACA AAAAGGAAGT CCTGCCGATT TTGTTTTGTT AGATTCAAAT
AGTTGGGTAA AAGCTTTGTC TGAAAGACCT AAAAGAAGAG TAGTAGTTAA TGGCGAATTT
TTAAATGAAT TGCCTAAAAA CAAAAAATTA ACATTCAACA ATTCTCACCC ATGA
 
Protein sequence
MTLEKEEFNL STKASGRIDV LVPRCLIGEG ANILGLTVDF EGLCSLQVEW RHGKICSIKG 
LKDASKVPNE ILLPRFSEPH AHLDKAFSWS RAPNYKGSYQ EALVANLNDY KSRSQGQLLF
SVEKSLNLAL VNGIRAIRSH IDSFGENVMR DWDLLEDIRK KWRDKIFLQF VALVPLEFWQ
TYEGELLAQR VALNGDLLGG VIAPPFNKKK TIKSLLHLVQ LANRLNCDID LHIDESQSCP
AAGVKLLLEV LGHIKNEISI TCSHLSSMAL LREKSISNLA KAMAEKKLNV VALPLTNSWL
LGRNERSTSI KRPLAPIFQL QKAGVVVSVG GDNVNDAWFP FTNFDPINLM AFSMPIAHLT
PWERLGLSPF TSSAASILNL QWDGVLQKGS PADFVLLDSN SWVKALSERP KRRVVVNGEF
LNELPKNKKL TFNNSHP